1
|
O'Neil PT, Swint‐Kruse L, Fenton AW. Rheostatic contributions to protein stability can obscure a position's functional role. Protein Sci 2024; 33:e5075. [PMID: 38895978 PMCID: PMC11187868 DOI: 10.1002/pro.5075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 05/24/2024] [Accepted: 05/27/2024] [Indexed: 06/21/2024]
Abstract
Rheostat positions, which can be substituted with various amino acids to tune protein function across a range of outcomes, are a developing area for advancing personalized medicine and bioengineering. Current methods cannot accurately predict which proteins contain rheostat positions or their substitution outcomes. To compare the prevalence of rheostat positions in homologs, we previously investigated their occurrence in two pyruvate kinase (PYK) isozymes. Human liver PYK contained numerous rheostat positions that tuned the apparent affinity for the substrate phosphoenolpyruvate (Kapp-PEP) across a wide range. In contrast, no functional rheostat positions were identified in Zymomonas mobilis PYK (ZmPYK). Further, the set of ZmPYK substitutions included an unusually large number that lacked measurable activity. We hypothesized that the inactive substitution variants had reduced protein stability, precluding detection of Kapp-PEP tuning. Using modified buffers, robust enzymatic activity was obtained for 19 previously-inactive ZmPYK substitution variants at three positions. Surprisingly, both previously-inactive and previously-active substitution variants all had Kapp-PEP values close to wild-type. Thus, none of the three positions were functional rheostat positions, and, unlike human liver PYK, ZmPYK's Kapp-PEP remained poorly tunable by single substitutions. To directly assess effects on stability, we performed thermal denaturation experiments for all ZmPYK substitution variants. Many diminished stability, two enhanced stability, and the three positions showed different thermal sensitivity to substitution, with one position acting as a "stability rheostat." The differences between the two PYK homologs raises interesting questions about the underlying mechanism(s) that permit functional tuning by single substitutions in some proteins but not in others.
Collapse
Affiliation(s)
- Pierce T. O'Neil
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansasUSA
| | - Liskin Swint‐Kruse
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansasUSA
| | - Aron W. Fenton
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansasUSA
| |
Collapse
|
2
|
Manning MC, Holcomb RE, Payne RW, Stillahn JM, Connolly BD, Katayama DS, Liu H, Matsuura JE, Murphy BM, Henry CS, Crommelin DJA. Stability of Protein Pharmaceuticals: Recent Advances. Pharm Res 2024:10.1007/s11095-024-03726-x. [PMID: 38937372 DOI: 10.1007/s11095-024-03726-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Accepted: 06/03/2024] [Indexed: 06/29/2024]
Abstract
There have been significant advances in the formulation and stabilization of proteins in the liquid state over the past years since our previous review. Our mechanistic understanding of protein-excipient interactions has increased, allowing one to develop formulations in a more rational fashion. The field has moved towards more complex and challenging formulations, such as high concentration formulations to allow for subcutaneous administration and co-formulation. While much of the published work has focused on mAbs, the principles appear to apply to any therapeutic protein, although mAbs clearly have some distinctive features. In this review, we first discuss chemical degradation reactions. This is followed by a section on physical instability issues. Then, more specific topics are addressed: instability induced by interactions with interfaces, predictive methods for physical stability and interplay between chemical and physical instability. The final parts are devoted to discussions how all the above impacts (co-)formulation strategies, in particular for high protein concentration solutions.'
Collapse
Affiliation(s)
- Mark Cornell Manning
- Legacy BioDesign LLC, Johnstown, CO, USA.
- Department of Chemistry, Colorado State University, Fort Collins, CO, USA.
| | - Ryan E Holcomb
- Legacy BioDesign LLC, Johnstown, CO, USA
- Department of Chemistry, Colorado State University, Fort Collins, CO, USA
| | - Robert W Payne
- Legacy BioDesign LLC, Johnstown, CO, USA
- Department of Chemistry, Colorado State University, Fort Collins, CO, USA
| | - Joshua M Stillahn
- Legacy BioDesign LLC, Johnstown, CO, USA
- Department of Chemistry, Colorado State University, Fort Collins, CO, USA
| | | | | | | | | | | | - Charles S Henry
- Department of Chemistry, Colorado State University, Fort Collins, CO, USA
| | | |
Collapse
|
3
|
Chompunud Na Ayudhya C, Graidist P, Tipmanee V. Role of CSF1R 550th-tryptophan in kusunokinin and CSF1R inhibitor binding and ligand-induced structural effect. Sci Rep 2024; 14:12531. [PMID: 38822100 PMCID: PMC11143223 DOI: 10.1038/s41598-024-63505-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 05/29/2024] [Indexed: 06/02/2024] Open
Abstract
Binding affinity is an important factor in drug design to improve drug-target selectivity and specificity. In this study, in silico techniques based on molecular docking followed by molecular dynamics (MD) simulations were utilized to identify the key residue(s) for CSF1R binding affinity among 14 pan-tyrosine kinase inhibitors and 15 CSF1R-specific inhibitors. We found tryptophan at position 550 (W550) on the CSF1R binding site interacted with the inhibitors' aromatic ring in a π-π way that made the ligands better at binding. Upon W550-Alanine substitution (W550A), the binding affinity of trans-(-)-kusunokinin and imatinib to CSF1R was significantly decreased. However, in terms of structural features, W550 did not significantly affect overall CSF1R structure, but provided destabilizing effect upon mutation. The W550A also did not either cause ligand to change its binding site or conformational changes due to ligand binding. As a result of our findings, the π-π interaction with W550's aromatic ring could be still the choice for increasing binding affinity to CSF1R. Nevertheless, our study showed that the increasing binding to W550 of the design ligand may not ensure CSF1R specificity and inhibition since W550-ligand bound state did not induce significantly conformational change into inactive state.
Collapse
Affiliation(s)
- Chompunud Chompunud Na Ayudhya
- Department of Biomedical Sciences and Biomedical Engineering, Faculty of Medicine, Prince of Songkla University, Hat Yai, 90100, Songkhla, Thailand
| | - Potchanapond Graidist
- Department of Biomedical Sciences and Biomedical Engineering, Faculty of Medicine, Prince of Songkla University, Hat Yai, 90100, Songkhla, Thailand
- Bioactivity Testing Center, Faculty of Medicine, Prince of Songkla University, Hat Yai, 90100, Songkhla, Thailand
| | - Varomyalin Tipmanee
- Department of Biomedical Sciences and Biomedical Engineering, Faculty of Medicine, Prince of Songkla University, Hat Yai, 90100, Songkhla, Thailand.
- Bioactivity Testing Center, Faculty of Medicine, Prince of Songkla University, Hat Yai, 90100, Songkhla, Thailand.
| |
Collapse
|
4
|
Ali M, Greenig M, Oeller M, Atkinson M, Xu X, Sormanni P. Automated optimization of the solubility of a hyper-stable α-amylase. Open Biol 2024; 14:240014. [PMID: 38745462 DOI: 10.1098/rsob.240014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Accepted: 03/27/2024] [Indexed: 05/16/2024] Open
Abstract
Most successes in computational protein engineering to date have focused on enhancing one biophysical trait, while multi-trait optimization remains a challenge. Different biophysical properties are often conflicting, as mutations that improve one tend to worsen the others. In this study, we explored the potential of an automated computational design strategy, called CamSol Combination, to optimize solubility and stability of enzymes without affecting their activity. Specifically, we focus on Bacillus licheniformis α-amylase (BLA), a hyper-stable enzyme that finds diverse application in industry and biotechnology. We validate the computational predictions by producing 10 BLA variants, including the wild-type (WT) and three designed models harbouring between 6 and 8 mutations each. Our results show that all three models have substantially improved relative solubility over the WT, unaffected catalytic rate and retained hyper-stability, supporting the algorithm's capacity to optimize enzymes. High stability and solubility embody enzymes with superior resilience to chemical and physical stresses, enhance manufacturability and allow for high-concentration formulations characterized by extended shelf lives. This ability to readily optimize solubility and stability of enzymes will enable the rapid and reliable generation of highly robust and versatile reagents, poised to contribute to advancements in diverse scientific and industrial domains.
Collapse
Affiliation(s)
- Montader Ali
- Yusuf Hamied Department of Chemistry, University of Cambridge , Cambridge CB2 1EW, UK
| | - Matthew Greenig
- Yusuf Hamied Department of Chemistry, University of Cambridge , Cambridge CB2 1EW, UK
| | - Marc Oeller
- Yusuf Hamied Department of Chemistry, University of Cambridge , Cambridge CB2 1EW, UK
- Proteomics and Signal Transduction, Max Planck Institute of Biochemistry , Martinsried 82152, Germany
| | - Misha Atkinson
- Yusuf Hamied Department of Chemistry, University of Cambridge , Cambridge CB2 1EW, UK
| | - Xing Xu
- Yusuf Hamied Department of Chemistry, University of Cambridge , Cambridge CB2 1EW, UK
| | - Pietro Sormanni
- Yusuf Hamied Department of Chemistry, University of Cambridge , Cambridge CB2 1EW, UK
| |
Collapse
|
5
|
Hao M, Imamichi T, Chang W. Modeling and Analysis of HIV-1 Pol Polyprotein as a Case Study for Predicting Large Polyprotein Structures. Int J Mol Sci 2024; 25:1809. [PMID: 38339086 PMCID: PMC10855158 DOI: 10.3390/ijms25031809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 01/26/2024] [Accepted: 01/29/2024] [Indexed: 02/12/2024] Open
Abstract
Acquired immunodeficiency syndrome (AIDS) is caused by human immunodeficiency virus (HIV). HIV protease, reverse transcriptase, and integrase are targets of current drugs to treat the disease. However, anti-viral drug-resistant strains have emerged quickly due to the high mutation rate of the virus, leading to the demand for the development of new drugs. One attractive target is Gag-Pol polyprotein, which plays a key role in the life cycle of HIV. Recently, we found that a combination of M50I and V151I mutations in HIV-1 integrase can suppress virus release and inhibit the initiation of Gag-Pol autoprocessing and maturation without interfering with the dimerization of Gag-Pol. Additional mutations in integrase or RNase H domain in reverse transcriptase can compensate for the defect. However, the molecular mechanism is unknown. There is no tertiary structure of the full-length HIV-1 Pol protein available for further study. Therefore, we developed a workflow to predict the tertiary structure of HIV-1 NL4.3 Pol polyprotein. The modeled structure has comparable quality compared with the recently published partial HIV-1 Pol structure (PDB ID: 7SJX). Our HIV-1 NL4.3 Pol dimer model is the first full-length Pol tertiary structure. It can provide a structural platform for studying the autoprocessing mechanism of HIV-1 Pol and for developing new potent drugs. Moreover, the workflow can be used to predict other large protein structures that cannot be resolved via conventional experimental methods.
Collapse
Affiliation(s)
| | | | - Weizhong Chang
- Laboratory of Human Retrovirology and Immunoinformatics, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA; (M.H.); (T.I.)
| |
Collapse
|
6
|
Ghaedizadeh S, Zeinali M, Dabirmanesh B, Rasekh B, Khajeh K, Banaei-Moghaddam AM. Rational design engineering of a more thermostable Sulfurihydrogenibium yellowstonense carbonic anhydrase for potential application in carbon dioxide capture technologies. BIOCHIMICA ET BIOPHYSICA ACTA. PROTEINS AND PROTEOMICS 2024; 1872:140962. [PMID: 37716447 DOI: 10.1016/j.bbapap.2023.140962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 08/18/2023] [Accepted: 09/06/2023] [Indexed: 09/18/2023]
Abstract
Implementing hyperthermostable carbonic anhydrases into CO2 capture and storage technologies in order to increase the rate of CO2 absorption from the industrial flue gases is of great importance from technical and economical points of view. The present study employed a combination of in silico tools to further improve thermostability of a known thermostable carbonic anhydrase from Sulfurihydrogenibium yellowstonense. Experimental results showed that our rationally engineered K100G mutant not only retained the overall structure and catalytic efficiency but also showed a 3 °C increase in the melting temperature and a two-fold improvement in the enzyme half-life at 85 °C. Based on the molecular dynamics simulation results, rearrangement of salt bridges and hydrogen interactions network causes a reduction in local flexibility of the K100G variant. In conclusion, our study demonstrated that thermostability can be improved through imposing local structural rigidity by engineering a single-point mutation on the surface of the enzyme.
Collapse
Affiliation(s)
- Shima Ghaedizadeh
- Laboratory of Genomics and Epigenomics (LGE), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Majid Zeinali
- Microbiology and Biotechnology Research Group, Research Institute of Petroleum Industry (RIPI), Tehran, Iran.
| | - Bahareh Dabirmanesh
- Department of Biochemistry, Faculty of Biological Sciences, Tarbiat Modares University, Tehran, Iran
| | - Behnam Rasekh
- Microbiology and Biotechnology Research Group, Research Institute of Petroleum Industry (RIPI), Tehran, Iran
| | - Khosrow Khajeh
- Department of Biochemistry, Faculty of Biological Sciences, Tarbiat Modares University, Tehran, Iran
| | - Ali Mohammad Banaei-Moghaddam
- Laboratory of Genomics and Epigenomics (LGE), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran.
| |
Collapse
|
7
|
Wang S, Tang H, Shan P, Wu Z, Zuo L. ProS-GNN: Predicting effects of mutations on protein stability using graph neural networks. Comput Biol Chem 2023; 107:107952. [PMID: 37643501 DOI: 10.1016/j.compbiolchem.2023.107952] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 08/18/2023] [Accepted: 08/25/2023] [Indexed: 08/31/2023]
Abstract
Predicting protein stability change upon variation through a computational approach is a valuable tool to unveil the mechanisms of mutation-induced drug failure and develop immunotherapy strategies. Some previous machine learning-based techniques exhibit anti-symmetric bias toward destabilizing situations, whereas others struggle with generalization to unseen examples. To address these issues, we propose a gated graph neural network-based approach to predict changes in protein stability upon mutation. The model uses message passing to encode the links between the molecular structure and property after eliminating the non-mutant structure and creating input feature vectors. While doing so, it also incorporates the coordinates of the raw atoms to provide spatial insights into the chemical systems. We test the model on the Ssym, Myoglobin, Broom, and p53 datasets to demonstrate the generalization performance. Compared to existing approaches, our proposed method achieves improved linearity with symmetry in less time. The code for this study is available at: https://github.com/HongzhouTang/Pros-GNN.
Collapse
Affiliation(s)
- Shuyu Wang
- Department of Control Engineering, Northeastern University, Qinhuangdao Campus, Qinhuangdao 066001, China.
| | - Hongzhou Tang
- Department of Control Engineering, Northeastern University, Qinhuangdao Campus, Qinhuangdao 066001, China
| | - Peng Shan
- Department of Control Engineering, Northeastern University, Qinhuangdao Campus, Qinhuangdao 066001, China
| | - Zhaoxia Wu
- Department of Control Engineering, Northeastern University, Qinhuangdao Campus, Qinhuangdao 066001, China
| | - Lei Zuo
- Department of Marine Engineering, University of Michigan, Ann Arbor 48109, USA
| |
Collapse
|
8
|
Musil M, Jezik A, Horackova J, Borko S, Kabourek P, Damborsky J, Bednar D. FireProt 2.0: web-based platform for the fully automated design of thermostable proteins. Brief Bioinform 2023; 25:bbad425. [PMID: 38018911 PMCID: PMC10685400 DOI: 10.1093/bib/bbad425] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 10/25/2023] [Accepted: 11/01/2023] [Indexed: 11/30/2023] Open
Abstract
Thermostable proteins find their use in numerous biomedical and biotechnological applications. However, the computational design of stable proteins often results in single-point mutations with a limited effect on protein stability. However, the construction of stable multiple-point mutants can prove difficult due to the possibility of antagonistic effects between individual mutations. FireProt protocol enables the automated computational design of highly stable multiple-point mutants. FireProt 2.0 builds on top of the previously published FireProt web, retaining the original functionality and expanding it with several new stabilization strategies. FireProt 2.0 integrates the AlphaFold database and the homology modeling for structure prediction, enabling calculations starting from a sequence. Multiple-point designs are constructed using the Bron-Kerbosch algorithm minimizing the antagonistic effect between the individual mutations. Users can newly limit the FireProt calculation to a set of user-defined mutations, run a saturation mutagenesis of the whole protein or select rigidifying mutations based on B-factors. Evolution-based back-to-consensus strategy is complemented by ancestral sequence reconstruction. FireProt 2.0 is significantly faster and a reworked graphical user interface broadens the tool's availability even to users with older hardware. FireProt 2.0 is freely available at http://loschmidt.chemi.muni.cz/fireprotweb.
Collapse
Affiliation(s)
- Milos Musil
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Masaryk University, Brno, Czech Republic
- Department of Information Systems, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic
- International Clinical Research Centre, St. Anne’s University Hospital Brno, Brno, Czech Republic
| | - Andrej Jezik
- Department of Information Systems, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic
| | - Jana Horackova
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Masaryk University, Brno, Czech Republic
| | - Simeon Borko
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Masaryk University, Brno, Czech Republic
- Department of Information Systems, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic
- International Clinical Research Centre, St. Anne’s University Hospital Brno, Brno, Czech Republic
| | - Petr Kabourek
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Masaryk University, Brno, Czech Republic
- International Clinical Research Centre, St. Anne’s University Hospital Brno, Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Masaryk University, Brno, Czech Republic
- International Clinical Research Centre, St. Anne’s University Hospital Brno, Brno, Czech Republic
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Masaryk University, Brno, Czech Republic
- International Clinical Research Centre, St. Anne’s University Hospital Brno, Brno, Czech Republic
| |
Collapse
|
9
|
Sapozhnikov Y, Patel JS, Ytreberg FM, Miller CR. Statistical modeling to quantify the uncertainty of FoldX-predicted protein folding and binding stability. BMC Bioinformatics 2023; 24:426. [PMID: 37953256 PMCID: PMC10642056 DOI: 10.1186/s12859-023-05537-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Accepted: 10/17/2023] [Indexed: 11/14/2023] Open
Abstract
BACKGROUND Computational methods of predicting protein stability changes upon missense mutations are invaluable tools in high-throughput studies involving a large number of protein variants. However, they are limited by a wide variation in accuracy and difficulty of assessing prediction uncertainty. Using a popular computational tool, FoldX, we develop a statistical framework that quantifies the uncertainty of predicted changes in protein stability. RESULTS We show that multiple linear regression models can be used to quantify the uncertainty associated with FoldX prediction for individual mutations. Comparing the performance among models with varying degrees of complexity, we find that the model precision improves significantly when we utilize molecular dynamics simulation as part of the FoldX workflow. Based on the model that incorporates information from molecular dynamics, biochemical properties, as well as FoldX energy terms, we can generally expect upper bounds on the uncertainty of folding stability predictions of ± 2.9 kcal/mol and ± 3.5 kcal/mol for binding stability predictions. The uncertainty for individual mutations varies; our model estimates it using FoldX energy terms, biochemical properties of the mutated residue, as well as the variability among snapshots from molecular dynamics simulation. CONCLUSIONS Using a linear regression framework, we construct models to predict the uncertainty associated with FoldX prediction of stability changes upon mutation. This technique is straightforward and can be extended to other computational methods as well.
Collapse
Affiliation(s)
- Yesol Sapozhnikov
- Program in Bioinformatics and Computational Biology, University of Idaho, Moscow, ID, 83844, USA
| | - Jagdish Suresh Patel
- Department of Chemical and Biological Engineering, University of Idaho, Moscow, ID, 83844, USA
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, ID, 83844, USA
| | - F Marty Ytreberg
- Department of Physics, University of Idaho, Moscow, ID, 83844, USA
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, ID, 83844, USA
| | - Craig R Miller
- Department of Biological Sciences, University of Idaho, Moscow, ID, 83844, USA.
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, ID, 83844, USA.
| |
Collapse
|
10
|
Rothfuss MT, Becht DC, Zeng B, McClelland LJ, Yates-Hansen C, Bowler BE. High-Accuracy Prediction of Stabilizing Surface Mutations to the Three-Helix Bundle, UBA(1), with EmCAST. J Am Chem Soc 2023; 145:22979-22992. [PMID: 37815921 PMCID: PMC10626973 DOI: 10.1021/jacs.3c04966] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/12/2023]
Abstract
The accurate modeling of energetic contributions to protein structure is a fundamental challenge in computational approaches to protein analysis and design. We describe a general computational method, EmCAST (empirical Cα stabilization), to score and optimize the sequence to the structure in proteins. The method relies on an empirical potential derived from the database of the Cα dihedral angle preferences for all possible four-residue sequences, using the data available in the Protein Data Bank. Our method produces stability predictions that naturally correlate one-to-one with the experimental results for solvent-exposed mutation sites. EmCAST predicted four mutations that increased the stability of a three-helix bundle, UBA(1), from 2.4 to 4.8 kcal/mol by optimizing residues in both helices and turns. For a set of eight variants, the predicted and experimental stabilizations correlate very well (R2 = 0.97) with a slope near 1 and with a 0.16 kcal/mol standard error for EmCAST predictions. Tests against literature data for the stability effects of surface-exposed mutations show that EmCAST outperforms the existing stability prediction methods. UBA(1) variants were crystallized to verify and analyze their structures at an atomic resolution. Thermodynamic and kinetic folding experiments were performed to determine the magnitude and mechanism of stabilization. Our method has the potential to enable the rapid, rational optimization of natural proteins, expand the analysis of the sequence/structure relationship, and supplement the existing protein design strategies.
Collapse
Affiliation(s)
- Michael T. Rothfuss
- Department of Chemistry and Biochemistry, University of Montana, Missoula, MT 59812, United States
| | - Dustin C. Becht
- Department of Chemistry and Biochemistry, University of Montana, Missoula, MT 59812, United States
| | - Baisen Zeng
- Center for Biomolecular Structure and Dynamics, University of Montana, Missoula, MT 59812, United States
| | - Levi J. McClelland
- Center for Biomolecular Structure and Dynamics, University of Montana, Missoula, MT 59812, United States
- Division of Biological Sciences, University of Montana, Missoula, MT 59812, United States
| | - Cindee Yates-Hansen
- Center for Biomolecular Structure and Dynamics, University of Montana, Missoula, MT 59812, United States
| | - Bruce E. Bowler
- Department of Chemistry and Biochemistry, University of Montana, Missoula, MT 59812, United States
- Center for Biomolecular Structure and Dynamics, University of Montana, Missoula, MT 59812, United States
| |
Collapse
|
11
|
Dong X, Liu W, Dong Y, Wang K, Li K, Bian L. Metallo-β-lactamase SMB-1 evolves into a more efficient hydrolase under the selective pressure of meropenem. J Inorg Biochem 2023; 247:112323. [PMID: 37478781 DOI: 10.1016/j.jinorgbio.2023.112323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 06/27/2023] [Accepted: 07/06/2023] [Indexed: 07/23/2023]
Abstract
Metallo-β-lactamases (MβLs) are the primary mechanism of resistance to carbapenem antibiotics. To elucidate how MβLs have evolved with the introduction and use of antibiotics, the mutation and evolution of SMB-1 from Serratia marcescens were investigated in microbial evolution plates containing discontinuous meropenem (MEM) concentration gradients. The results revealed 2-point mutations, A242G and S257R; 1 double-site mutation, C240G/E258G; and 3 frameshift mutations, M5, M12, and M13, which are all missense mutations situated at the C-terminus. Compared with that of the wild-type (WT), the minimum inhibitory concentrations (MICs) of MEM for A242G, C240G/E258G, M5, M12, and M13 increased at least 120-fold, and that of S257R increased 8-fold. The catalytic efficiency kcat/Km increased by 365% and 647%, respectively. Concerning the structural changes, the structure at the active site changed from an ordered structure to an unordered conformation. Simultaneously, the flexibility of loop 1 was enhanced. These changes increased the volume of the active site cavity; thus, this was more conducive to exposing the Zn2+ site, facilitating substrate binding and conversion to products. In A242G, structural changes in Gly-242 can be transmitted to the active region via a network of interactions between the side chains of Gly-242 and the amino acid side chains near the active pocket. Together, these results pointed to the process of persistent drug tolerance and resistance, the SMB-1 enzyme evolved into a more exquisite structure with increased flexibility and stability, and stronger hydrolysis activity via genetic mutations and structural changes.
Collapse
Affiliation(s)
- Xiaoting Dong
- College of Life Science, Northwest University, Xi'an 710069, China
| | - Wenli Liu
- College of Life Science, Northwest University, Xi'an 710069, China
| | - Yuxuan Dong
- College of Life Science, Northwest University, Xi'an 710069, China
| | - Kun Wang
- College of Life Science, Northwest University, Xi'an 710069, China
| | - Kewei Li
- College of Life Science, Northwest University, Xi'an 710069, China
| | - Liujiao Bian
- College of Life Science, Northwest University, Xi'an 710069, China.
| |
Collapse
|
12
|
Stern JA, Free TJ, Stern KL, Gardiner S, Dalley NA, Bundy BC, Price JL, Wingate D, Della Corte D. A probabilistic view of protein stability, conformational specificity, and design. Sci Rep 2023; 13:15493. [PMID: 37726313 PMCID: PMC10509192 DOI: 10.1038/s41598-023-42032-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 09/04/2023] [Indexed: 09/21/2023] Open
Abstract
Various approaches have used neural networks as probabilistic models for the design of protein sequences. These "inverse folding" models employ different objective functions, which come with trade-offs that have not been assessed in detail before. This study introduces probabilistic definitions of protein stability and conformational specificity and demonstrates the relationship between these chemical properties and the [Formula: see text] Boltzmann probability objective. This links the Boltzmann probability objective function to experimentally verifiable outcomes. We propose a novel sequence decoding algorithm, referred to as "BayesDesign", that leverages Bayes' Rule to maximize the [Formula: see text] objective instead of the [Formula: see text] objective common in inverse folding models. The efficacy of BayesDesign is evaluated in the context of two protein model systems, the NanoLuc enzyme and the WW structural motif. Both BayesDesign and the baseline ProteinMPNN algorithm increase the thermostability of NanoLuc and increase the conformational specificity of WW. The possible sources of error in the model are analyzed.
Collapse
Affiliation(s)
- Jacob A Stern
- Department of Computer Science, Brigham Young University, Provo, UT, USA
| | - Tyler J Free
- Department of Chemical Engineering, Brigham Young University, Provo, UT, USA
| | - Kimberlee L Stern
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT, USA
| | - Spencer Gardiner
- Department of Physics and Astronomy, Brigham Young University, Provo, UT, USA
| | - Nicholas A Dalley
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT, USA
| | - Bradley C Bundy
- Department of Chemical Engineering, Brigham Young University, Provo, UT, USA
| | - Joshua L Price
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT, USA
| | - David Wingate
- Department of Computer Science, Brigham Young University, Provo, UT, USA
| | - Dennis Della Corte
- Department of Physics and Astronomy, Brigham Young University, Provo, UT, USA.
| |
Collapse
|
13
|
Rosace A, Bennett A, Oeller M, Mortensen MM, Sakhnini L, Lorenzen N, Poulsen C, Sormanni P. Automated optimisation of solubility and conformational stability of antibodies and proteins. Nat Commun 2023; 14:1937. [PMID: 37024501 PMCID: PMC10079162 DOI: 10.1038/s41467-023-37668-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Accepted: 03/24/2023] [Indexed: 04/08/2023] Open
Abstract
Biologics, such as antibodies and enzymes, are crucial in research, biotechnology, diagnostics, and therapeutics. Often, biologics with suitable functionality are discovered, but their development is impeded by developability issues. Stability and solubility are key biophysical traits underpinning developability potential, as they determine aggregation, correlate with production yield and poly-specificity, and are essential to access parenteral and oral delivery. While advances for the optimisation of individual traits have been made, the co-optimization of multiple traits remains highly problematic and time-consuming, as mutations that improve one property often negatively impact others. In this work, we introduce a fully automated computational strategy for the simultaneous optimisation of conformational stability and solubility, which we experimentally validate on six antibodies, including two approved therapeutics. Our results on 42 designs demonstrate that the computational procedure is highly effective at improving developability potential, while not affecting antigen-binding. We make the method available as a webserver at www-cohsoftware.ch.cam.ac.uk.
Collapse
Affiliation(s)
- Angelo Rosace
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield road, CB2 1EW, Cambridge, UK
- Master in Bioinformatics for Health Sciences, Universitat Pompeu Fabra, Barcelona, Catalonia, Spain
- Institute for Research in Biomedicine (IRB), The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain
| | - Anja Bennett
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield road, CB2 1EW, Cambridge, UK
- Department of Mammalian Expression, Global Research Technologies, Novo Nordisk A/S, Novo Nordisk Park 1, 2760, Måløv, Denmark
- BRIC, Faculty of Health and Medical Sciences, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark
| | - Marc Oeller
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield road, CB2 1EW, Cambridge, UK
| | - Mie M Mortensen
- Department of Purification Technologies, Global Research Technologies, Novo Nordisk A/S, Novo Nordisk Park 1, 2760, Måløv, Denmark
- Faculty of Engineering and Science, Department of Biotechnology, Chemistry and Environmental Engineering, University of Aalborg, Fredrik Bajers Vej 7H, 9220, Aalborg, Denmark
| | - Laila Sakhnini
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield road, CB2 1EW, Cambridge, UK
- Department of Biophysics and Injectable Formulation 2, Global Research Technologies, Novo Nordisk A/S, Måløv, 2760, Denmark
| | - Nikolai Lorenzen
- Department of Biophysics and Injectable Formulation 2, Global Research Technologies, Novo Nordisk A/S, Måløv, 2760, Denmark
| | - Christian Poulsen
- Department of Mammalian Expression, Global Research Technologies, Novo Nordisk A/S, Novo Nordisk Park 1, 2760, Måløv, Denmark
| | - Pietro Sormanni
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield road, CB2 1EW, Cambridge, UK.
| |
Collapse
|
14
|
Johansson KE, Lindorff-Larsen K, Winther JR. Global Analysis of Multi-Mutants to Improve Protein Function. J Mol Biol 2023; 435:168034. [PMID: 36863661 DOI: 10.1016/j.jmb.2023.168034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 02/22/2023] [Accepted: 02/22/2023] [Indexed: 03/04/2023]
Abstract
The identification of amino acid substitutions that both enhance the stability and function of a protein is a key challenge in protein engineering. Technological advances have enabled assaying thousands of protein variants in a single high-throughput experiment, and more recent studies use such data in protein engineering. We present a Global Multi-Mutant Analysis (GMMA) that exploits the presence of multiply-substituted variants to identify individual amino acid substitutions that are beneficial for the stability and function across a large library of protein variants. We have applied GMMA to a previously published experiment reporting on >54,000 variants of green fluorescent protein (GFP), each with known fluorescence output, and each carrying 1-15 amino acid substitutions (Sarkisyan et al., 2016). The GMMA method achieves a good fit to this dataset while being analytically transparent. We show experimentally that the six top-ranking substitutions progressively enhance GFP. More broadly, using only a single experiment as input our analysis recovers nearly all the substitutions previously reported to be beneficial for GFP folding and function. In conclusion, we suggest that large libraries of multiply-substituted variants may provide a unique source of information for protein engineering.
Collapse
Affiliation(s)
- Kristoffer E Johansson
- Linderstrøm-Lang Centre for Protein Science, Section for Biomolecular Sciences, Department of Biology of (University of Copenhagen), Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark.
| | - Kresten Lindorff-Larsen
- Linderstrøm-Lang Centre for Protein Science, Section for Biomolecular Sciences, Department of Biology of (University of Copenhagen), Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark.
| | - Jakob R Winther
- Linderstrøm-Lang Centre for Protein Science, Section for Biomolecular Sciences, Department of Biology of (University of Copenhagen), Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark.
| |
Collapse
|
15
|
Anderson DM, Jayanthi LP, Gosavi S, Meiering EM. Engineering the kinetic stability of a β-trefoil protein by tuning its topological complexity. Front Mol Biosci 2023; 10:1021733. [PMID: 36845544 PMCID: PMC9945329 DOI: 10.3389/fmolb.2023.1021733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 01/02/2023] [Indexed: 02/11/2023] Open
Abstract
Kinetic stability, defined as the rate of protein unfolding, is central to determining the functional lifetime of proteins, both in nature and in wide-ranging medical and biotechnological applications. Further, high kinetic stability is generally correlated with high resistance against chemical and thermal denaturation, as well as proteolytic degradation. Despite its significance, specific mechanisms governing kinetic stability remain largely unknown, and few studies address the rational design of kinetic stability. Here, we describe a method for designing protein kinetic stability that uses protein long-range order, absolute contact order, and simulated free energy barriers of unfolding to quantitatively analyze and predict unfolding kinetics. We analyze two β-trefoil proteins: hisactophilin, a quasi-three-fold symmetric natural protein with moderate stability, and ThreeFoil, a designed three-fold symmetric protein with extremely high kinetic stability. The quantitative analysis identifies marked differences in long-range interactions across the protein hydrophobic cores that partially account for the differences in kinetic stability. Swapping the core interactions of ThreeFoil into hisactophilin increases kinetic stability with close agreement between predicted and experimentally measured unfolding rates. These results demonstrate the predictive power of readily applied measures of protein topology for altering kinetic stability and recommend core engineering as a tractable target for rationally designing kinetic stability that may be widely applicable.
Collapse
Affiliation(s)
| | - Lakshmi P. Jayanthi
- Simons Centre for the Study of Living Machines, National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, India
| | - Shachi Gosavi
- Simons Centre for the Study of Living Machines, National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, India
| | - Elizabeth M. Meiering
- Department of Chemistry, University of Waterloo, Waterloo, ON, Canada,*Correspondence: Elizabeth M. Meiering,
| |
Collapse
|
16
|
Clifton BE, Kozome D, Laurino P. Efficient Exploration of Sequence Space by Sequence-Guided Protein Engineering and Design. Biochemistry 2023; 62:210-220. [PMID: 35245020 DOI: 10.1021/acs.biochem.1c00757] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The rapid growth of sequence databases over the past two decades means that protein engineers faced with optimizing a protein for any given task will often have immediate access to a vast number of related protein sequences. These sequences encode information about the evolutionary history of the protein and the underlying sequence requirements to produce folded, stable, and functional protein variants. Methods that can take advantage of this information are an increasingly important part of the protein engineering tool kit. In this Perspective, we discuss the utility of sequence data in protein engineering and design, focusing on recent advances in three main areas: the use of ancestral sequence reconstruction as an engineering tool to generate thermostable and multifunctional proteins, the use of sequence data to guide engineering of multipoint mutants by structure-based computational protein design, and the use of unlabeled sequence data for unsupervised and semisupervised machine learning, allowing the generation of diverse and functional protein sequences in unexplored regions of sequence space. Altogether, these methods enable the rapid exploration of sequence space within regions enriched with functional proteins and therefore have great potential for accelerating the engineering of stable, functional, and diverse proteins for industrial and biomedical applications.
Collapse
Affiliation(s)
- Ben E Clifton
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Dan Kozome
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Paola Laurino
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| |
Collapse
|
17
|
Haji-Allahverdipoor K, Jalali Javaran M, Rashidi Monfared S, Khadem-Erfan MB, Nikkhoo B, Bahrami Rad Z, Eslami H, Nasseri S. Insights Into The Effects of Amino Acid Substitutions on The Stability of Reteplase Structure: A Molecular Dynamics Simulation Study. IRANIAN JOURNAL OF BIOTECHNOLOGY 2023; 21:e3175. [PMID: 36811105 PMCID: PMC9938932 DOI: 10.30498/ijb.2022.308798.3175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 07/06/2022] [Indexed: 02/24/2023]
Abstract
Background Reteplase (recombinant plasminogen activator, r-PA) is a recombinant protein designed to imitate the endogenous tissue plasminogen activator and catalyze the plasmin production. It is known that the application of reteplase is limited by the complex production processes and protein's stability challenges. Computational redesign of proteins has gained momentum in recent years, particularly as a powerful tool for improving protein stability and consequently its production efficiency. Hence, in the current study, we implemented computational approaches to improve r-PA conformational stability, which fairly correlates with protein's resistance to proteolysis. Objectives The current study was developed in order to evaluate the effect of amino acid substitutions on the stability of reteplase structure using molecular dynamic simulations and computational predictions. Materials and Methods Several web servers designed for mutation analysis were utilized to select appropriate mutations. Additionally, the experimentally reported mutation, R103S, converting wild type r-PA into non-cleavable form, was also employed. Firstly, mutant collection, consisting of 15 structures, was constructed based on the combinations of four designated mutations. Then, 3D structures were generated using MODELLER. Finally, 17 independent 20-ns molecular dynamics (MD) simulations were conducted and different analysis were performed like root-mean-square deviation (RMSD), root-mean-square fluctuations (RMSF), secondary structure analysis, number of hydrogen bonds, principal components analysis (PCA), eigenvector projection, and density analysis. Results Predicted mutations successfully compensated the more flexible conformation caused by R103S substitution, so, improved conformational stability was analyzed from MD simulations. In particular, R103S/A286I/G322I indicated the best results and remarkably enhanced the protein stability. Conclusion The conformational stability conferred by these mutations will probably lead to more protection of r-PA in protease-rich environments in various recombinant systems and potentially enhance its production and expression level.
Collapse
Affiliation(s)
- Kaveh Haji-Allahverdipoor
- Cellular and Molecular Research Center, Research Institute for Health Development, Kurdistan University of Medical Sciences, Sanandaj, Iran
| | - Mokhtar Jalali Javaran
- Department of Biotechnology, Faculty of Agriculture, Tarbiat Modares University, Tehran, Iran
| | - Sajad Rashidi Monfared
- Department of Biotechnology, Faculty of Agriculture, Tarbiat Modares University, Tehran, Iran
| | - Mohamad Bagher Khadem-Erfan
- Cellular and Molecular Research Center, Research Institute for Health Development, Kurdistan University of Medical Sciences, Sanandaj, Iran
| | - Bahram Nikkhoo
- Cellular and Molecular Research Center, Research Institute for Health Development, Kurdistan University of Medical Sciences, Sanandaj, Iran
| | - Zhila Bahrami Rad
- Cellular and Molecular Research Center, Research Institute for Health Development, Kurdistan University of Medical Sciences, Sanandaj, Iran
| | - Habib Eslami
- Department of Pharmacology and Toxicology, School of Pharmacy, Hormozgan University of Medicinal sciences, Bandar Abbas, Iran
| | - Sherko Nasseri
- Cellular and Molecular Research Center, Research Institute for Health Development, Kurdistan University of Medical Sciences, Sanandaj, Iran
| |
Collapse
|
18
|
Svilenov HL, Arosio P, Menzen T, Tessier P, Sormanni P. Approaches to expand the conventional toolbox for discovery and selection of antibodies with drug-like physicochemical properties. MAbs 2023; 15:2164459. [PMID: 36629855 PMCID: PMC9839375 DOI: 10.1080/19420862.2022.2164459] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 12/22/2022] [Accepted: 12/29/2022] [Indexed: 01/12/2023] Open
Abstract
Antibody drugs should exhibit not only high-binding affinity for their target antigens but also favorable physicochemical drug-like properties. Such drug-like biophysical properties are essential for the successful development of antibody drug products. The traditional approaches used in antibody drug development require significant experimentation to produce, optimize, and characterize many candidates. Therefore, it is attractive to integrate new methods that can optimize the process of selecting antibodies with both desired target-binding and drug-like biophysical properties. Here, we summarize a selection of techniques that can complement the conventional toolbox used to de-risk antibody drug development. These techniques can be integrated at different stages of the antibody development process to reduce the frequency of physicochemical liabilities in antibody libraries during initial discovery and to co-optimize multiple antibody features during early-stage antibody engineering and affinity maturation. Moreover, we highlight biophysical and computational approaches that can be used to predict physical degradation pathways relevant for long-term storage and in-use stability to reduce the need for extensive experimentation.
Collapse
Affiliation(s)
- Hristo L. Svilenov
- Laboratory of General Biochemistry and Physical Pharmacy, Faculty of Pharmaceutical Sciences, Ghent University, Gent, Belgium
| | - Paolo Arosio
- Department of Chemistry and Applied Biosciences, ETH Zürich, Zürich, Switzerland
| | - Tim Menzen
- Coriolis Pharma Research GmbH, Martinsried, 82152, Germany
| | - Peter Tessier
- Department of Pharmaceutical Sciences, University of Michigan, Ann Arbor, MI, USA
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, USA
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI, USA
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI, USA
| | - Pietro Sormanni
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK
| |
Collapse
|
19
|
Conformational Stability and Denaturation Processes of Proteins Investigated by Electrophoresis under Extreme Conditions. Molecules 2022; 27:molecules27206861. [PMID: 36296453 PMCID: PMC9610776 DOI: 10.3390/molecules27206861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 10/10/2022] [Accepted: 10/10/2022] [Indexed: 11/17/2022] Open
Abstract
The functional structure of proteins results from marginally stable folded conformations. Reversible unfolding, irreversible denaturation, and deterioration can be caused by chemical and physical agents due to changes in the physicochemical conditions of pH, ionic strength, temperature, pressure, and electric field or due to the presence of a cosolvent that perturbs the delicate balance between stabilizing and destabilizing interactions and eventually induces chemical modifications. For most proteins, denaturation is a complex process involving transient intermediates in several reversible and eventually irreversible steps. Knowledge of protein stability and denaturation processes is mandatory for the development of enzymes as industrial catalysts, biopharmaceuticals, analytical and medical bioreagents, and safe industrial food. Electrophoresis techniques operating under extreme conditions are convenient tools for analyzing unfolding transitions, trapping transient intermediates, and gaining insight into the mechanisms of denaturation processes. Moreover, quantitative analysis of electrophoretic mobility transition curves allows the estimation of the conformational stability of proteins. These approaches include polyacrylamide gel electrophoresis and capillary zone electrophoresis under cold, heat, and hydrostatic pressure and in the presence of non-ionic denaturing agents or stabilizers such as polyols and heavy water. Lastly, after exposure to extremes of physical conditions, electrophoresis under standard conditions provides information on irreversible processes, slow conformational drifts, and slow renaturation processes. The impressive developments of enzyme technology with multiple applications in fine chemistry, biopharmaceutics, and nanomedicine prompted us to revisit the potentialities of these electrophoretic approaches. This feature review is illustrated with published and unpublished results obtained by the authors on cholinesterases and paraoxonase, two physiologically and toxicologically important enzymes.
Collapse
|
20
|
Thieker DF, Maguire JB, Kudlacek ST, Leaver‐Fay A, Lyskov S, Kuhlman B. Stabilizing proteins, simplified: A Rosetta-based webtool for predicting favorable mutations. Protein Sci 2022; 31:e4428. [PMID: 36173174 PMCID: PMC9490798 DOI: 10.1002/pro.4428] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Revised: 08/06/2022] [Accepted: 08/21/2022] [Indexed: 11/07/2022]
Abstract
Many proteins have low thermodynamic stability, which can lead to low expression yields and limit functionality in research, industrial and clinical settings. This article introduces two, web-based tools that use the high-resolution structure of a protein along with the Rosetta molecular modeling program to predict stabilizing mutations. The protocols were recently applied to three genetically and structurally distinct proteins and successfully predicted mutations that improved thermal stability and/or protein yield. In all three cases, combining the stabilizing mutations raised the protein unfolding temperatures by more than 20°C. The first protocol evaluates point mutations and can generate a site saturation mutagenesis heatmap. The second identifies mutation clusters around user-defined positions. Both applications only require a protein structure and are particularly valuable when a deep multiple sequence alignment is not available. These tools were created to simplify protein engineering and enable research that would otherwise be infeasible due to poor expression and stability of the native molecule.
Collapse
Affiliation(s)
- David F. Thieker
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
| | - Jack B. Maguire
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
| | - Stephan T. Kudlacek
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
| | - Andrew Leaver‐Fay
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
| | - Sergey Lyskov
- Department of Chemical and Biomolecular EngineeringJohns Hopkins UniversityBaltimoreMarylandUSA
| | - Brian Kuhlman
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
- Lineburger Comprehensive Cancer CenterUniversity of North Carolina at Chapel HillChapel HillNorth CarolinaUSA
| |
Collapse
|
21
|
Mahmood MS, Afzal M, Batool H, Saif A, Aqdas T, Ashraf NM, Saleem M. Screening of Pathogenic Missense Single Nucleotide Variants From LHPP Gene Associated With the Hepatocellular Carcinoma: An In silico Approach. Bioinform Biol Insights 2022; 16:11779322221115547. [PMID: 35966807 PMCID: PMC9373111 DOI: 10.1177/11779322221115547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 06/11/2022] [Indexed: 11/15/2022] Open
Abstract
LHPP gene encodes a phospholysine phosphohistidine inorganic pyrophosphate phosphatase, which functions as a tumor-suppressor protein. The tumor suppression by this protein has been confirmed in various cancers, including hepatocellular carcinoma (HCC). LHPP downregulation promotes cell growth and proliferation by modulating the PI3K/AKT signaling pathway. This study identifies potentially deleterious missense single nucleotide variants (SNVs) associated with the LHPP gene using multiple computational tools based on different algorithms. A total of 4 destabilizing mutants are identified as L22P, I212T, G227R, and G236R, from the conserved region of the phosphatase. The 3-dimensional (3D) modeling and structural comparison of variants with the native protein reveals significant structural and conformational variations after mutations, suggesting disruption in the function of phospholysine phosphohistidine inorganic pyrophosphate phosphatase. The identified mutations might, therefore, participate in the cause of HCC.
Collapse
Affiliation(s)
- Malik Siddique Mahmood
- School of Biochemistry & Biotechnology, University of the Punjab, Lahore, Pakistan.,Department of Biochemistry, NUR International University, Lahore, Pakistan
| | - Maryam Afzal
- School of Biochemistry & Biotechnology, University of the Punjab, Lahore, Pakistan
| | - Hina Batool
- Department of Life Sciences, University of Management and Technology, Lahore, Pakistan
| | - Amara Saif
- Department of Life Sciences, University of Management and Technology, Lahore, Pakistan
| | - Tahreem Aqdas
- School of Biochemistry & Biotechnology, University of the Punjab, Lahore, Pakistan
| | - Naeem Mahmood Ashraf
- Department of Biochemistry & Biotechnology, University of Gujrat, Gujrat, Pakistan
| | - Mahjabeen Saleem
- School of Biochemistry & Biotechnology, University of the Punjab, Lahore, Pakistan
| |
Collapse
|
22
|
Rahban M, Zolghadri S, Salehi N, Ahmad F, Haertlé T, Rezaei-Ghaleh N, Sawyer L, Saboury AA. Thermal stability enhancement: Fundamental concepts of protein engineering strategies to manipulate the flexible structure. Int J Biol Macromol 2022; 214:642-654. [DOI: 10.1016/j.ijbiomac.2022.06.154] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 06/22/2022] [Accepted: 06/23/2022] [Indexed: 01/28/2023]
|
23
|
Medina-Ortiz D, Contreras S, Amado-Hinojosa J, Torres-Almonacid J, Asenjo JA, Navarrete M, Olivera-Nappa Á. Generalized Property-Based Encoders and Digital Signal Processing Facilitate Predictive Tasks in Protein Engineering. Front Mol Biosci 2022; 9:898627. [PMID: 35911960 PMCID: PMC9329607 DOI: 10.3389/fmolb.2022.898627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Accepted: 06/23/2022] [Indexed: 11/13/2022] Open
Abstract
Computational methods in protein engineering often require encoding amino acid sequences, i.e., converting them into numeric arrays. Physicochemical properties are a typical choice to define encoders, where we replace each amino acid by its value for a given property. However, what property (or group thereof) is best for a given predictive task remains an open problem. In this work, we generalize property-based encoding strategies to maximize the performance of predictive models in protein engineering. First, combining text mining and unsupervised learning, we partitioned the AAIndex database into eight semantically-consistent groups of properties. We then applied a non-linear PCA within each group to define a single encoder to represent it. Then, in several case studies, we assess the performance of predictive models for protein and peptide function, folding, and biological activity, trained using the proposed encoders and classical methods (One Hot Encoder and TAPE embeddings). Models trained on datasets encoded with our encoders and converted to signals through the Fast Fourier Transform (FFT) increased their precision and reduced their overfitting substantially, outperforming classical approaches in most cases. Finally, we propose a preliminary methodology to create de novo sequences with desired properties. All these results offer simple ways to increase the performance of general and complex predictive tasks in protein engineering without increasing their complexity.
Collapse
Affiliation(s)
- David Medina-Ortiz
- Centre for Biotechnology and Bioengineering, Universidad de Chile, Santiago, Chile
- Departamento de Ingeniería en Computación, Universidad de Magallanes, Punta Arenas, Chile
| | - Sebastian Contreras
- Max Planck Institute for Dynamics and Self-Organization, Göttingen, Germany
- *Correspondence: Sebastian Contreras, ; Álvaro Olivera-Nappa,
| | - Juan Amado-Hinojosa
- Centre for Biotechnology and Bioengineering, Universidad de Chile, Santiago, Chile
- Departamento de Ingeniería Química, Biotecnología y Materiales, Facultad de Ciencias Físicas y Matemáticas, Universidad de Chile, Santiago, Chile
| | - Jorge Torres-Almonacid
- Departamento de Ingeniería en Computación, Universidad de Magallanes, Punta Arenas, Chile
| | - Juan A. Asenjo
- Centre for Biotechnology and Bioengineering, Universidad de Chile, Santiago, Chile
- Departamento de Ingeniería Química, Biotecnología y Materiales, Facultad de Ciencias Físicas y Matemáticas, Universidad de Chile, Santiago, Chile
| | | | - Álvaro Olivera-Nappa
- Centre for Biotechnology and Bioengineering, Universidad de Chile, Santiago, Chile
- Departamento de Ingeniería Química, Biotecnología y Materiales, Facultad de Ciencias Físicas y Matemáticas, Universidad de Chile, Santiago, Chile
- *Correspondence: Sebastian Contreras, ; Álvaro Olivera-Nappa,
| |
Collapse
|
24
|
Deol HK, Broom HR, Sienbeneichler B, Lee B, Leonenko Z, Meiering EM. Immature ALS-associated mutant superoxide dismutases form variable aggregate structures through distinct oligomerization processes. Biophys Chem 2022; 288:106844. [DOI: 10.1016/j.bpc.2022.106844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 05/31/2022] [Accepted: 06/02/2022] [Indexed: 11/15/2022]
|
25
|
Casadio R, Savojardo C, Fariselli P, Capriotti E, Martelli PL. Turning Failures into Applications: The Problem of Protein ΔΔG Prediction. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2449:169-185. [PMID: 35507262 DOI: 10.1007/978-1-0716-2095-3_6] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
After nearly two decades of research in the field of computational methods based on machine learning and knowledge-based potentials for ΔG and ΔΔG prediction upon variations, we now realize that all the approaches are poorly performing when tested on specific cases and that there is large space for improvement. Why this is so? Is it wrong the underlying assumption that experimental protein thermodynamics in solution reflects the thermodynamics of a single protein? Both machine learning and knowledge-based computational methods are rigorous and we know the solid theory behind. We are now in a critical situation, which suggests that predictions of protein instability upon variation should be considered with care. In the following, we will show how to cope with the problem of understanding which protein positions may be of interest for biotechnological and biomedical purposes. By applying a consensus procedure, we indicate possible strategies for the result interpretation.
Collapse
Affiliation(s)
- Rita Casadio
- Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy.
| | - Castrense Savojardo
- Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | - Piero Fariselli
- Department of Medical Sciences, University of Torino, Turin, Italy
| | - Emidio Capriotti
- Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | - Pier Luigi Martelli
- Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| |
Collapse
|
26
|
Muronetz VI, Kudryavtseva SS, Leisi EV, Kurochkina LP, Barinova KV, Schmalhausen EV. Regulation by Different Types of Chaperones of Amyloid Transformation of Proteins Involved in the Development of Neurodegenerative Diseases. Int J Mol Sci 2022; 23:ijms23052747. [PMID: 35269889 PMCID: PMC8910861 DOI: 10.3390/ijms23052747] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 02/21/2022] [Accepted: 02/28/2022] [Indexed: 02/06/2023] Open
Abstract
The review highlights various aspects of the influence of chaperones on amyloid proteins associated with the development of neurodegenerative diseases and includes studies conducted in our laboratory. Different sections of the article are devoted to the role of chaperones in the pathological transformation of alpha-synuclein and the prion protein. Information about the interaction of the chaperonins GroE and TRiC as well as polymer-based artificial chaperones with amyloidogenic proteins is summarized. Particular attention is paid to the effect of blocking chaperones by misfolded and amyloidogenic proteins. It was noted that the accumulation of functionally inactive chaperones blocked by misfolded proteins might cause the formation of amyloid aggregates and prevent the disassembly of fibrillar structures. Moreover, the blocking of chaperones by various forms of amyloid proteins might lead to pathological changes in the vital activity of cells due to the impaired folding of newly synthesized proteins and their subsequent processing. The final section of the article discusses both the little data on the role of gut microbiota in the propagation of synucleinopathies and prion diseases and the possible involvement of the bacterial chaperone GroE in these processes.
Collapse
Affiliation(s)
- Vladimir I. Muronetz
- Belozersky Institute of Physico Chemical Biology, Lomonosov Moscow State University, 119991 Moscow, Russia; (L.P.K.); (K.V.B.); (E.V.S.)
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 119991 Moscow, Russia;
- Correspondence:
| | - Sofia S. Kudryavtseva
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, 119991 Moscow, Russia;
- Faculty of Biology, Lomonosov Moscow State University, 119991 Moscow, Russia;
| | - Evgeniia V. Leisi
- Faculty of Biology, Lomonosov Moscow State University, 119991 Moscow, Russia;
| | - Lidia P. Kurochkina
- Belozersky Institute of Physico Chemical Biology, Lomonosov Moscow State University, 119991 Moscow, Russia; (L.P.K.); (K.V.B.); (E.V.S.)
| | - Kseniya V. Barinova
- Belozersky Institute of Physico Chemical Biology, Lomonosov Moscow State University, 119991 Moscow, Russia; (L.P.K.); (K.V.B.); (E.V.S.)
| | - Elena V. Schmalhausen
- Belozersky Institute of Physico Chemical Biology, Lomonosov Moscow State University, 119991 Moscow, Russia; (L.P.K.); (K.V.B.); (E.V.S.)
| |
Collapse
|
27
|
Capturing a Crucial ‘Disorder-to-Order Transition’ at the Heart of the Coronavirus Molecular Pathology—Triggered by Highly Persistent, Interchangeable Salt-Bridges. Vaccines (Basel) 2022; 10:vaccines10020301. [PMID: 35214759 PMCID: PMC8875383 DOI: 10.3390/vaccines10020301] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Revised: 01/27/2022] [Accepted: 02/05/2022] [Indexed: 02/05/2023] Open
Abstract
The COVID-19 origin debate has greatly been influenced by genome comparison studies of late, revealing the emergence of the Furin-like cleavage site at the S1/S2 junction of the SARS-CoV-2 Spike (FLCSSpike) containing its 681PRRAR685 motif, absent in other related respiratory viruses. Being the rate-limiting (i.e., the slowest) step, the host Furin cleavage is instrumental in the abrupt increase in transmissibility in COVID-19, compared to earlier onsets of respiratory viral diseases. In such a context, the current paper entraps a ‘disorder-to-order transition’ of the FLCSSpike (concomitant to an entropy arrest) upon binding to Furin. The interaction clearly seems to be optimized for a more efficient proteolytic cleavage in SARS-CoV-2. The study further shows the formation of dynamically interchangeable and persistent networks of salt-bridges at the Spike–Furin interface in SARS-CoV-2 involving the three arginines (R682, R683, R685) of the FLCSSpike with several anionic residues (E230, E236, D259, D264, D306) coming from Furin, strategically distributed around its catalytic triad. Multiplicity and structural degeneracy of plausible salt-bridge network archetypes seem to be the other key characteristic features of the Spike–Furin binding in SARS-CoV-2, allowing the system to breathe—a trademark of protein disorder transitions. Interestingly, with respect to the homologous interaction in SARS-CoV (2002/2003) taken as a baseline, the Spike–Furin binding events, generally, in the coronavirus lineage, seems to have preference for ionic bond formation, even with a lesser number of cationic residues at their potentially polybasic FLCSSpike patches. The interaction energies are suggestive of characteristic metastabilities attributed to Spike–Furin interactions, generally to the coronavirus lineage, which appears to be favorable for proteolytic cleavages targeted at flexible protein loops. The current findings not only offer novel mechanistic insights into the coronavirus molecular pathology and evolution, but also add substantially to the existing theories of proteolytic cleavages.
Collapse
|
28
|
Assessment of Therapeutic Antibody Developability by Combinations of In Vitro and In Silico Methods. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2313:57-113. [PMID: 34478132 DOI: 10.1007/978-1-0716-1450-1_4] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Although antibodies have become the fastest-growing class of therapeutics on the market, it is still challenging to develop them for therapeutic applications, which often require these molecules to withstand stresses that are not present in vivo. We define developability as the likelihood of an antibody candidate with suitable functionality to be developed into a manufacturable, stable, safe, and effective drug that can be formulated to high concentrations while retaining a long shelf life. The implementation of reliable developability assessments from the early stages of antibody discovery enables flagging and deselection of potentially problematic candidates, while focussing available resources on the development of the most promising ones. Currently, however, thorough developability assessment requires multiple in vitro assays, which makes it labor intensive and time consuming to implement at early stages. Furthermore, accurate in vitro analysis at the early stage is compromised by the high number of potential candidates that are often prepared at low quantities and purity. Recent improvements in the performance of computational predictors of developability potential are beginning to change this scenario. Many computational methods only require the knowledge of the amino acid sequences and can be used to identify possible developability issues or to rank available candidates according to a range of biophysical properties. Here, we describe how the implementation of in silico tools into antibody discovery pipelines is increasingly offering time- and cost-effective alternatives to in vitro experimental screening, thus streamlining the drug development process. We discuss in particular the biophysical and biochemical properties that underpin developability potential and their trade-offs, review various in vitro assays to measure such properties or parameters that are predictive of developability, and give an overview of the growing number of in silico tools available to predict properties important for antibody development, including the CamSol method developed in our laboratory.
Collapse
|
29
|
Sorokina I, Mushegian AR, Koonin EV. Is Protein Folding a Thermodynamically Unfavorable, Active, Energy-Dependent Process? Int J Mol Sci 2022; 23:521. [PMID: 35008947 PMCID: PMC8745595 DOI: 10.3390/ijms23010521] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Revised: 12/30/2021] [Accepted: 12/31/2021] [Indexed: 02/04/2023] Open
Abstract
The prevailing current view of protein folding is the thermodynamic hypothesis, under which the native folded conformation of a protein corresponds to the global minimum of Gibbs free energy G. We question this concept and show that the empirical evidence behind the thermodynamic hypothesis of folding is far from strong. Furthermore, physical theory-based approaches to the prediction of protein folds and their folding pathways so far have invariably failed except for some very small proteins, despite decades of intensive theory development and the enormous increase of computer power. The recent spectacular successes in protein structure prediction owe to evolutionary modeling of amino acid sequence substitutions enhanced by deep learning methods, but even these breakthroughs provide no information on the protein folding mechanisms and pathways. We discuss an alternative view of protein folding, under which the native state of most proteins does not occupy the global free energy minimum, but rather, a local minimum on a fluctuating free energy landscape. We further argue that ΔG of folding is likely to be positive for the majority of proteins, which therefore fold into their native conformations only through interactions with the energy-dependent molecular machinery of living cells, in particular, the translation system and chaperones. Accordingly, protein folding should be modeled as it occurs in vivo, that is, as a non-equilibrium, active, energy-dependent process.
Collapse
Affiliation(s)
| | - Arcady R. Mushegian
- Division of Molecular and Cellular Biosciences, National Science Foundation, Alexandria, VA 22314, USA;
- Clare Hall College, University of Cambridge, Cambridge CB3 9AL, UK
| | - Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
30
|
Ittisoponpisan S, Jeerapan I. In Silico Analysis of Glucose Oxidase from Aspergillus niger: Potential Cysteine Mutation Sites for Enhancing Protein Stability. Bioengineering (Basel) 2021; 8:bioengineering8110188. [PMID: 34821754 PMCID: PMC8615187 DOI: 10.3390/bioengineering8110188] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 11/12/2021] [Accepted: 11/13/2021] [Indexed: 12/28/2022] Open
Abstract
Glucose oxidase (GOx) holds considerable advantages for various applications. Nevertheless, the thermal instability of the enzyme remains a grand challenge, impeding the success in applications outside the well-controlled laboratories, particularly in practical bioelectronics. Many strategies to modify GOx to achieve better thermal stability have been proposed. However, modification of this enzyme by adding extra disulfide bonds is yet to be explored. This work describes the in silico bioengineering of GOx from Aspergillus niger by judiciously analyzing characteristics of disulfide bonds found in the Top8000 protein database, then scanning for amino acid residue pairs that are suitable to be replaced with cysteines in order to establish disulfide bonds. Next, we predicted and assessed the mutant GOx models in terms of disulfide bond quality (bond length and α angles), functional impact by means of residue conservation, and structural impact as indicated by Gibbs free energy. We found eight putative residue pairs that can be engineered to form disulfide bonds. Five of these are located in less conserved regions and, therefore, are unlikely to have a deleterious impact on functionality. Finally, two mutations, Pro149Cys and His158Cys, showed potential for stabilizing the protein structure as confirmed by a structure-based stability analysis tool. The findings in this study highlight the opportunity of using disulfide bond modification as a new alternative technique to enhance the thermal stability of GOx.
Collapse
Affiliation(s)
- Sirawit Ittisoponpisan
- Center for Genomics and Bioinformatics Research, Division of Biological Science, Faculty of Science, Prince of Songkla University, Hat Yai, Songkhla 90110, Thailand
- Correspondence: (S.I.); (I.J.)
| | - Itthipon Jeerapan
- Center of Excellence for Trace Analysis and Biosensor, Prince of Songkla University, Hat Yai, Songkhla 90110, Thailand
- Division of Physical Science, Faculty of Science, Prince of Songkla University, Hat Yai, Songkhla 90110, Thailand
- Center of Excellence for Innovation in Chemistry, Faculty of Science, Prince of Songkla University, Hat Yai, Songkhla 90110, Thailand
- Correspondence: (S.I.); (I.J.)
| |
Collapse
|
31
|
Yang Y, Zeng L, Vihinen M. PON-Sol2: Prediction of Effects of Variants on Protein Solubility. Int J Mol Sci 2021; 22:8027. [PMID: 34360790 PMCID: PMC8348231 DOI: 10.3390/ijms22158027] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 07/19/2021] [Accepted: 07/22/2021] [Indexed: 01/13/2023] Open
Abstract
Genetic variations have a multitude of effects on proteins. A substantial number of variations affect protein-solvent interactions, either aggregation or solubility. Aggregation is often related to structural alterations, whereas solubilizable proteins in the solid phase can be made again soluble by dilution. Solubility is a central protein property and when reduced can lead to diseases. We developed a prediction method, PON-Sol2, to identify amino acid substitutions that increase, decrease, or have no effect on the protein solubility. The method is a machine learning tool utilizing gradient boosting algorithm and was trained on a large dataset of variants with different outcomes after the selection of features among a large number of tested properties. The method is fast and has high performance. The normalized correct prediction rate for three states is 0.656, and the normalized GC2 score is 0.312 in 10-fold cross-validation. The corresponding numbers in the blind test were 0.545 and 0.157. The performance was superior in comparison to previous methods. The PON-Sol2 predictor is freely available. It can be used to predict the solubility effects of variants for any organism, even in large-scale projects.
Collapse
Affiliation(s)
- Yang Yang
- School of Computer Science and Technology, Soochow University, Suzhou 215006, China; (Y.Y.); (L.Z.)
| | - Lianjie Zeng
- School of Computer Science and Technology, Soochow University, Suzhou 215006, China; (Y.Y.); (L.Z.)
- Collaborative Innovation Center of Novel Software Technology and Industrialization, Nanjing 210000, China
| | - Mauno Vihinen
- Department of Experimental Medical Science, Lund University, BMC B13, SE-221 84 Lund, Sweden
| |
Collapse
|
32
|
Kozuka K, Nakano S, Asano Y, Ito S. Partial Consensus Design and Enhancement of Protein Function by Secondary-Structure-Guided Consensus Mutations. Biochemistry 2021; 60:2309-2319. [PMID: 34254784 DOI: 10.1021/acs.biochem.1c00309] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Consensus design (CD) is a representative sequence-based protein design method that enables the design of highly functional proteins by analyzing vast amounts of protein sequence data. This study proposes a partial consensus design (PCD) of a protein as a derivative approach of CD. The method replaces the target protein sequence with a consensus sequence in a secondary-structure-dependent manner (i.e., regionally dependent and divided into α-helix, β-sheet, and loop regions). In this study, we generated several artificial partial consensus l-threonine 3-dehydrogenases (PcTDHs) by PCD using the TDH from Cupriavidus necator (CnTDH) as a target protein. Structural and functional analysis of PcTDHs suggested that thermostability would be independently improved when consensus mutations are introduced into the loop region of TDHs. On the other hand, enzyme kinetic parameters (kcat/Km) and average productivity would be synergistically enhanced by changing the combination of the mutations-replacement of one region of CnTDH with a consensus sequence provided only negative effects, but the negative effects were nullified when the two regions were replaced simultaneously. Taken together, we propose the hypothesis that there are protein regions that encode individual protein properties, such as thermostability and activity, and that the introduction of consensus mutations into these regions could additively or synergistically modify their functions.
Collapse
Affiliation(s)
- Kohei Kozuka
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka, 422-8526, Japan
| | - Shogo Nakano
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka, 422-8526, Japan.,PREST, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012, Japan
| | - Yasuhisa Asano
- Biotechnology Research Center and Department of Biotechnology, Toyama Prefectural University, 5180 Kurokawa, Imizu, Toyama 939-0398, Japan
| | - Sohei Ito
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka, 422-8526, Japan
| |
Collapse
|
33
|
Enhancement of protein thermostability by three consecutive mutations using loop-walking method and machine learning. Sci Rep 2021; 11:11883. [PMID: 34088952 PMCID: PMC8178419 DOI: 10.1038/s41598-021-91339-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Accepted: 05/25/2021] [Indexed: 01/22/2023] Open
Abstract
We developed a method to improve protein thermostability, “loop-walking method”. Three consecutive positions in 12 loops of Burkholderia cepacia lipase were subjected to random mutagenesis to make 12 libraries. Screening allowed us to identify L7 as a hot-spot loop having an impact on thermostability, and the P233G/L234E/V235M mutant was found from 214 variants in the L7 library. Although a more excellent mutant might be discovered by screening all the 8000 P233X/L234X/V235X mutants, it was difficult to assay all of them. We therefore employed machine learning. Using thermostability data of the 214 mutants, a computational discrimination model was constructed to predict thermostability potentials. Among 7786 combinations ranked in silico, 20 promising candidates were selected and assayed. The P233D/L234P/V235S mutant retained 66% activity after heat treatment at 60 °C for 30 min, which was higher than those of the wild-type enzyme (5%) and the P233G/L234E/V235M mutant (35%).
Collapse
|
34
|
Wilson CJ, Chang M, Karttunen M, Choy WY. KEAP1 Cancer Mutants: A Large-Scale Molecular Dynamics Study of Protein Stability. Int J Mol Sci 2021; 22:5408. [PMID: 34065616 PMCID: PMC8161161 DOI: 10.3390/ijms22105408] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Revised: 05/11/2021] [Accepted: 05/13/2021] [Indexed: 12/30/2022] Open
Abstract
We have performed 280 μs of unbiased molecular dynamics (MD) simulations to investigate the effects of 12 different cancer mutations on Kelch-like ECH-associated protein 1 (KEAP1) (G333C, G350S, G364C, G379D, R413L, R415G, A427V, G430C, R470C, R470H, R470S and G476R), one of the frequently mutated proteins in lung cancer. The aim was to provide structural insight into the effects of these mutants, including a new class of ANCHOR (additionally NRF2-complexed hypomorph) mutant variants. Our work provides additional insight into the structural dynamics of mutants that could not be analyzed experimentally, painting a more complete picture of their mutagenic effects. Notably, blade-wise analysis of the Kelch domain points to stability as a possible target of cancer in KEAP1. Interestingly, structural analysis of the R470C ANCHOR mutant, the most prevalent missense mutation in KEAP1, revealed no significant change in structural stability or NRF2 binding site dynamics, possibly indicating an covalent modification as this mutant's mode of action.
Collapse
Affiliation(s)
- Carter J. Wilson
- Department of Biochemistry, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5C1, Canada; (C.J.W.); (M.C.)
- Department of Applied Mathematics, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5B7, Canada
| | - Megan Chang
- Department of Biochemistry, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5C1, Canada; (C.J.W.); (M.C.)
| | - Mikko Karttunen
- Department of Applied Mathematics, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5B7, Canada
- Department of Chemistry, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 3K7, Canada
- Centre for Advanced Materials and Biomaterials Research, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5B7, Canada
| | - Wing-Yiu Choy
- Department of Biochemistry, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5C1, Canada; (C.J.W.); (M.C.)
| |
Collapse
|
35
|
Structure and function of naturally evolved de novo proteins. Curr Opin Struct Biol 2021; 68:175-183. [PMID: 33567396 DOI: 10.1016/j.sbi.2020.11.010] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 11/16/2020] [Accepted: 11/27/2020] [Indexed: 01/05/2023]
Abstract
Comparative evolutionary genomics has revealed that novel protein coding genes can emerge randomly from non-coding DNA. While most of the myriad of transcripts which continuously emerge vanish rapidly, some attain regulatory regions, become translated and survive. More surprisingly, sequence properties of de novo proteins are almost indistinguishable from randomly obtained sequences, yet de novo proteins may gain functions and integrate into eukaryotic cellular networks quite easily. We here discuss current knowledge on de novo proteins, their structures, functions and evolution. Since the existence of de novo proteins seems at odds with decade-long attempts to construct proteins with novel structures and functions from scratch, we suggest that a better understanding of de novo protein evolution may fuel new strategies for protein design.
Collapse
|
36
|
Sun Y, Qu J, Wang J, Zhao R, Wang C, Chen L, Hou X. Clinical and Functional Characteristics of a Novel KLF11 Cys354Phe Variant Involved in Maturity-Onset Diabetes of the Young. J Diabetes Res 2021; 2021:7136869. [PMID: 33604390 PMCID: PMC7870296 DOI: 10.1155/2021/7136869] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 11/18/2020] [Accepted: 01/10/2021] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Mutations in human KLF11 may lead to the development of maturity-onset diabetes of the young 7 (MODY7). This occurs due to impaired insulin synthesis in the pancreas. To date, the clinical and functional characteristics of the novel KLF11 mutation c.1061G > T have not yet been reported. METHODS Whole-exon sequencing was used to screen the proband and family members with clinical suspicion of the KLF11 variant. Luciferase reporter assays were used to investigate whether the KLF11 variant binds to the insulin promoter. Real-time PCR, western blotting, and glucose-stimulated insulin secretion (GSIS) analysis were used to analyze the KLF11 variant that regulates insulin expression and insulin secretion activity in beta cell lines. The Freestyle Libre H (Abbott Diabetes Care Ltd) was used to dynamically monitor the proband daily blood glucose levels. RESULTS Mutation screening for the whole exon genes identified a heterozygous KLF11 (c.1061G > T) variant in the proband, her mother, and her maternal grandfather. Cell-based luciferase reporter assays using wild-type and mutant transgenes revealed that the KLF11 (c.1061G > T) variant had impaired insulin promoter regulation activity. Moreover, this variant was found to impair insulin expression and insulin secretion in pancreatic beta cells. The proband had better blood glucose control without staple food intake (p < 0.05). CONCLUSIONS Herein, for the first time, we report a novel KLF11 (c.1061G > T) monogenic mutation associated with MODY7. This variant has impaired insulin promoter regulation activity and impairs insulin expression and secretion in pancreatic beta cells. Therefore, administering oral antidiabetic drugs along with dietary intervention may benefit the proband.
Collapse
Affiliation(s)
- Yujing Sun
- Department of Endocrinology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
- Institute of Endocrine and Metabolic Diseases of Shandong University, Jinan, 250012 Shandong Province, China
- Jinan Clinical Research Center for Endocrine and Metabolic Diseases, Jinan, 250012 Shandong Province, China
| | - Jingru Qu
- Department of Endocrinology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
- Institute of Endocrine and Metabolic Diseases of Shandong University, Jinan, 250012 Shandong Province, China
- Jinan Clinical Research Center for Endocrine and Metabolic Diseases, Jinan, 250012 Shandong Province, China
| | - Jing Wang
- Department of Endocrinology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
- Institute of Endocrine and Metabolic Diseases of Shandong University, Jinan, 250012 Shandong Province, China
- Jinan Clinical Research Center for Endocrine and Metabolic Diseases, Jinan, 250012 Shandong Province, China
| | - Ruxing Zhao
- Department of Endocrinology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
- Institute of Endocrine and Metabolic Diseases of Shandong University, Jinan, 250012 Shandong Province, China
- Jinan Clinical Research Center for Endocrine and Metabolic Diseases, Jinan, 250012 Shandong Province, China
| | - Chuan Wang
- Department of Endocrinology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
- Institute of Endocrine and Metabolic Diseases of Shandong University, Jinan, 250012 Shandong Province, China
- Jinan Clinical Research Center for Endocrine and Metabolic Diseases, Jinan, 250012 Shandong Province, China
| | - Li Chen
- Department of Endocrinology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
- Institute of Endocrine and Metabolic Diseases of Shandong University, Jinan, 250012 Shandong Province, China
- Jinan Clinical Research Center for Endocrine and Metabolic Diseases, Jinan, 250012 Shandong Province, China
| | - Xinguo Hou
- Department of Endocrinology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
- Institute of Endocrine and Metabolic Diseases of Shandong University, Jinan, 250012 Shandong Province, China
- Jinan Clinical Research Center for Endocrine and Metabolic Diseases, Jinan, 250012 Shandong Province, China
| |
Collapse
|
37
|
Chen Y, Lu H, Zhang N, Zhu Z, Wang S, Li M. PremPS: Predicting the impact of missense mutations on protein stability. PLoS Comput Biol 2020; 16:e1008543. [PMID: 33378330 PMCID: PMC7802934 DOI: 10.1371/journal.pcbi.1008543] [Citation(s) in RCA: 93] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 01/12/2021] [Accepted: 11/16/2020] [Indexed: 12/12/2022] Open
Abstract
Computational methods that predict protein stability changes induced by missense mutations have made a lot of progress over the past decades. Most of the available methods however have very limited accuracy in predicting stabilizing mutations because existing experimental sets are dominated by mutations reducing protein stability. Moreover, few approaches could consistently perform well across different test cases. To address these issues, we developed a new computational method PremPS to more accurately evaluate the effects of missense mutations on protein stability. The PremPS method is composed of only ten evolutionary- and structure-based features and parameterized on a balanced dataset with an equal number of stabilizing and destabilizing mutations. A comprehensive comparison of the predictive performance of PremPS with other available methods on nine benchmark datasets confirms that our approach consistently outperforms other methods and shows considerable improvement in estimating the impacts of stabilizing mutations. A protein could have multiple structures available, and if another structure of the same protein is used, the predicted change in stability for structure-based methods might be different. Thus, we further estimated the impact of using different structures on prediction accuracy, and demonstrate that our method performs well across different types of structures except for low-resolution structures and models built based on templates with low sequence identity. PremPS can be used for finding functionally important variants, revealing the molecular mechanisms of functional influences and protein design. PremPS is freely available at https://lilab.jysw.suda.edu.cn/research/PremPS/, which allows to do large-scale mutational scanning and takes about four minutes to perform calculations for a single mutation per protein with ~ 300 residues and requires ~ 0.4 seconds for each additional mutation. The development of computational methods to accurately predict the impacts of amino acid substitutions on protein stability is of paramount importance for the field of protein design and understanding the roles of missense mutations in disease. However, most of the available methods have very limited predictive accuracy for mutations increasing stability and few could consistently perform well across different test cases. Here we present a new computational approach PremPS, which is capable of predicting the effects of single point mutations on protein stability. PremPS employs only ten evolutionary- and structure-based features and is trained on a symmetrical dataset consisting of the same number of cases of stabilizing and destabilizing mutations. Our method was tested against numerous blind datasets and shows a considerable improvement especially in evaluating the effects of stabilizing mutations, outperforming previously developed methods. PremPS is freely available as a user-friendly web server at http://lilab.jysw.suda.edu.cn/research/PremPS/, which is fast enough to handle the large number of cases.
Collapse
Affiliation(s)
- Yuting Chen
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Haoyu Lu
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Ning Zhang
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Zefeng Zhu
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Shuqin Wang
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Minghui Li
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
- * E-mail:
| |
Collapse
|
38
|
Pechmann S. Programmed Trade-offs in Protein Folding Networks. Structure 2020; 28:1361-1375.e4. [PMID: 33053320 DOI: 10.1016/j.str.2020.09.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Revised: 07/25/2020] [Accepted: 09/23/2020] [Indexed: 12/14/2022]
Abstract
Molecular chaperones as specialized protein quality control enzymes form the core of cellular protein homeostasis. How chaperones selectively interact with their substrate proteins thus allocate their overall limited capacity remains poorly understood. Here, I present an integrated analysis of sequence and structural determinants that define interactions of protein domains as the basic protein folding unit with the Saccharomyces cerevisiae Hsp70 Ssb. Structural homologs of single-domain proteins that differentially interact with Ssb for de novo folding were found to systematically differ in complexity of their folding landscapes, selective use of nonoptimal codons, and presence of short discriminative sequences, thus highlighting pervasive trade-offs in chaperone-assisted protein folding landscapes. However, short discriminative sequences were found to contribute by far the strongest signal toward explaining Ssb interactions. This observation suggested that some chaperone interactions may be directly programmed in the amino acid sequences rather than responding to folding challenges, possibly for regulatory advantages.
Collapse
Affiliation(s)
- Sebastian Pechmann
- Département de biochimie, Université de Montréal, 2900 Boulevard Edouard-Montpetit, Montréal, QC H3T 1J4, Canada.
| |
Collapse
|
39
|
Sarkar A, Yang Y, Vihinen M. Variation benchmark datasets: update, criteria, quality and applications. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2020:5710862. [PMID: 32016318 PMCID: PMC6997940 DOI: 10.1093/database/baz117] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Revised: 06/03/2019] [Accepted: 07/01/2019] [Indexed: 02/07/2023]
Abstract
Development of new computational methods and testing their performance has to be carried out using experimental data. Only in comparison to existing knowledge can method performance be assessed. For that purpose, benchmark datasets with known and verified outcome are needed. High-quality benchmark datasets are valuable and may be difficult, laborious and time consuming to generate. VariBench and VariSNP are the two existing databases for sharing variation benchmark datasets used mainly for variation interpretation. They have been used for training and benchmarking predictors for various types of variations and their effects. VariBench was updated with 419 new datasets from 109 papers containing altogether 329 014 152 variants; however, there is plenty of redundancy between the datasets. VariBench is freely available at http://structure.bmc.lu.se/VariBench/. The contents of the datasets vary depending on information in the original source. The available datasets have been categorized into 20 groups and subgroups. There are datasets for insertions and deletions, substitutions in coding and non-coding region, structure mapped, synonymous and benign variants. Effect-specific datasets include DNA regulatory elements, RNA splicing, and protein property for aggregation, binding free energy, disorder and stability. Then there are several datasets for molecule-specific and disease-specific applications, as well as one dataset for variation phenotype effects. Variants are often described at three molecular levels (DNA, RNA and protein) and sometimes also at the protein structural level including relevant cross references and variant descriptions. The updated VariBench facilitates development and testing of new methods and comparison of obtained performances to previously published methods. We compared the performance of the pathogenicity/tolerance predictor PON-P2 to several benchmark studies, and show that such comparisons are feasible and useful, however, there may be limitations due to lack of provided details and shared data. Database URL: http://structure.bmc.lu.se/VariBench
Collapse
Affiliation(s)
- Anasua Sarkar
- Department of Experimental Medical Science, BMC B13, Lund University, SE-22 184 Lund, Sweden
| | - Yang Yang
- School of Computer Science and Technology, Soochow University, No1. Shizi Street, Suzhou, 215006 Jiangsu, China.,Provincial Key Laboratory for Computer Information Processing Technology, No1. Shizi Street, Soochow University, Suzhou, 215006 Jiangsu, China
| | - Mauno Vihinen
- Department of Experimental Medical Science, BMC B13, Lund University, SE-22 184 Lund, Sweden
| |
Collapse
|
40
|
DMAKit: A user-friendly web platform for bringing state-of-the-art data analysis techniques to non-specific users. INFORM SYST 2020. [DOI: 10.1016/j.is.2020.101557] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
41
|
Tunstall T, Portelli S, Phelan J, Clark TG, Ascher DB, Furnham N. Combining structure and genomics to understand antimicrobial resistance. Comput Struct Biotechnol J 2020; 18:3377-3394. [PMID: 33294134 PMCID: PMC7683289 DOI: 10.1016/j.csbj.2020.10.017] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Revised: 10/15/2020] [Accepted: 10/17/2020] [Indexed: 02/07/2023] Open
Abstract
Antimicrobials against bacterial, viral and parasitic pathogens have transformed human and animal health. Nevertheless, their widespread use (and misuse) has led to the emergence of antimicrobial resistance (AMR) which poses a potentially catastrophic threat to public health and animal husbandry. There are several routes, both intrinsic and acquired, by which AMR can develop. One major route is through non-synonymous single nucleotide polymorphisms (nsSNPs) in coding regions. Large scale genomic studies using high-throughput sequencing data have provided powerful new ways to rapidly detect and respond to such genetic mutations linked to AMR. However, these studies are limited in their mechanistic insight. Computational tools can rapidly and inexpensively evaluate the effect of mutations on protein function and evolution. Subsequent insights can then inform experimental studies, and direct existing or new computational methods. Here we review a range of sequence and structure-based computational tools, focussing on tools successfully used to investigate mutational effect on drug targets in clinically important pathogens, particularly Mycobacterium tuberculosis. Combining genomic results with the biophysical effects of mutations can help reveal the molecular basis and consequences of resistance development. Furthermore, we summarise how the application of such a mechanistic understanding of drug resistance can be applied to limit the impact of AMR.
Collapse
Affiliation(s)
- Tanushree Tunstall
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Stephanie Portelli
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Australia
| | - Jody Phelan
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - Taane G. Clark
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
- Department of Infectious Disease Epidemiology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| | - David B. Ascher
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Australia
| | - Nicholas Furnham
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
| |
Collapse
|
42
|
Hewitt CS, Krabill AD, Das C, Flaherty DP. Development of Ubiquitin Variants with Selectivity for Ubiquitin C-Terminal Hydrolase Deubiquitinases. Biochemistry 2020; 59:3447-3462. [PMID: 32865982 DOI: 10.1021/acs.biochem.9b01076] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Ubiquitin (Ub) is a highly conserved protein that is covalently attached to substrate proteins as a post-translational modification to regulate signaling pathways such as proteasomal degradation and cell cycle/transcriptional regulation in the eukaryotic cellular environment. Ub signaling is regulated by the homeostasis of substrate protein ubiquitination/deubiquitination by E3 ligases and deubiquitinating enzymes (DUBs) in healthy eukaryotic systems. One such DUB, ubiquitin C-terminal hydrolase L1 (UCHL1), is endogenously expressed in the central nervous system under normal physiological conditions, but overexpression and/or mutation has been linked to various cancers and neurodegenerative diseases. The lack of UCHL1 probing strategies suggests development of a selective Ub variant (UbV) for probing UCHL1's role in these disease states would be beneficial. We describe a computational design approach to investigate UbVs that lend selectivity, both binding and inhibition, to UCHL1 over the close structural homologue UCHL3 and members of other DUB families. A number of UbVs, mainly those containing Thr9 mutations, displayed appreciable binding and inhibition selectivity for UCHL1 over UCHL3, compared to wild-type Ub in in vitro assays. By appending reactive electrophiles to the C-terminus of the UbVs, we created the first activity-based probe (ABP) with demonstrated reaction selectivity for UCH family DUBs over other families in cell lysates. Further kinetic analysis of covalent inhibition by the UbV-ABP with UCHL1 and UCHL3 offers insight into the future design of UCHL1 selective UbV-ABP. These studies serve as a proof of concept of the viability of the in silico design of ubiquitin variants for UCH family DUBs as a step toward the development of macromolecular UCHL1 inhibitors.
Collapse
Affiliation(s)
- Chad S Hewitt
- Department of Medicinal Chemistry and Molecular Pharmacology, College of Pharmacy, Purdue University, 575 Stadium Mall Drive, West Lafayette, Indiana 47907, United States
| | - Aaron D Krabill
- Department of Medicinal Chemistry and Molecular Pharmacology, College of Pharmacy, Purdue University, 575 Stadium Mall Drive, West Lafayette, Indiana 47907, United States
| | - Chittaranjan Das
- Department of Chemistry, College of Science, Purdue University, 560 Oval Drive, West Lafayette, Indiana 47907, United States.,Purdue Center for Cancer Research, Purdue University, West Lafayette, Indiana 47907, United States
| | - Daniel P Flaherty
- Department of Medicinal Chemistry and Molecular Pharmacology, College of Pharmacy, Purdue University, 575 Stadium Mall Drive, West Lafayette, Indiana 47907, United States.,Purdue Center for Cancer Research, Purdue University, West Lafayette, Indiana 47907, United States.,Purdue Institute for Drug Discovery, Purdue University, West Lafayette, Indiana 47907, United States
| |
Collapse
|
43
|
Mazurenko S. Predicting protein stability and solubility changes upon mutations: data perspective. ChemCatChem 2020. [DOI: 10.1002/cctc.202000933] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Stanislav Mazurenko
- Loschmidt Laboratories Department of Experimental Biology and RECETOX Faculty of Science Masaryk University Zerotinovo nam. 617/9 601 77 Brno Czech Republic
| |
Collapse
|
44
|
Agarwal P, Meena S, Meena LS. Comprehensive analysis of GTP cyclohydrolase I activity in Mycobacterium tuberculosis H 37 Rv via in silico studies. Biotechnol Appl Biochem 2020; 68:756-768. [PMID: 32691412 DOI: 10.1002/bab.1988] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Accepted: 07/14/2020] [Indexed: 11/06/2022]
Abstract
GTP cyclohydrolase I enzyme (GTPCH-I) is a rate limiting enzyme in the biosynthesis pathway of tetrahydrobiopterin (BH4) and tetrahydrofolate (THF) compounds; latter being are an essential compounds involved in many biological functions. This enzyme has been evaluated structurally and functionally in many organisms to understand its putative role in cell processes, kinetics, regulations, drug targeting in infectious diseases, pain sensitivity in humans, and so on. In Mycobacterium tuberculosis (a human pathogen causing tuberculosis), this GTPCH-I activity has been predicted to be present in Rv3609c gene (folE) of H37 Rv strain, which till date has not been studied in detail. In order to understand in depth, the structure and function of folE protein in M. tuberculosis H37 Rv, in silico study was designed by using many different bioinformatics tools. Comparative and structural analysis predicts that Rv3609c gene is similar to folE protein ortholog of Listeria monocytogenes (cause food born disease), and uses zinc ion as a cofactor for its catalysis. Result shows that mutation of folE protein at 52th residue from tyrosine to glycine or variation in pH and temperature can lead to high destability in protein structure. Studies here have also predicted about the functional regions and interacting partners involved with folE protein. This study has provided clues to carry out experimentally the analysis of folE protein in mycobacteria and if found suitable will be used for drug targeting.
Collapse
Affiliation(s)
- Preeti Agarwal
- CSIR-Institute of Genomics and Integrative Biology, Mall Road, Delhi, India
| | - Swati Meena
- CSIR-Institute of Genomics and Integrative Biology, Mall Road, Delhi, India
| | - Laxman S Meena
- CSIR-Institute of Genomics and Integrative Biology, Mall Road, Delhi, India
| |
Collapse
|
45
|
Sanavia T, Birolo G, Montanucci L, Turina P, Capriotti E, Fariselli P. Limitations and challenges in protein stability prediction upon genome variations: towards future applications in precision medicine. Comput Struct Biotechnol J 2020; 18:1968-1979. [PMID: 32774791 PMCID: PMC7397395 DOI: 10.1016/j.csbj.2020.07.011] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2020] [Revised: 07/10/2020] [Accepted: 07/14/2020] [Indexed: 12/13/2022] Open
Abstract
Protein stability predictions are becoming essential in medicine to develop novel immunotherapeutic agents and for drug discovery. Despite the large number of computational approaches for predicting the protein stability upon mutation, there are still critical unsolved problems: 1) the limited number of thermodynamic measurements for proteins provided by current databases; 2) the large intrinsic variability of ΔΔG values due to different experimental conditions; 3) biases in the development of predictive methods caused by ignoring the anti-symmetry of ΔΔG values between mutant and native protein forms; 4) over-optimistic prediction performance, due to sequence similarity between proteins used in training and test datasets. Here, we review these issues, highlighting new challenges required to improve current tools and to achieve more reliable predictions. In addition, we provide a perspective of how these methods will be beneficial for designing novel precision medicine approaches for several genetic disorders caused by mutations, such as cancer and neurodegenerative diseases.
Collapse
Affiliation(s)
- Tiziana Sanavia
- Department of Medical Sciences, University of Torino, Via Santena 19, 10126 Torino, Italy
| | - Giovanni Birolo
- Department of Medical Sciences, University of Torino, Via Santena 19, 10126 Torino, Italy
| | - Ludovica Montanucci
- Department of Comparative Biomedicine and Food Science (BCA), University of Padova, Viale dell'Università 16, 35020 Legnaro, Italy
| | - Paola Turina
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Via F. Selmi 3, 40126 Bologna, Italy
| | - Emidio Capriotti
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Via F. Selmi 3, 40126 Bologna, Italy
| | - Piero Fariselli
- Department of Medical Sciences, University of Torino, Via Santena 19, 10126 Torino, Italy
| |
Collapse
|
46
|
Marabotti A, Scafuri B, Facchiano A. Predicting the stability of mutant proteins by computational approaches: an overview. Brief Bioinform 2020; 22:5850907. [PMID: 32496523 DOI: 10.1093/bib/bbaa074] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 04/07/2020] [Accepted: 04/10/2020] [Indexed: 01/06/2023] Open
Abstract
A very large number of computational methods to predict the change in thermodynamic stability of proteins due to mutations have been developed during the last 30 years, and many different web servers are currently available. Nevertheless, most of them suffer from severe drawbacks that decrease their general reliability and, consequently, their applicability to different goals such as protein engineering or the predictions of the effects of mutations in genetic diseases. In this review, we have summarized all the main approaches used to develop these tools, with a survey of the web servers currently available. Moreover, we have also reviewed the different assessments made during the years, in order to allow the reader to check directly the different performances of these tools, to select the one that best fits his/her needs, and to help naïve users in finding the best option for their needs.
Collapse
|
47
|
Broom A, Trainor K, Jacobi Z, Meiering EM. Computational Modeling of Protein Stability: Quantitative Analysis Reveals Solutions to Pervasive Problems. Structure 2020; 28:717-726.e3. [DOI: 10.1016/j.str.2020.04.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2019] [Revised: 03/26/2020] [Accepted: 04/06/2020] [Indexed: 12/20/2022]
|
48
|
Predicting the viability of beta-lactamase: How folding and binding free energies correlate with beta-lactamase fitness. PLoS One 2020; 15:e0233509. [PMID: 32470971 PMCID: PMC7259980 DOI: 10.1371/journal.pone.0233509] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Accepted: 05/06/2020] [Indexed: 12/25/2022] Open
Abstract
One of the long-standing holy grails of molecular evolution has been the ability to predict an organism’s fitness directly from its genotype. With such predictive abilities in hand, researchers would be able to more accurately forecast how organisms will evolve and how proteins with novel functions could be engineered, leading to revolutionary advances in medicine and biotechnology. In this work, we assemble the largest reported set of experimental TEM-1 β-lactamase folding free energies and use this data in conjunction with previously acquired fitness data and computational free energy predictions to determine how much of the fitness of β-lactamase can be directly predicted by thermodynamic folding and binding free energies. We focus upon β-lactamase because of its long history as a model enzyme and its central role in antibiotic resistance. Based upon a set of 21 β-lactamase single and double mutants expressly designed to influence protein folding, we first demonstrate that modeling software designed to compute folding free energies such as FoldX and PyRosetta can meaningfully, although not perfectly, predict the experimental folding free energies of single mutants. Interestingly, while these techniques also yield sensible double mutant free energies, we show that they do so for the wrong physical reasons. We then go on to assess how well both experimental and computational folding free energies explain single mutant fitness. We find that folding free energies account for, at most, 24% of the variance in β-lactamase fitness values according to linear models and, somewhat surprisingly, complementing folding free energies with computationally-predicted binding free energies of residues near the active site only increases the folding-only figure by a few percent. This strongly suggests that the majority of β-lactamase’s fitness is controlled by factors other than free energies. Overall, our results shed a bright light on to what extent the community is justified in using thermodynamic measures to infer protein fitness as well as how applicable modern computational techniques for predicting free energies will be to the large data sets of multiply-mutated proteins forthcoming.
Collapse
|
49
|
Kuroda D, Tsumoto K. Engineering Stability, Viscosity, and Immunogenicity of Antibodies by Computational Design. J Pharm Sci 2020; 109:1631-1651. [DOI: 10.1016/j.xphs.2020.01.011] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 12/25/2019] [Accepted: 01/10/2020] [Indexed: 12/18/2022]
|
50
|
CYP2R1 and CYP27A1 genes: An in silico approach to identify the deleterious mutations, impact on structure and their differential expression in disease conditions. Genomics 2020; 112:3677-3686. [PMID: 32344004 DOI: 10.1016/j.ygeno.2020.04.017] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Revised: 11/22/2019] [Accepted: 04/23/2020] [Indexed: 01/27/2023]
Abstract
Mutations in CYP2R1 and CYP27A1 involved in the conversion of Cholecalciferol into Calcidiol were associated with the impaired 25-hydroxylase activity therefore affecting the Vitamin D metabolism. Hence, this study attempted to understand the influence of genetic variations at the sequence and structural level via computational approach. The non-synonymous mutations retrieved from dbSNP database were assessed for their pathogenicity, stability as well as conservancy using various computational tools. The above analysis predicted 11/260 and 35/489 non-synonymous mutations to be deleterious in CYP2R1 and CYP27A1 genes respectively. Native and mutant forms of the corresponding proteins were modeled. Further, interacting native and mutant proteins with cholecalciferol showed difference in hydrogen bonds, hydrophobic bonds and their binding affinities suggesting the possible influence of these mutations in their function. Also, expression of these genes in various disease conditions was investigated using GEO datasets which predicted that there is a differential expression in cancer and arthritis.
Collapse
|