1
|
Arnaudi M, Utichi M, Tiberti M, Papaleo E. Predicting the structure-altering mechanisms of disease variants. Curr Opin Struct Biol 2025; 91:102994. [PMID: 40020537 DOI: 10.1016/j.sbi.2025.102994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 12/19/2024] [Accepted: 01/13/2025] [Indexed: 03/03/2025]
Abstract
Missense variants can affect the severity of disease, choice of treatment, and treatment outcomes. While the number of known variants has been increasing at a rapid pace, available evidence of their clinical effect has been lagging behind, constituting a challenge for clinicians and researchers. Multiplexed assays of variant effects (MAVEs) are important to close the gap; nonetheless, computational predictions of pathogenicity are still often the only available data for scoring variants. Such methods are not designed to provide a mechanistic explanation for the effect of amino acid substitutions. To this purpose, we propose structure-based frameworks as ensemble methodologies, with each method tailored to predict a different aspect among those exerted by amino acid substitutions to link predicted pathogenicity to mechanistic indicators. We review available frameworks, as well as advancements in underlying structure-based methods that predict variant effects on several protein features, such as protein stability, biomolecular interactions, allostery, post-translational modifications, and more.
Collapse
Affiliation(s)
- Matteo Arnaudi
- Cancer Structural Biology, Danish Cancer Institute, Strandboulevarden 49, 2100, Copenhagen, Denmark; Cancer Systems Biology, Section of Bioinformatics, Health and Technology Department, Technical University of Denmark, Lyngby, Denmark
| | - Mattia Utichi
- Cancer Structural Biology, Danish Cancer Institute, Strandboulevarden 49, 2100, Copenhagen, Denmark; Cancer Systems Biology, Section of Bioinformatics, Health and Technology Department, Technical University of Denmark, Lyngby, Denmark
| | - Matteo Tiberti
- Cancer Structural Biology, Danish Cancer Institute, Strandboulevarden 49, 2100, Copenhagen, Denmark.
| | - Elena Papaleo
- Cancer Structural Biology, Danish Cancer Institute, Strandboulevarden 49, 2100, Copenhagen, Denmark; Cancer Systems Biology, Section of Bioinformatics, Health and Technology Department, Technical University of Denmark, Lyngby, Denmark.
| |
Collapse
|
2
|
Stockinger P, Niederhauser C, Farnaud S, Buller R. Computational analysis reveals temperature-induced stabilization of FAST-PETase. Comput Struct Biotechnol J 2025; 27:969-977. [PMID: 40151525 PMCID: PMC11946493 DOI: 10.1016/j.csbj.2025.03.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2025] [Revised: 03/03/2025] [Accepted: 03/04/2025] [Indexed: 03/29/2025] Open
Abstract
More than 10 % of global solid waste consists of poly(ethyleneterephthalate) (PET). Among other techniques, PET hydrolases (PETases) can be used to depolymerize this plastic. However, wildtype PETases exhibit poor specific activities and insufficient thermostability, limiting their use in depolymerization processes which require high temperatures. In 2022, machine learning-aided enzyme engineering of a PETase stemming from the bacterium Ideonella sakaiensis (IsPETase) resulted in a more functional, active, stable, and tolerant variant (FAST-PETase). To rationalize the molecular basis of FAST-PETase's improved thermal stability, we performed comparative Constraint Network Analysis (CNAnalysis) and Molecular Dynamics (MD) simulations of wildtype IsPETase (WT-PETase) and FAST-PETase at 30°C and 50°C identifying thermolabile sequence stretches in the wildtype enzyme. Further analysis of the backbone flexibility revealed that all mutations of FAST-PETase affected these critical regions. Counterintuitively, the in-silico analyses additionally highlighted that the flexibility of these regions decreased at 50°C in FAST-PETase, instead of exhibiting increased flexibility at higher temperature as would be expected from thermodynamic considerations. This effect was confirmed by physical energy calculations, which suggest that temperature-dependent conformational changes of FAST-PETase decrease the free energy of unfolding (ΔG(stability)) and rigidify the enzyme at elevated temperatures enhancing stability. Looking forward, these findings might help guide the rational engineering of protein thermostability and contribute to our understanding of the thermal adaptation of thermophilic enzymes.
Collapse
Affiliation(s)
- Peter Stockinger
- Research Centre for Health & Life Sciences, Coventry University, Coventry CV1 5FB, United Kingdom
- Competence Center for Biocatalysis, Zurich University of Applied Sciences, Einsiedlerstrasse 31, Wädenswil 8820, Switzerland
| | - Cornel Niederhauser
- Competence Center for Biocatalysis, Zurich University of Applied Sciences, Einsiedlerstrasse 31, Wädenswil 8820, Switzerland
| | - Sebastien Farnaud
- Research Centre for Health & Life Sciences, Coventry University, Coventry CV1 5FB, United Kingdom
| | - Rebecca Buller
- Competence Center for Biocatalysis, Zurich University of Applied Sciences, Einsiedlerstrasse 31, Wädenswil 8820, Switzerland
| |
Collapse
|
3
|
Xu W, Li A, Zhao Y, Peng Y. Decoding the effects of mutation on protein interactions using machine learning. BIOPHYSICS REVIEWS 2025; 6:011307. [PMID: 40013003 PMCID: PMC11857871 DOI: 10.1063/5.0249920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2024] [Accepted: 01/14/2025] [Indexed: 02/28/2025]
Abstract
Accurately predicting mutation-caused binding free energy changes (ΔΔGs) on protein interactions is crucial for understanding how genetic variations affect interactions between proteins and other biomolecules, such as proteins, DNA/RNA, and ligands, which are vital for regulating numerous biological processes. Developing computational approaches with high accuracy and efficiency is critical for elucidating the mechanisms underlying various diseases, identifying potential biomarkers for early diagnosis, and developing targeted therapies. This review provides a comprehensive overview of recent advancements in predicting the impact of mutations on protein interactions across different interaction types, which are central to understanding biological processes and disease mechanisms, including cancer. We summarize recent progress in predictive approaches, including physicochemical-based, machine learning, and deep learning methods, evaluating the strengths and limitations of each. Additionally, we discuss the challenges related to the limitations of mutational data, including biases, data quality, and dataset size, and explore the difficulties in developing accurate prediction tools for mutation-induced effects on protein interactions. Finally, we discuss future directions for advancing these computational tools, highlighting the capabilities of advancing technologies, such as artificial intelligence to drive significant improvements in mutational effects prediction.
Collapse
Affiliation(s)
- Wang Xu
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan 430079, China
| | - Anbang Li
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan 430079, China
| | - Yunjie Zhao
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan 430079, China
| | - Yunhui Peng
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan 430079, China
| |
Collapse
|
4
|
Chen Z, He S, Chi X, Bo X. Predicting Antibody Affinity Changes upon Mutation Based on Unbound Protein Structures. Int J Mol Sci 2025; 26:1343. [PMID: 39941111 PMCID: PMC11818220 DOI: 10.3390/ijms26031343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2025] [Revised: 01/24/2025] [Accepted: 01/31/2025] [Indexed: 02/16/2025] Open
Abstract
Antibodies are key proteins in the immune system that can reversibly and non-covalently bind specifically to their corresponding antigens, forming antigen-antibody complexes. They play a crucial role in recognizing foreign or self-antigens during the adaptive immune response. Monoclonal antibodies have emerged as a promising class of biological macromolecule therapeutics with broad market prospects. In the process of antibody drug development, a key engineering challenge is to improve the affinity of candidate antibodies, without experimentally resolved structures of the antigen-antibody complexes as input for computer-aided predictive methods. In this work, we present an approach for predicting the effect of residue mutations on antibody affinity without the structures of the antigen-antibody complexes. The method involves the graph representation of proteins and utilizes a pre-trained encoder. The encoder captures the residue-level microenvironment of the target residue on the antibody along with the antigen context pre- and post-mutation. The encoder inherently possesses the potential to identify paratope residues. In addition, we curated a benchmark dataset specifically for mutations of the antibody. Compared to baseline methods based on complex structures and sequences, our approach achieves superior or comparable average accuracy on benchmark datasets. Additionally, we validate its advantage of not requiring antigen-antibody complex structures as input for predicting the effects of mutations in antibodies against SARS-CoV-2, influenza, and human cytomegalovirus. Our method shows its potential for identifying mutations that improve antibody affinity in practical antibody engineering applications.
Collapse
Affiliation(s)
| | | | - Xiangyang Chi
- Academy of Military Medical Sciences, Beijing 100850, China; (Z.C.); or (S.H.)
| | - Xiaochen Bo
- Academy of Military Medical Sciences, Beijing 100850, China; (Z.C.); or (S.H.)
| |
Collapse
|
5
|
Van der Pijl RJ, Ma W, Lewis CTA, Haar L, Buhl A, Farman GP, Rhodehamel M, Jani VP, Nelson OL, Zhang C, Granzier H, Ochala J. Increased cardiac myosin super-relaxation as an energy saving mechanism in hibernating grizzly bears. Mol Metab 2025; 92:102084. [PMID: 39694092 PMCID: PMC11732570 DOI: 10.1016/j.molmet.2024.102084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/24/2024] [Revised: 12/09/2024] [Accepted: 12/09/2024] [Indexed: 12/20/2024] Open
Abstract
AIM The aim of the present study was to define whether cardiac myosin contributes to energy conservation in the heart of hibernating mammals. METHODS Thin cardiac strips were isolated from the left ventricles of active and hibernating grizzly bears; and subjected to loaded Mant-ATP chase assays, X-ray diffraction and proteomics. MAIN FINDINGS Hibernating grizzly bears displayed an unusually high proportion of ATP-conserving super-relaxed cardiac myosin molecules that are likely due to altered levels of phosphorylation and rod region stability. CONCLUSIONS Cardiac myosin depresses the heart's energetic demand during hibernation by modulating its function.
Collapse
Affiliation(s)
| | - Weikang Ma
- BioCAT, Department of Biology, Illinois Institute of Technology, Chicago, IL, USA
| | | | - Line Haar
- Department of Biomedical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Amalie Buhl
- Department of Biomedical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Gerrie P Farman
- Department of Cellular and Molecular Medicine, University of Arizona, Tucson, AZ, USA
| | - Marcus Rhodehamel
- Division of Cardiology, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Vivek P Jani
- Division of Cardiology, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - O Lynne Nelson
- College of Veterinary Medicine, Washington State University, Pullman, WA, USA
| | - Chengxin Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Henk Granzier
- Department of Cellular and Molecular Medicine, University of Arizona, Tucson, AZ, USA
| | - Julien Ochala
- Department of Biomedical Sciences, University of Copenhagen, Copenhagen, Denmark; Myocardial Homeostasis and Cardiac Injury Program, Centro Nacional de Investigaciones Cardiovasculares (CNIC), Madrid, Spain.
| |
Collapse
|
6
|
Lewis CTA, Melhedegaard EG, Ognjanovic MM, Olsen MS, Laitila J, Seaborne RAE, Gronset M, Zhang C, Iwamoto H, Hessel AL, Kuehn MN, Merino C, Amigo N, Frobert O, Giroud S, Staples JF, Goropashnaya AV, Fedorov VB, Barnes B, Toien O, Drew K, Sprenger RJ, Ochala J. Remodeling of skeletal muscle myosin metabolic states in hibernating mammals. eLife 2024; 13:RP94616. [PMID: 38752835 PMCID: PMC11098559 DOI: 10.7554/elife.94616] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/18/2024] Open
Abstract
Hibernation is a period of metabolic suppression utilized by many small and large mammal species to survive during winter periods. As the underlying cellular and molecular mechanisms remain incompletely understood, our study aimed to determine whether skeletal muscle myosin and its metabolic efficiency undergo alterations during hibernation to optimize energy utilization. We isolated muscle fibers from small hibernators, Ictidomys tridecemlineatus and Eliomys quercinus and larger hibernators, Ursus arctos and Ursus americanus. We then conducted loaded Mant-ATP chase experiments alongside X-ray diffraction to measure resting myosin dynamics and its ATP demand. In parallel, we performed multiple proteomics analyses. Our results showed a preservation of myosin structure in U. arctos and U. americanus during hibernation, whilst in I. tridecemlineatus and E. quercinus, changes in myosin metabolic states during torpor unexpectedly led to higher levels in energy expenditure of type II, fast-twitch muscle fibers at ambient lab temperatures (20 °C). Upon repeating loaded Mant-ATP chase experiments at 8 °C (near the body temperature of torpid animals), we found that myosin ATP consumption in type II muscle fibers was reduced by 77-107% during torpor compared to active periods. Additionally, we observed Myh2 hyper-phosphorylation during torpor in I. tridecemilineatus, which was predicted to stabilize the myosin molecule. This may act as a potential molecular mechanism mitigating myosin-associated increases in skeletal muscle energy expenditure during periods of torpor in response to cold exposure. Altogether, we demonstrate that resting myosin is altered in hibernating mammals, contributing to significant changes to the ATP consumption of skeletal muscle. Additionally, we observe that it is further altered in response to cold exposure and highlight myosin as a potentially contributor to skeletal muscle non-shivering thermogenesis.
Collapse
Affiliation(s)
| | | | - Marija M Ognjanovic
- Department of Biomedical Sciences, University of CopenhagenCopenhagenDenmark
| | - Mathilde S Olsen
- Department of Biomedical Sciences, University of CopenhagenCopenhagenDenmark
| | - Jenni Laitila
- Department of Biomedical Sciences, University of CopenhagenCopenhagenDenmark
| | - Robert AE Seaborne
- Department of Biomedical Sciences, University of CopenhagenCopenhagenDenmark
- Centre for Human and Applied Physiological Sciences, Faculty of Life Sciences & Medicine, King’s College LondonLondonUnited Kingdom
| | - Magnus Gronset
- Department of Cellular and Molecular Medicine, University of CopenhagenCopenhagenDenmark
| | - Changxin Zhang
- Department of Computational Medicine and Bioinformatics, University of MichiganAnn ArborUnited States
| | - Hiroyuki Iwamoto
- Spring-8, Japan Synchrotron Radiation Research InstituteHyogoJapan
| | - Anthony L Hessel
- Institute of Physiology II, University of MuensterMuensterGermany
- Accelerated Muscle Biotechnologies ConsultantsBostonUnited States
| | - Michel N Kuehn
- Institute of Physiology II, University of MuensterMuensterGermany
- Accelerated Muscle Biotechnologies ConsultantsBostonUnited States
| | | | | | - Ole Frobert
- Department of Clinical Medicine, Faculty of Health, Aarhus UniversityAarhusDenmark
- Faculty of Health, Department of Cardiology, Örebro UniversityÖrebroSweden
| | - Sylvain Giroud
- Energetics Lab, Department of Biology, Northern Michigan UniversityMarquetteUnited States
- Research Institute of Wildlife Ecology, Department of Interdisciplinary Life Sciences, University of Veterinary Medicine ViennaViennaAustria
| | - James F Staples
- Department of Biology, University of Western OntarioLondonCanada
| | - Anna V Goropashnaya
- Center for Transformative Research in Metabolism, Institute of Arctic Biology, University of Alaska FairbanksFairbanksUnited States
| | - Vadim B Fedorov
- Center for Transformative Research in Metabolism, Institute of Arctic Biology, University of Alaska FairbanksFairbanksUnited States
| | - Brian Barnes
- Center for Transformative Research in Metabolism, Institute of Arctic Biology, University of Alaska FairbanksFairbanksUnited States
| | - Oivind Toien
- Center for Transformative Research in Metabolism, Institute of Arctic Biology, University of Alaska FairbanksFairbanksUnited States
| | - Kelly Drew
- Center for Transformative Research in Metabolism, Institute of Arctic Biology, University of Alaska FairbanksFairbanksUnited States
| | - Ryan J Sprenger
- Department of Zoology, University of British ColumbiaVancouverCanada
| | - Julien Ochala
- Department of Biomedical Sciences, University of CopenhagenCopenhagenDenmark
| |
Collapse
|
7
|
Fazekas Z, K Menyhárd D, Perczel A. LoCoHD: a metric for comparing local environments of proteins. Nat Commun 2024; 15:4029. [PMID: 38740745 DOI: 10.1038/s41467-024-48225-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Accepted: 04/22/2024] [Indexed: 05/16/2024] Open
Abstract
Protein folds and the local environments they create can be compared using a variety of differently designed measures, such as the root mean squared deviation, the global distance test, the template modeling score or the local distance difference test. Although these measures have proven to be useful for a variety of tasks, each fails to fully incorporate the valuable chemical information inherent to atoms and residues, and considers these only partially and indirectly. Here, we develop the highly flexible local composition Hellinger distance (LoCoHD) metric, which is based on the chemical composition of local residue environments. Using LoCoHD, we analyze the chemical heterogeneity of amino acid environments and identify valines having the most conserved-, and arginines having the most variable chemical environments. We use LoCoHD to investigate structural ensembles, to evaluate critical assessment of structure prediction (CASP) competitors, to compare the results with the local distance difference test (lDDT) scoring system, and to evaluate a molecular dynamics simulation. We show that LoCoHD measurements provide unique information about protein structures that is distinct from, for example, those derived using the alignment-based RMSD metric, or the similarly distance matrix-based but alignment-free lDDT metric.
Collapse
Affiliation(s)
- Zsolt Fazekas
- Laboratory of Structural Chemistry and Biology, Institute of Chemistry, ELTE Eötvös Loránd University, Budapest, Hungary
- ELTE Hevesy György PhD School of Chemistry, ELTE Eötvös Loránd University, Budapest, Hungary
| | - Dóra K Menyhárd
- Laboratory of Structural Chemistry and Biology, Institute of Chemistry, ELTE Eötvös Loránd University, Budapest, Hungary
- HUN-REN-ELTE Protein Modeling Research Group, ELTE Eötvös Loránd University, Budapest, Hungary
| | - András Perczel
- Laboratory of Structural Chemistry and Biology, Institute of Chemistry, ELTE Eötvös Loránd University, Budapest, Hungary.
- HUN-REN-ELTE Protein Modeling Research Group, ELTE Eötvös Loránd University, Budapest, Hungary.
| |
Collapse
|
8
|
Ozden B, Şamiloğlu E, Özsan A, Erguven M, Yükrük C, Koşaca M, Oktayoğlu M, Menteş M, Arslan N, Karakülah G, Barlas AB, Savaş B, Karaca E. Benchmarking the accuracy of structure-based binding affinity predictors on Spike-ACE2 deep mutational interaction set. Proteins 2024; 92:529-539. [PMID: 37991066 DOI: 10.1002/prot.26645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 10/25/2023] [Accepted: 11/13/2023] [Indexed: 11/23/2023]
Abstract
Since the start of COVID-19 pandemic, a huge effort has been devoted to understanding the Spike (SARS-CoV-2)-ACE2 recognition mechanism. To this end, two deep mutational scanning studies traced the impact of all possible mutations across receptor binding domain (RBD) of Spike and catalytic domain of human ACE2. By concentrating on the interface mutations of these experimental data, we benchmarked six commonly used structure-based binding affinity predictors (FoldX, EvoEF1, MutaBind2, SSIPe, HADDOCK, and UEP). These predictors were selected based on their user-friendliness, accessibility, and speed. As a result of our benchmarking efforts, we observed that none of the methods could generate a meaningful correlation with the experimental binding data. The best correlation is achieved by FoldX (R = -0.51). When we simplified the prediction problem to a binary classification, that is, whether a mutation is enriching or depleting the binding, we showed that the highest accuracy is achieved by FoldX with a 64% success rate. Surprisingly, on this set, simple energetic scoring functions performed significantly better than the ones using extra evolutionary-based terms, as in Mutabind and SSIPe. Furthermore, we demonstrated that recent AI approaches, mmCSM-PPI and TopNetTree, yielded comparable performances to the force field-based techniques. These observations suggest plenty of room to improve the binding affinity predictors in guessing the variant-induced binding profile changes of a host-pathogen system, such as Spike-ACE2. To aid such improvements we provide our benchmarking data at https://github.com/CSB-KaracaLab/RBD-ACE2-MutBench with the option to visualize our mutant models at https://rbd-ace2-mutbench.github.io/.
Collapse
Affiliation(s)
- Burcu Ozden
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
- Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, Izmir, Turkey
| | - Eda Şamiloğlu
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
- Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, Izmir, Turkey
| | - Atakan Özsan
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
| | - Mehmet Erguven
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
| | - Can Yükrük
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
| | - Mehdi Koşaca
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
- Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, Izmir, Turkey
| | - Melis Oktayoğlu
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
| | - Muratcan Menteş
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
| | - Nazmiye Arslan
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
| | - Gökhan Karakülah
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
- Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, Izmir, Turkey
| | - Ayşe Berçin Barlas
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
- Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, Izmir, Turkey
| | - Büşra Savaş
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
- Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, Izmir, Turkey
| | - Ezgi Karaca
- Izmir Biomedicine and Genome Center, Dokuz Eylul University Health Campus, Izmir, Turkey
- Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, Izmir, Turkey
| |
Collapse
|
9
|
Lewis CTA, Melhedegaard EG, Ognjanovic MM, Olsen MS, Laitila J, Seaborne RAE, Gronset MN, Zhang C, Iwamoto H, Hessel AL, Kuehn MN, Merino C, Amigo N, Frobert O, Giroud S, Staples JF, Goropashnaya AV, Fedorov VB, Barnes BM, Toien O, Drew KL, Sprenger RJ, Ochala J. Remodelling of Skeletal Muscle Myosin Metabolic States in Hibernating Mammals. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.14.566992. [PMID: 38014200 PMCID: PMC10680686 DOI: 10.1101/2023.11.14.566992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Hibernation is a period of metabolic suppression utilized by many small and large mammal species to survive during winter periods. As the underlying cellular and molecular mechanisms remain incompletely understood, our study aimed to determine whether skeletal muscle myosin and its metabolic efficiency undergo alterations during hibernation to optimize energy utilization. We isolated muscle fibers from small hibernators, Ictidomys tridecemlineatus and Eliomys quercinus and larger hibernators, Ursus arctos and Ursus americanus. We then conducted loaded Mant-ATP chase experiments alongside X-ray diffraction to measure resting myosin dynamics and its ATP demand. In parallel, we performed multiple proteomics analyses. Our results showed a preservation of myosin structure in U. arctos and U. americanus during hibernation, whilst in I. tridecemlineatus and E. quercinus, changes in myosin metabolic states during torpor unexpectedly led to higher levels in energy expenditure of type II, fast-twitch muscle fibers at ambient lab temperatures (20°C). Upon repeating loaded Mant-ATP chase experiments at 8°C (near the body temperature of torpid animals), we found that myosin ATP consumption in type II muscle fibers was reduced by 77-107% during torpor compared to active periods. Additionally, we observed Myh2 hyper-phosphorylation during torpor in I. tridecemilineatus, which was predicted to stabilize the myosin molecule. This may act as a potential molecular mechanism mitigating myosin-associated increases in skeletal muscle energy expenditure during periods of torpor in response to cold exposure. Altogether, we demonstrate that resting myosin is altered in hibernating mammals, contributing to significant changes to the ATP consumption of skeletal muscle. Additionally, we observe that it is further altered in response to cold exposure and highlight myosin as a potentially contributor to skeletal muscle non-shivering thermogenesis.
Collapse
|
10
|
Liu Z, Zhang C, Zhang Q, Zhang Y, Yu DJ. TM-search: An Efficient and Effective Tool for Protein Structure Database Search. J Chem Inf Model 2024; 64:1043-1049. [PMID: 38270339 DOI: 10.1021/acs.jcim.3c01455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2024]
Abstract
The quickly increasing size of the Protein Data Bank is challenging biologists to develop a more scalable protein structure alignment tool for fast structure database search. Although many protein structure search algorithms and programs have been designed and implemented for this purpose, most require a large amount of computational time. We propose a novel protein structure search approach, TM-search, which is based on the pairwise structure alignment program TM-align and a new iterative clustering algorithm. Benchmark tests demonstrate that TM-search is 27 times faster than a TM-align full database search while still being able to identify ∼90% of all high TM-score hits, which is 2-10 times more than other existing programs such as Foldseek, Dali, and PSI-BLAST.
Collapse
Affiliation(s)
- Zi Liu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
- Computer Department, Jingdezhen Ceramic University, Jingdezhen 333403, China
| | - Chengxin Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw, Ann Arbor, Michigan 48109-2218, United States
| | - Qidi Zhang
- Computer Department, Jingdezhen Ceramic University, Jingdezhen 333403, China
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw, Ann Arbor, Michigan 48109-2218, United States
| | - Dong-Jun Yu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China
| |
Collapse
|
11
|
Vincenzi M, Mercurio FA, Leone M. Virtual Screening of Peptide Libraries: The Search for Peptide-Based Therapeutics Using Computational Tools. Int J Mol Sci 2024; 25:1798. [PMID: 38339078 PMCID: PMC10855943 DOI: 10.3390/ijms25031798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 01/26/2024] [Accepted: 01/30/2024] [Indexed: 02/12/2024] Open
Abstract
Over the last few decades, we have witnessed growing interest from both academic and industrial laboratories in peptides as possible therapeutics. Bioactive peptides have a high potential to treat various diseases with specificity and biological safety. Compared to small molecules, peptides represent better candidates as inhibitors (or general modulators) of key protein-protein interactions. In fact, undruggable proteins containing large and smooth surfaces can be more easily targeted with the conformational plasticity of peptides. The discovery of bioactive peptides, working against disease-relevant protein targets, generally requires the high-throughput screening of large libraries, and in silico approaches are highly exploited for their low-cost incidence and efficiency. The present review reports on the potential challenges linked to the employment of peptides as therapeutics and describes computational approaches, mainly structure-based virtual screening (SBVS), to support the identification of novel peptides for therapeutic implementations. Cutting-edge SBVS strategies are reviewed along with examples of applications focused on diverse classes of bioactive peptides (i.e., anticancer, antimicrobial/antiviral peptides, peptides blocking amyloid fiber formation).
Collapse
Affiliation(s)
| | | | - Marilisa Leone
- Institute of Biostructures and Bioimaging, Via Pietro Castellino 111, 80131 Naples, Italy; (M.V.); (F.A.M.)
| |
Collapse
|
12
|
Hung TI, Hsieh YJ, Lu WL, Wu KP, Chang CEA. What Strengthens Protein-Protein Interactions: Analysis and Applications of Residue Correlation Networks. J Mol Biol 2023; 435:168337. [PMID: 37918563 PMCID: PMC11637584 DOI: 10.1016/j.jmb.2023.168337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 10/13/2023] [Accepted: 10/26/2023] [Indexed: 11/04/2023]
Abstract
Identifying residues critical to protein-protein binding and efficient design of stable and specific protein binders are challenging tasks. Extending beyond the direct contacts in a protein-protein binding interface, our study employs computational modeling to reveal the essential network of residue interactions and dihedral angle correlations critical in protein-protein recognition. We hypothesized that mutating residues exhibiting highly correlated dynamic motion within the interaction network could efficiently optimize protein-protein interactions to create tight and selective protein binders. We tested this hypothesis using the ubiquitin (Ub) and MERS coronaviral papain-like protease (PLpro) complex, since Ub is a central player in multiple cellular functions and PLpro is an antiviral drug target. Our designed ubiquitin variant (UbV) hosting three mutated residues displayed a ∼3,500-fold increase in functional inhibition relative to wild-type Ub. Further optimization of two C-terminal residues within the Ub network resulted in a KD of 1.5 nM and IC50 of 9.7 nM for the five-point Ub mutant, eliciting 27,500-fold and 5,500-fold enhancements in affinity and potency, respectively, as well as improved selectivity, without destabilizing the UbV structure. Our study highlights residue correlation and interaction networks in protein-protein interactions, and introduces an effective approach to design high-affinity protein binders for cell biology research and future therapeutics.
Collapse
Affiliation(s)
- Ta I Hung
- Department of Chemistry, University of California, Riverside, United States; Department of Bioengineering, University of California, Riverside, United States
| | - Yun-Jung Hsieh
- Institute of Biological Chemistry, Academia Sinica, Taipei, Taiwan; Institute of Biochemical Sciences, National Taiwan University, Taipei, Taiwan
| | - Wei-Lin Lu
- Institute of Biological Chemistry, Academia Sinica, Taipei, Taiwan
| | - Kuen-Phon Wu
- Institute of Biological Chemistry, Academia Sinica, Taipei, Taiwan; Institute of Biochemical Sciences, National Taiwan University, Taipei, Taiwan.
| | - Chia-En A Chang
- Department of Chemistry, University of California, Riverside, United States.
| |
Collapse
|
13
|
Maia RT, Silva ISDS, Fernandes de Souza A, Frazão NF, de Lima RM, Campos MDA. Miraculin-based sweeteners in the protein-engineering era: an alternative for developing more efficient and safer products. J Biomol Struct Dyn 2023; 42:11342-11350. [PMID: 37753742 DOI: 10.1080/07391102.2023.2262589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 09/16/2023] [Indexed: 09/28/2023]
Abstract
The current sweeteners available are very efficient in providing sweet taste. However, they are associated with several chronic diseases. Some glycoproteins, such as miraculins, are extremely interesting from a biotechnological point of view because they perform the bitter into sweet taste modifying function excellently, in addition to being safer as food. In contrast, purifying and synthesizing these proteins represents a major challenge for the food industry, as these proteins are large and complex molecules, which would make the final product expensive and economically unviable. In this context, emerging techniques from computational biology and molecular modelling have been promoting a remarkable revolution in protein bioengineering. Bioinspired peptides can provide many possibilities in sweeteners development through rational design. Once these peptides are smaller molecules than an entire protein, its synthesis on a large scale tends to be much easier and more economical, besides presenting a potential for better bioavailability in the organism. The techniques discussed here allow, through sophisticated pipelines and algorithms, to perform the rational design of mimetic peptides and with smaller size, which can carry out the activation of sweet taste of miraculins and to be more viable for industrial production. In this review, the premises and tools for the elaboration of synthetic peptides bioinspired in proteins with sweetening activity that mimic this action will be emphasized.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Rafael Trindade Maia
- Center for Sustainable Development of Semiarid, Federal University of Campina Grande, Sumé, Brazil
- Post-Graduation Program in Natural Science and Biotechnology, Center of Education and Health, Federal University of Campina Grande, Cuité, Brazil
| | - Ivânia Samara Dos Santos Silva
- Post-Graduation Program in Natural Science and Biotechnology, Center of Education and Health, Federal University of Campina Grande, Cuité, Brazil
| | - Adeilma Fernandes de Souza
- Post-Graduation Program in Natural Science and Biotechnology, Center of Education and Health, Federal University of Campina Grande, Cuité, Brazil
| | - Nilton Ferreira Frazão
- Post-Graduation Program in Natural Science and Biotechnology, Center of Education and Health, Federal University of Campina Grande, Cuité, Brazil
| | - Rafael Medeiros de Lima
- Post-Graduation Program in Natural Science and Biotechnology, Center of Education and Health, Federal University of Campina Grande, Cuité, Brazil
| | - Magnólia de Araújo Campos
- Post-Graduation Program in Natural Science and Biotechnology, Center of Education and Health, Federal University of Campina Grande, Cuité, Brazil
| |
Collapse
|
14
|
Huang X, Sun Y, Osawa Y, Chen YE, Zhang H. Computational redesign of cytochrome P450 CYP102A1 for highly stereoselective omeprazole hydroxylation by UniDesign. J Biol Chem 2023; 299:105050. [PMID: 37451479 PMCID: PMC10413352 DOI: 10.1016/j.jbc.2023.105050] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 07/03/2023] [Accepted: 07/09/2023] [Indexed: 07/18/2023] Open
Abstract
Cytochrome P450 CYP102A1 is a prototypic biocatalyst that has great potential in chemical synthesis, drug discovery, and biotechnology. CYP102A1 variants engineered by directed evolution and/or rational design are capable of catalyzing the oxidation of a wide range of organic compounds. However, it is difficult to foresee the outcome of engineering CYP102A1 for a compound of interest. Here, we introduce UniDesign as a computational framework for enzyme design and engineering. We tested UniDesign by redesigning CYP102A1 for stereoselective metabolism of omeprazole (OMP), a proton pump inhibitor, starting from an active but nonstereoselective triple mutant (TM: A82F/F87V/L188Q). To shift stereoselectivity toward (R)-OMP, we computationally scanned three active site positions (75, 264, and 328) for mutations that would stabilize the binding of the transition state of (R)-OMP while destabilizing that of (S)-OMP and picked three variants, namely UD1 (TM/L75I), UD2 (TM/A264G), and UD3 (TM/A328V), for experimentation, based on computed energy scores and models. UD1, UD2, and UD3 exhibit high turnover rates of 55 ± 4.7, 84 ± 4.8, and 79 ± 5.7 min-1, respectively, for (R)-OMP hydroxylation, whereas the corresponding rates for (S)-OMP are only 2.2 ± 0.19, 6.0 ± 0.68, and 14 ± 2.8 min-1, yielding an enantiomeric excess value of 92, 87, and 70%, respectively. These results suggest the critical roles of L75I, A264G, and A328V in steering OMP in the optimal orientation for stereoselective oxidation and demonstrate the utility of UniDesign for engineering CYP102A1 to produce drug metabolites of interest. The results are discussed in the context of protein structures.
Collapse
Affiliation(s)
- Xiaoqiang Huang
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA.
| | - Yudong Sun
- Department of Pharmacology, University of Michigan, Ann Arbor, Michigan, USA
| | - Yoichi Osawa
- Department of Pharmacology, University of Michigan, Ann Arbor, Michigan, USA
| | - Y Eugene Chen
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA
| | - Haoming Zhang
- Department of Pharmacology, University of Michigan, Ann Arbor, Michigan, USA.
| |
Collapse
|
15
|
Huang X, Zhou J, Yang D, Zhang J, Xia X, Chen YE, Xu J. Decoding CRISPR-Cas PAM recognition with UniDesign. Brief Bioinform 2023; 24:bbad133. [PMID: 37078688 PMCID: PMC10199764 DOI: 10.1093/bib/bbad133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 02/09/2023] [Accepted: 03/16/2023] [Indexed: 04/21/2023] Open
Abstract
The critical first step in Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-associated (CRISPR-Cas) protein-mediated gene editing is recognizing a preferred protospacer adjacent motif (PAM) on target DNAs by the protein's PAM-interacting amino acids (PIAAs). Thus, accurate computational modeling of PAM recognition is useful in assisting CRISPR-Cas engineering to relax or tighten PAM requirements for subsequent applications. Here, we describe a universal computational protein design framework (UniDesign) for designing protein-nucleic acid interactions. As a proof of concept, we applied UniDesign to decode the PAM-PIAA interactions for eight Cas9 and two Cas12a proteins. We show that, given native PIAAs, the UniDesign-predicted PAMs are largely identical to the natural PAMs of all Cas proteins. In turn, given natural PAMs, the computationally redesigned PIAA residues largely recapitulated the native PIAAs (74% and 86% in terms of identity and similarity, respectively). These results demonstrate that UniDesign faithfully captures the mutual preference between natural PAMs and native PIAAs, suggesting it is a useful tool for engineering CRISPR-Cas and other nucleic acid-interacting proteins. UniDesign is open-sourced at https://github.com/tommyhuangthu/UniDesign.
Collapse
Affiliation(s)
- Xiaoqiang Huang
- Center for Advanced Models for Translational Sciences and Therapeutics, Department of Internal Medicine, University of Michigan Medical School, 2800 Plymouth Road, Ann Arbor, MI 48109, USA
| | - Jun Zhou
- Center for Advanced Models for Translational Sciences and Therapeutics, Department of Internal Medicine, University of Michigan Medical School, 2800 Plymouth Road, Ann Arbor, MI 48109, USA
| | - Dongshan Yang
- Center for Advanced Models for Translational Sciences and Therapeutics, Department of Internal Medicine, University of Michigan Medical School, 2800 Plymouth Road, Ann Arbor, MI 48109, USA
| | - Jifeng Zhang
- Center for Advanced Models for Translational Sciences and Therapeutics, Department of Internal Medicine, University of Michigan Medical School, 2800 Plymouth Road, Ann Arbor, MI 48109, USA
| | - Xiaofeng Xia
- Research & Development, ATGC Inc., 100 E Lancaster Avenue, LIMR Building Lab 129, Wynnewood, PA 19096, USA
| | - Yuqing Eugene Chen
- Center for Advanced Models for Translational Sciences and Therapeutics, Department of Internal Medicine, University of Michigan Medical School, 2800 Plymouth Road, Ann Arbor, MI 48109, USA
| | - Jie Xu
- Center for Advanced Models for Translational Sciences and Therapeutics, Department of Internal Medicine, University of Michigan Medical School, 2800 Plymouth Road, Ann Arbor, MI 48109, USA
| |
Collapse
|
16
|
Kalita P, Tripathi T, Padhi AK. Computational Protein Design for COVID-19 Research and Emerging Therapeutics. ACS CENTRAL SCIENCE 2023; 9:602-613. [PMID: 37122454 PMCID: PMC10042144 DOI: 10.1021/acscentsci.2c01513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Indexed: 05/03/2023]
Abstract
As the world struggles with the ongoing COVID-19 pandemic, unprecedented obstacles have continuously been traversed as new SARS-CoV-2 variants continually emerge. Infectious disease outbreaks are unavoidable, but the knowledge gained from the successes and failures will help create a robust health management system to deal with such pandemics. Previously, scientists required years to develop diagnostics, therapeutics, or vaccines; however, we have seen that, with the rapid deployment of high-throughput technologies and unprecedented scientific collaboration worldwide, breakthrough discoveries can be accelerated and insights broadened. Computational protein design (CPD) is a game-changing new technology that has provided alternative therapeutic strategies for pandemic management. In addition to the development of peptide-based inhibitors, miniprotein binders, decoys, biosensors, nanobodies, and monoclonal antibodies, CPD has also been used to redesign native SARS-CoV-2 proteins and human ACE2 receptors. We discuss how novel CPD strategies have been exploited to develop rationally designed and robust COVID-19 treatment strategies.
Collapse
Affiliation(s)
- Parismita Kalita
- Molecular
and Structural Biophysics Laboratory, Department of Biochemistry, North-Eastern Hill University, Shillong 793022, India
| | - Timir Tripathi
- Molecular
and Structural Biophysics Laboratory, Department of Biochemistry, North-Eastern Hill University, Shillong 793022, India
- Regional
Director’s Office, Indira Gandhi
National Open University, Regional Centre Kohima, Kenuozou, Kohima 797001, India
| | - Aditya K. Padhi
- Laboratory
for Computational Biology & Biomolecular Design, School of Biochemical
Engineering, Indian Institute of Technology
(BHU), Varanasi 221005, Uttar Pradesh, India
| |
Collapse
|
17
|
Koşaca M, Yılmazbilek İ, Karaca E. PROT-ON: A structure-based detection of designer PROTein interface MutatiONs. Front Mol Biosci 2023; 10:1063971. [PMID: 36936988 PMCID: PMC10018488 DOI: 10.3389/fmolb.2023.1063971] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Accepted: 01/31/2023] [Indexed: 03/06/2023] Open
Abstract
The mutation-induced changes across protein-protein interfaces have often been observed to lead to severe diseases. Therefore, several computational tools have been developed to predict the impact of such mutations. Among these tools, FoldX and EvoEF1 stand out as fast and accurate alternatives. Expanding on the capabilities of these tools, we have developed the PROT-ON (PROTein-protein interface mutatiONs) framework, which aims at delivering the most critical protein interface mutations that can be used to design new protein binders. To realize this aim, PROT-ON takes the 3D coordinates of a protein dimer as an input. Then, it probes all possible interface mutations on the selected protein partner with EvoEF1 or FoldX. The calculated mutational energy landscape is statistically analyzed to find the most enriching and depleting mutations. Afterward, these extreme mutations are filtered out according to stability and optionally according to evolutionary criteria. The final remaining mutation list is presented to the user as the designer mutation set. Together with this set, PROT-ON provides several residue- and energy-based plots, portraying the synthetic energy landscape of the probed mutations. The stand-alone version of PROT-ON is deposited at https://github.com/CSB-KaracaLab/prot-on. The users can also use PROT-ON through our user-friendly web service http://proton.tools.ibg.edu.tr:8001/ (runs with EvoEF1 only). Considering its speed and the range of analysis provided, we believe that PROT-ON presents a promising means to estimate designer mutations.
Collapse
Affiliation(s)
- Mehdi Koşaca
- Izmir Biomedicine and Genome Center, Dokuz Eylul Health Campus, Izmir, Türkiye
- Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir, Türkiye
| | - İrem Yılmazbilek
- Izmir Biomedicine and Genome Center, Dokuz Eylul Health Campus, Izmir, Türkiye
- Middle East Technical University, Ankara, Türkiye
| | - Ezgi Karaca
- Izmir Biomedicine and Genome Center, Dokuz Eylul Health Campus, Izmir, Türkiye
- Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir, Türkiye
| |
Collapse
|
18
|
Pearce R, Huang X, Omenn GS, Zhang Y. De novo protein fold design through sequence-independent fragment assembly simulations. Proc Natl Acad Sci U S A 2023; 120:e2208275120. [PMID: 36656852 PMCID: PMC9942881 DOI: 10.1073/pnas.2208275120] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 12/22/2022] [Indexed: 01/20/2023] Open
Abstract
De novo protein design generally consists of two steps, including structure and sequence design. Many protein design studies have focused on sequence design with scaffolds adapted from native structures in the PDB, which renders novel areas of protein structure and function space unexplored. We developed FoldDesign to create novel protein folds from specific secondary structure (SS) assignments through sequence-independent replica-exchange Monte Carlo (REMC) simulations. The method was tested on 354 non-redundant topologies, where FoldDesign consistently created stable structural folds, while recapitulating on average 87.7% of the SS elements. Meanwhile, the FoldDesign scaffolds had well-formed structures with buried residues and solvent-exposed areas closely matching their native counterparts. Despite the high fidelity to the input SS restraints and local structural characteristics of native proteins, a large portion of the designed scaffolds possessed global folds completely different from natural proteins in the PDB, highlighting the ability of FoldDesign to explore novel areas of protein fold space. Detailed data analyses revealed that the major contributions to the successful structure design lay in the optimal energy force field, which contains a balanced set of SS packing terms, and REMC simulations, which were coupled with multiple auxiliary movements to efficiently search the conformational space. Additionally, the ability to recognize and assemble uncommon super-SS geometries, rather than the unique arrangement of common SS motifs, was the key to generating novel folds. These results demonstrate a strong potential to explore both structural and functional spaces through computational design simulations that natural proteins have not reached through evolution.
Collapse
Affiliation(s)
- Robin Pearce
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI48109
| | - Xiaoqiang Huang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI48109
| | - Gilbert S. Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI48109
- Department of Internal Medicine, University of Michigan, Ann Arbor, MI48109
- Department of Human Genetics, University of Michigan, Ann Arbor, MI48109
- School of Public Health, University of Michigan, Ann Arbor, MI48109
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI48109
- Department of Biological Chemistry, University of Michigan, Ann Arbor, MI48109
- Department of Computer Science, School of Computing, National University of Singapore117417, Singapore
- Cancer Science Institute of Singapore, National University of Singapore117599, Singapore
| |
Collapse
|
19
|
Clifton BE, Kozome D, Laurino P. Efficient Exploration of Sequence Space by Sequence-Guided Protein Engineering and Design. Biochemistry 2023; 62:210-220. [PMID: 35245020 DOI: 10.1021/acs.biochem.1c00757] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The rapid growth of sequence databases over the past two decades means that protein engineers faced with optimizing a protein for any given task will often have immediate access to a vast number of related protein sequences. These sequences encode information about the evolutionary history of the protein and the underlying sequence requirements to produce folded, stable, and functional protein variants. Methods that can take advantage of this information are an increasingly important part of the protein engineering tool kit. In this Perspective, we discuss the utility of sequence data in protein engineering and design, focusing on recent advances in three main areas: the use of ancestral sequence reconstruction as an engineering tool to generate thermostable and multifunctional proteins, the use of sequence data to guide engineering of multipoint mutants by structure-based computational protein design, and the use of unlabeled sequence data for unsupervised and semisupervised machine learning, allowing the generation of diverse and functional protein sequences in unexplored regions of sequence space. Altogether, these methods enable the rapid exploration of sequence space within regions enriched with functional proteins and therefore have great potential for accelerating the engineering of stable, functional, and diverse proteins for industrial and biomedical applications.
Collapse
Affiliation(s)
- Ben E Clifton
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Dan Kozome
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Paola Laurino
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| |
Collapse
|
20
|
Woodard J, Iqbal S, Mashaghi A. Circuit topology predicts pathogenicity of missense mutations. Proteins 2022; 90:1634-1644. [PMID: 35394672 PMCID: PMC9543832 DOI: 10.1002/prot.26342] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 03/07/2022] [Accepted: 03/30/2022] [Indexed: 12/05/2022]
Abstract
The contact topology of a protein determines important aspects of the folding process. The topological measure of contact order has been shown to be predictive of the rate of folding. Circuit topology is emerging as another fundamental descriptor of biomolecular structure, with predicted effects on the folding rate. We analyze the residue‐based circuit topological environments of 21 K mutations labeled as pathogenic or benign. Multiple statistical lines of reasoning support the conclusion that the number of contacts in two specific circuit topological arrangements, namely inverse parallel and cross relations, with contacts involving the mutated residue have discriminatory value in determining the pathogenicity of human variants. We investigate how results vary with residue type and according to whether the gene is essential. We further explore the relationship to a number of structural features and find that circuit topology provides nonredundant information on protein structures and pathogenicity of mutations. Results may have implications for the polymer physics of protein folding and suggest that “local” topological information, including residue‐based circuit topology and residue contact order, could be useful in improving state‐of‐the‐art machine learning algorithms for pathogenicity prediction.
Collapse
Affiliation(s)
- Jaie Woodard
- Medical Systems Biophysics and Bioengineering, Leiden Academic Centre for Drug Research, Faculty of Science, Leiden University, Leiden, The Netherlands.,Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | - Sumaiya Iqbal
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, USA
| | - Alireza Mashaghi
- Medical Systems Biophysics and Bioengineering, Leiden Academic Centre for Drug Research, Faculty of Science, Leiden University, Leiden, The Netherlands.,Centre for Interdisciplinary Genome Research, Faculty of Science, Leiden University, Leiden, The Netherlands
| |
Collapse
|
21
|
Hurwitz N, Zaidman D, Wolfson HJ. Pep–Whisperer: Inhibitory peptide design. Proteins 2022; 90:1886-1895. [DOI: 10.1002/prot.26384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Revised: 04/07/2022] [Accepted: 04/29/2022] [Indexed: 11/07/2022]
Affiliation(s)
- Naama Hurwitz
- Blavatnik School of Computer Science Tel Aviv University Tel Aviv Israel
| | - Daniel Zaidman
- Department of Organic Chemistry Weizmann Institute of Science Rehovot Israel
| | - Haim J. Wolfson
- Blavatnik School of Computer Science Tel Aviv University Tel Aviv Israel
| |
Collapse
|
22
|
Talluri S. Algorithms for protein design. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2022; 130:1-38. [PMID: 35534105 DOI: 10.1016/bs.apcsb.2022.01.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Computational Protein Design has the potential to contribute to major advances in enzyme technology, vaccine design, receptor-ligand engineering, biomaterials, nanosensors, and synthetic biology. Although Protein Design is a challenging problem, proteins can be designed by experts in Protein Design, as well as by non-experts whose primary interests are in the applications of Protein Design. The increased accessibility of Protein Design technology is attributable to the accumulated knowledge and experience with Protein Design as well as to the availability of software and online resources. The objective of this review is to serve as a guide to the relevant literature with a focus on the novel methods and algorithms that have been developed or applied for Protein Design, and to assist in the selection of algorithms for Protein Design. Novel algorithms and models that have been introduced to utilize the enormous amount of experimental data and novel computational hardware have the potential for producing substantial increases in the accuracy, reliability and range of applications of designed proteins.
Collapse
Affiliation(s)
- Sekhar Talluri
- Department of Biotechnology, GITAM, Visakhapatnam, India.
| |
Collapse
|
23
|
Shah M, Ung Moon S, Hyun Kim J, Thanh Thao T, Goo Woo H. SARS-CoV-2 pan-variant inhibitory peptides deter S1-ACE2 interaction and neutralize delta and omicron pseudoviruses. Comput Struct Biotechnol J 2022; 20:2042-2056. [PMID: 35495107 PMCID: PMC9040525 DOI: 10.1016/j.csbj.2022.04.030] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Revised: 04/20/2022] [Accepted: 04/20/2022] [Indexed: 12/12/2022] Open
Abstract
Approved neutralizing antibodies that target the prototype Spike are losing their potency against the emerging variants of concern (VOCs) of SARS-CoV-2, particularly Omicron. Although SARS-CoV-2 is continuously adapting the host environment, emerging variants recognize the same ACE2 receptor for cell entry. Protein and peptide decoys derived from ACE2 or Spike proteins may hold the pan-variant inhibitory potential. Here, we deployed interactive structure- and pharmacophore-based approaches to design short and stable peptides -Coronavirus Spike Neutralizing Peptides (CSNPs)- capable of neutralizing all SARS-CoV-2 VOCs. After in silico structural stability investigation and free energies perturbation of the isolated and target-bound peptides, nine candidate peptides were evaluated for the biophysical interaction through SPR assay. CSNP1, CSNP2, and Pep1 dose-dependently bind the S1 domain of the prototype Spike, whereas CSNP4 binds both S1 and ACE2. After safety and immunocytochemistry evaluation, peptides were probed for their pan-variant inhibitory effects. CSNP1, CSNP2, and CSNP4 inhibited all VOCs dose-dependently, whereas Pep1 had a moderate effect. CSNP2 and CSNP4 could neutralize the wild-type pseudovirus up to 80 % when treated at 0.5 µM. Furthermore, CSNP4 synergize the neutralization effect of monoclonal antibody and CSNP1 in Delta variant pseudovirus assay as they target different regions on the RBD. Thus, we suggest that CSNPs are SARS-CoV-2 pan-variant inhibitory candidates for COVID-19 therapy, which may pave the way for combating the emerging immune-escaping variants. We also propose that CSNP1/2-CSNP4 peptide cocktail or CSNP1/4 mAbs cocktail with no overlapping epitopes could be effective therapeutic strategies against COVID-19.
Collapse
Affiliation(s)
- Masaud Shah
- Department of Physiology, Ajou University School of Medicine, Suwon, Republic of Korea
| | - Sung Ung Moon
- Department of Physiology, Ajou University School of Medicine, Suwon, Republic of Korea
| | - Jang Hyun Kim
- Department of Physiology, Ajou University School of Medicine, Suwon, Republic of Korea
| | - Trinh Thanh Thao
- Department of Physiology, Ajou University School of Medicine, Suwon, Republic of Korea
| | - Hyun Goo Woo
- Department of Physiology, Ajou University School of Medicine, Suwon, Republic of Korea
- Department of Biomedical Science, Graduate School, Ajou University, Suwon, Republic of Korea
| |
Collapse
|
24
|
Gupta S, Azadvari N, Hosseinzadeh P. Design of Protein Segments and Peptides for Binding to Protein Targets. BIODESIGN RESEARCH 2022; 2022:9783197. [PMID: 37850124 PMCID: PMC10521657 DOI: 10.34133/2022/9783197] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Accepted: 03/16/2022] [Indexed: 10/19/2023] Open
Abstract
Recent years have witnessed a rise in methods for accurate prediction of structure and design of novel functional proteins. Design of functional protein fragments and peptides occupy a small, albeit unique, space within the general field of protein design. While the smaller size of these peptides allows for more exhaustive computational methods, flexibility in their structure and sparsity of data compared to proteins, as well as presence of noncanonical building blocks, add additional challenges to their design. This review summarizes the current advances in the design of protein fragments and peptides for binding to targets and discusses the challenges in the field, with an eye toward future directions.
Collapse
Affiliation(s)
- Suchetana Gupta
- Knight Campus Center for Accelerating Scientific Impact, University of Oregon, Eugene OR 97403, USA
| | - Noora Azadvari
- Knight Campus Center for Accelerating Scientific Impact, University of Oregon, Eugene OR 97403, USA
| | - Parisa Hosseinzadeh
- Knight Campus Center for Accelerating Scientific Impact, University of Oregon, Eugene OR 97403, USA
| |
Collapse
|
25
|
Ozer EA, Simons LM, Adewumi OM, Fowotade AA, Omoruyi EC, Adeniji JA, Olayinka OA, Dean TJ, Zayas J, Bhimalli PP, Ash MK, Maiga AI, Somboro AM, Maiga M, Godzik A, Schneider JR, Mamede JI, Taiwo BO, Hultquist JF, Lorenzo-Redondo R. Multiple expansions of globally uncommon SARS-CoV-2 lineages in Nigeria. Nat Commun 2022; 13:688. [PMID: 35115515 PMCID: PMC8813984 DOI: 10.1038/s41467-022-28317-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 01/18/2022] [Indexed: 12/28/2022] Open
Abstract
Disparities in SARS-CoV-2 genomic surveillance have limited our understanding of the viral population dynamics and may delay identification of globally important variants. Despite being the most populated country in Africa, Nigeria has remained critically under sampled. Here, we report sequences from 378 SARS-CoV-2 isolates collected in Oyo State, Nigeria between July 2020 and August 2021. In early 2021, most isolates belonged to the Alpha "variant of concern" (VOC) or the Eta lineage. Eta outcompeted Alpha in Nigeria and across West Africa, persisting in the region even after expansion of an otherwise rare Delta sub-lineage. Spike protein from the Eta variant conferred increased infectivity and decreased neutralization by convalescent sera in vitro. Phylodynamic reconstructions suggest that Eta originated in West Africa before spreading globally and represented a VOC in early 2021. These results demonstrate a distinct distribution of SARS-CoV-2 lineages in Nigeria, and emphasize the need for improved genomic surveillance worldwide.
Collapse
Affiliation(s)
- Egon A Ozer
- Division of Infectious Diseases, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Center for Pathogen Genomics and Microbial Evolution, Northwestern University Havey Institute for Global Health, Chicago, IL, USA
| | - Lacy M Simons
- Division of Infectious Diseases, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Center for Pathogen Genomics and Microbial Evolution, Northwestern University Havey Institute for Global Health, Chicago, IL, USA
| | - Olubusuyi M Adewumi
- Department of Virology, College of Medicine, University of Ibadan, Ibadan, Nigeria
- Infectious Disease Institute, College of Medicine, University of Ibadan, Ibadan, Nigeria
| | - Adeola A Fowotade
- Infectious Disease Institute, College of Medicine, University of Ibadan, Ibadan, Nigeria
- Biorepository and Clinical Virology Laboratory, College of Medicine, University College Hospital, University of Ibadan, Ibadan, Nigeria
| | - Ewean C Omoruyi
- Biorepository and Clinical Virology Laboratory, College of Medicine, University College Hospital, University of Ibadan, Ibadan, Nigeria
| | - Johnson A Adeniji
- Department of Virology, College of Medicine, University of Ibadan, Ibadan, Nigeria
- Infectious Disease Institute, College of Medicine, University of Ibadan, Ibadan, Nigeria
| | - Oluseyi A Olayinka
- Department of Virology, College of Medicine, University of Ibadan, Ibadan, Nigeria
- Infectious Disease Institute, College of Medicine, University of Ibadan, Ibadan, Nigeria
| | - Taylor J Dean
- Division of Infectious Diseases, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Center for Pathogen Genomics and Microbial Evolution, Northwestern University Havey Institute for Global Health, Chicago, IL, USA
| | - Janet Zayas
- Department of Microbial Pathogens and Immunity, Rush University Medical Center, Chicago, IL, USA
| | - Pavan P Bhimalli
- Department of Microbial Pathogens and Immunity, Rush University Medical Center, Chicago, IL, USA
| | - Michelle K Ash
- Department of Microbial Pathogens and Immunity, Rush University Medical Center, Chicago, IL, USA
| | - Almoustapha I Maiga
- University Clinical Research Center (UCRC), University of Sciences, Techniques et Technologies of Bamako (USTTB), Bamako, Mali
| | - Anou M Somboro
- University Clinical Research Center (UCRC), University of Sciences, Techniques et Technologies of Bamako (USTTB), Bamako, Mali
- School of Laboratory Medicine and Medical Sciences, University of KwaZulu-Natal, Durban, South Africa
| | - Mamoudou Maiga
- University Clinical Research Center (UCRC), University of Sciences, Techniques et Technologies of Bamako (USTTB), Bamako, Mali
- Biomedical Engineering and Preventive Medicine Department, Northwestern University, Evanston, IL, USA
| | - Adam Godzik
- Biosciences Division, University of California Riverside School of Medicine, Riverside, CA, USA
| | - Jeffrey R Schneider
- Department of Microbial Pathogens and Immunity, Rush University Medical Center, Chicago, IL, USA
| | - João I Mamede
- Department of Microbial Pathogens and Immunity, Rush University Medical Center, Chicago, IL, USA
| | - Babafemi O Taiwo
- Division of Infectious Diseases, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Judd F Hultquist
- Division of Infectious Diseases, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.
- Center for Pathogen Genomics and Microbial Evolution, Northwestern University Havey Institute for Global Health, Chicago, IL, USA.
| | - Ramon Lorenzo-Redondo
- Division of Infectious Diseases, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.
- Center for Pathogen Genomics and Microbial Evolution, Northwestern University Havey Institute for Global Health, Chicago, IL, USA.
| |
Collapse
|
26
|
Merchant A, Tania VH, Baptiste M, Ehsan H, Kaneko G. Severe acute respiratory syndrome coronavirus-2: An era of struggle and discovery leading to the emergency use authorization of treatment and prevention measures based on computational analysis. COMPUTATIONAL APPROACHES FOR NOVEL THERAPEUTIC AND DIAGNOSTIC DESIGNING TO MITIGATE SARS-COV-2 INFECTION 2022. [PMCID: PMC9300482 DOI: 10.1016/b978-0-323-91172-6.00009-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Severe acute respiratory syndrome-Coronavirus-2 (SARS-CoV-2), a novel betacoronavirus, has surprised the world with its disease spread and mortality rate. SARS-CoV-2 is a positive-sense, enveloped RNA virus that can infect various organs of the body, potentially leading to multiple organ dysfunction and eventual death. While various medications have received emergency use authorizations (EUAs) for the treatment of Coronavirus disease-2019 (COVID-19), as of April 30, 2021, only one drug has been Food and Drug Administration (FDA)-approved: remdesivir. Currently, three vaccines have received EUAs in the United States, but none are FDA-approved. This shortage of treatments and prevention measures is extremely problematic. Thus computational approaches would provide important data about drug resistance and variants. Such data will be useful for the development of drugs and vaccines. This chapter is a synopsis of SARS-CoV-2 clinical presentation, COVID-19 symptomology, treatment, prevention mechanisms, and SARS-CoV-2 variants using computational analysis.
Collapse
|
27
|
Deep generative modeling for protein design. Curr Opin Struct Biol 2021; 72:226-236. [PMID: 34963082 DOI: 10.1016/j.sbi.2021.11.008] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 11/01/2021] [Accepted: 11/22/2021] [Indexed: 11/21/2022]
Abstract
Deep learning approaches have produced substantial breakthroughs in fields such as image classification and natural language processing and are making rapid inroads in the area of protein design. Many generative models of proteins have been developed that encompass all known protein sequences, model specific protein families, or extrapolate the dynamics of individual proteins. Those generative models can learn protein representations that are often more informative of protein structure and function than hand-engineered features. Furthermore, they can be used to quickly propose millions of novel proteins that resemble the native counterparts in terms of expression level, stability, or other attributes. The protein design process can further be guided by discriminative oracles to select candidates with the highest probability of having the desired properties. In this review, we discuss five classes of generative models that have been most successful at modeling proteins and provide a framework for model guided protein design.
Collapse
|
28
|
Woodard J, Zheng W, Zhang Y. Protein structural features predict responsiveness to pharmacological chaperone treatment for three lysosomal storage disorders. PLoS Comput Biol 2021; 17:e1009370. [PMID: 34529671 PMCID: PMC8478239 DOI: 10.1371/journal.pcbi.1009370] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 09/28/2021] [Accepted: 08/21/2021] [Indexed: 12/15/2022] Open
Abstract
Three-dimensional structures of proteins can provide important clues into the efficacy of personalized treatment. We perform a structural analysis of variants within three inherited lysosomal storage disorders, comparing variants responsive to pharmacological chaperone treatment to those unresponsive to such treatment. We find that predicted ΔΔG of mutation is higher on average for variants unresponsive to treatment, in the case of datasets for both Fabry disease and Pompe disease, in line with previous findings. Using both a single decision tree and an advanced machine learning approach based on the larger Fabry dataset, we correctly predict responsiveness of three Gaucher disease variants, and we provide predictions for untested variants. Many variants are predicted to be responsive to treatment, suggesting that drug-based treatments may be effective for a number of variants in Gaucher disease. In our analysis, we observe dependence on a topological feature reporting on contact arrangements which is likely connected to the order of folding of protein residues, and we provide a potential justification for this observation based on steady-state cellular kinetics. Pharmacological chaperones are small molecule drugs that bind to proteins to help stabilize the folded state. One set of diseases for which this treatment has been effective is the lysosomal storage disorders, which are caused by defective lysosomal enzymes. However, not all genotypes are equally responsive to treatment. For instance, missense mutants that are particularly destabilized relative to WT are less likely to respond. The availability of datasets containing responsiveness data for large numbers of mutants, along with crystal structures of the protein involved in each disease, make machine learning methods incorporating sequence-based and structural data feasible. We hypothesize that data from two diseases, Fabry and Pompe disease, may be useful for predicting responsiveness of variants in the related Gaucher disease. Results suggest that many rare variants in Gaucher disease could be amenable to existing drugs. Results also suggest that drug responsiveness depends on protein topology in such a way that mutations in early-to-fold residues are more likely to be non-responsive to pharmacological chaperone treatment, which is consistent with a simple kinetic model of stability rescue. This study provides an example of how machine learning can be used to inform further studies towards personalized treatment in medicine.
Collapse
Affiliation(s)
- Jaie Woodard
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Wei Zheng
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Biological Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
- * E-mail:
| |
Collapse
|
29
|
Basit A, Karim AM, Asif M, Ali T, Lee JH, Jeon JH, Rehman SU, Lee SH. Designing Short Peptides to Block the Interaction of SARS-CoV-2 and Human ACE2 for COVID-19 Therapeutics. Front Pharmacol 2021; 12:731828. [PMID: 34512357 PMCID: PMC8430035 DOI: 10.3389/fphar.2021.731828] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 08/17/2021] [Indexed: 12/18/2022] Open
Abstract
To date, the current COVID-19 pandemic caused by SARS-CoV-2 has infected 99.2 million while killed 2.2 million people throughout the world and is still spreading widely. The unavailability of potential therapeutics against this virus urges to search and develop new drugs. SARS-CoV-2 enters human cells by interacting with human angiotensin-converting enzyme 2 (ACE2) receptor expressed on human cell surface through utilizing receptor-binding domain (RBD) of its spike glycoprotein. The RBD is highly conserved and is also a potential target for blocking its interaction with human cell surface receptor. We designed short peptides on the basis of our previously reported truncated ACE2 (tACE2) for increasing the binding affinity as well as the binding interaction network with RBD. These peptides can selectively bind to RBD with much higher affinities than the cell surface receptor. Thus, these can block all the binding residues required for binding to cell surface receptor. We used selected amino acid regions (21–40 and 65–75) of ACE2 as scaffold for the de novo peptide design. Our designed peptide Pep1 showed interactions with RBD covering almost all of its binding residues with significantly higher binding affinity (−13.2 kcal mol−1) than the cell surface receptor. The molecular dynamics (MD) simulation results showed that designed peptides form a stabilized complex with RBD. We suggest that blocking the RBD through de novo designed peptides can serve as a potential candidate for COVID-19 treatment after further clinical investigations.
Collapse
Affiliation(s)
- Abdul Basit
- Institute of Microbiology and Molecular Genetics, University of the Punjab, Lahore, Pakistan
| | - Asad Mustafa Karim
- Department of Bioscience and Biotechnology, The University of Suwon, Hwaseong, South Korea
| | - Muhammad Asif
- Institute of Microbiology and Molecular Genetics, University of the Punjab, Lahore, Pakistan
| | - Tanveer Ali
- Department of Host Defense, Graduate School of Medicine, University of the Ryukyus, Nishihara, Japan
| | - Jung Hun Lee
- National Leading Research Laboratory, Department of Biological Sciences, Myongji University, Yongin, South Korea
| | - Jeong Ho Jeon
- National Leading Research Laboratory, Department of Biological Sciences, Myongji University, Yongin, South Korea
| | - Shafiq Ur Rehman
- Institute of Microbiology and Molecular Genetics, University of the Punjab, Lahore, Pakistan
| | - Sang Hee Lee
- National Leading Research Laboratory, Department of Biological Sciences, Myongji University, Yongin, South Korea
| |
Collapse
|
30
|
Karakulak T, Rifaioglu AS, Rodrigues JPGLM, Karaca E. Predicting the Specificity- Determining Positions of Receptor Tyrosine Kinase Axl. Front Mol Biosci 2021; 8:658906. [PMID: 34195226 PMCID: PMC8236827 DOI: 10.3389/fmolb.2021.658906] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 04/20/2021] [Indexed: 11/22/2022] Open
Abstract
Owing to its clinical significance, modulation of functionally relevant amino acids in protein-protein complexes has attracted a great deal of attention. To this end, many approaches have been proposed to predict the partner-selecting amino acid positions in evolutionarily close complexes. These approaches can be grouped into sequence-based machine learning and structure-based energy-driven methods. In this work, we assessed these methods’ ability to map the specificity-determining positions of Axl, a receptor tyrosine kinase involved in cancer progression and immune system diseases. For sequence-based predictions, we used SDPpred, Multi-RELIEF, and Sequence Harmony. For structure-based predictions, we utilized HADDOCK refinement and molecular dynamics simulations. As a result, we observed that (i) sequence-based methods overpredict partner-selecting residues of Axl and that (ii) combining Multi-RELIEF with HADDOCK-based predictions provides the key Axl residues, covered by the extensive molecular dynamics simulations. Expanding on these results, we propose that a sequence-structure-based approach is necessary to determine specificity-determining positions of Axl, which can guide the development of therapeutic molecules to combat Axl misregulation.
Collapse
Affiliation(s)
- Tülay Karakulak
- Izmir Biomedicine and Genome Center, Izmir, Turkey.,Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir, Turkey.,Institute of Molecular Life Sciences, University of Zurich, Zurich, Switzerland.,Department of Pathology and Molecular Pathology, University Hospital Zurich, Zurich, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Ahmet Sureyya Rifaioglu
- Department of Electrical - Electronics Engineering, İskenderun Technical University, Hatay, Turkey
| | - João P G L M Rodrigues
- Department of Structural Biology, Stanford University School of Medicine, Stanford, CA, United States
| | - Ezgi Karaca
- Izmir Biomedicine and Genome Center, Izmir, Turkey.,Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir, Turkey
| |
Collapse
|
31
|
Pearce R, Zhang Y. Deep learning techniques have significantly impacted protein structure prediction and protein design. Curr Opin Struct Biol 2021; 68:194-207. [PMID: 33639355 PMCID: PMC8222070 DOI: 10.1016/j.sbi.2021.01.007] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 01/09/2021] [Accepted: 01/18/2021] [Indexed: 12/26/2022]
Abstract
Protein structure prediction and design can be regarded as two inverse processes governed by the same folding principle. Although progress remained stagnant over the past two decades, the recent application of deep neural networks to spatial constraint prediction and end-to-end model training has significantly improved the accuracy of protein structure prediction, largely solving the problem at the fold level for single-domain proteins. The field of protein design has also witnessed dramatic improvement, where noticeable examples have shown that information stored in neural-network models can be used to advance functional protein design. Thus, incorporation of deep learning techniques into different steps of protein folding and design approaches represents an exciting future direction and should continue to have a transformative impact on both fields.
Collapse
Affiliation(s)
- Robin Pearce
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA; Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
32
|
Amengual-Rigo P, Fernández-Recio J, Guallar V. UEP: an open-source and fast classifier for predicting the impact of mutations in protein-protein complexes. Bioinformatics 2021; 37:334-341. [PMID: 32761082 DOI: 10.1093/bioinformatics/btaa708] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Revised: 07/23/2020] [Accepted: 07/31/2020] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Single protein residue mutations may reshape the binding affinity of protein-protein interactions. Therefore, predicting its effects is of great interest in biotechnology and biomedicine. Unfortunately, the availability of experimental data on binding affinity changes upon mutation is limited, which hampers the development of new and more precise algorithms. Here, we propose UEP, a classifier for predicting beneficial and detrimental mutations in protein-protein complexes trained on interactome data. RESULTS Regardless of the simplicity of the UEP algorithm, which is based on a simple three-body contact potential derived from interactome data, we report competitive results with the gold standard methods in this field with the advantage of being faster in terms of computational time. Moreover, we propose a consensus selection procedure by involving the combination of three predictors that showed higher classification accuracy in our benchmark: UEP, pyDock and EvoEF1/FoldX. Overall, we demonstrate that the analysis of interactome data allows predicting the impact of protein-protein mutations using UEP, a fast and reliable open-source code. AVAILABILITY AND IMPLEMENTATION UEP algorithm can be found at: https://github.com/pepamengual/UEP. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Pep Amengual-Rigo
- Department of Life Sciences, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
| | - Juan Fernández-Recio
- Department of Life Sciences, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain.,Instituto de Ciencias de la Vid y del Vino (ICVV), CSIC-Universidad de la Rioja-Gobierno de la Rioja, 26007 Logroño, Spain
| | - Victor Guallar
- Department of Life Sciences, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain.,ICREA: Institució Catalana de Recerca i Estudis Avançats, 08010 Barcelona, Spain
| |
Collapse
|
33
|
Frappier V, Keating AE. Data-driven computational protein design. Curr Opin Struct Biol 2021; 69:63-69. [PMID: 33910104 DOI: 10.1016/j.sbi.2021.03.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2020] [Revised: 03/18/2021] [Accepted: 03/19/2021] [Indexed: 01/28/2023]
Abstract
Computational protein design can generate proteins not found in nature that adopt desired structures and perform novel functions. Although proteins could, in theory, be designed with ab initio methods, practical success has come from using large amounts of data that describe the sequences, structures, and functions of existing proteins and their variants. We present recent creative uses of multiple-sequence alignments, protein structures, and high-throughput functional assays in computational protein design. Approaches range from enhancing structure-based design with experimental data to building regression models to training deep neural nets that generate novel sequences. Looking ahead, deep learning will be increasingly important for maximizing the value of data for protein design.
Collapse
Affiliation(s)
- Vincent Frappier
- Generate Biomedicines, 26 Landsdowne Street, Cambridge, MA, 02139, USA
| | - Amy E Keating
- MIT Departments of Biology and Biological Engineering, 77 Massachusetts Ave., Cambridge, MA, 02139, USA.
| |
Collapse
|
34
|
Chang J, Zhang C, Cheng H, Tan YW. Rational Design of Adenylate Kinase Thermostability through Coevolution and Sequence Divergence Analysis. Int J Mol Sci 2021; 22:2768. [PMID: 33803409 PMCID: PMC7967156 DOI: 10.3390/ijms22052768] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Revised: 03/04/2021] [Accepted: 03/05/2021] [Indexed: 01/09/2023] Open
Abstract
Protein engineering is actively pursued in industrial and laboratory settings for high thermostability. Among the many protein engineering methods, rational design by bioinformatics provides theoretical guidance without time-consuming experimental screenings. However, most rational design methods either rely on protein tertiary structure information or have limited accuracies. We proposed a primary-sequence-based algorithm for increasing the heat resistance of a protein while maintaining its functions. Using adenylate kinase (ADK) family as a model system, this method identified a series of amino acid sites closely related to thermostability. Single- and double-point mutants constructed based on this method increase the thermal denaturation temperature of the mesophilic Escherichia coli (E. coli) ADK by 5.5 and 8.3 °C, respectively, while preserving most of the catalytic function at ambient temperatures. Additionally, the constructed mutants have improved enzymatic activity at higher temperature.
Collapse
Affiliation(s)
- Jian Chang
- State Key Laboratory of Surface Physics, Multiscale Research Institute of Complex Systems, Department of Physics, Fudan University, Shanghai 200433, China; (J.C.); (H.C.)
| | - Chengxin Zhang
- School of Life Science, Fudan University, Shanghai 200433, China;
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Huaqiang Cheng
- State Key Laboratory of Surface Physics, Multiscale Research Institute of Complex Systems, Department of Physics, Fudan University, Shanghai 200433, China; (J.C.); (H.C.)
| | - Yan-Wen Tan
- State Key Laboratory of Surface Physics, Multiscale Research Institute of Complex Systems, Department of Physics, Fudan University, Shanghai 200433, China; (J.C.); (H.C.)
| |
Collapse
|
35
|
Kruglikov A, Rakesh M, Wei Y, Xia X. Applications of Protein Secondary Structure Algorithms in SARS-CoV-2 Research. J Proteome Res 2021; 20:1457-1463. [PMID: 33617253 PMCID: PMC7927282 DOI: 10.1021/acs.jproteome.0c00734] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2020] [Indexed: 01/25/2023]
Abstract
Since the outset of COVID-19, the pandemic has prompted immediate global efforts to sequence SARS-CoV-2, and over 450 000 complete genomes have been publicly deposited over the course of 12 months. Despite this, comparative nucleotide and amino acid sequence analyses often fall short in answering key questions in vaccine design. For example, the binding affinity between different ACE2 receptors and SARS-COV-2 spike protein cannot be fully explained by amino acid similarity at ACE2 contact sites because protein structure similarities are not fully reflected by amino acid sequence similarities. To comprehensively compare protein homology, secondary structure (SS) analysis is required. While protein structure is slow and difficult to obtain, SS predictions can be made rapidly, and a well-predicted SS structure may serve as a viable proxy to gain biological insight. Here we review algorithms and information used in predicting protein SS to highlight its potential application in pandemics research. We also showed examples of how SS predictions can be used to compare ACE2 proteins and to evaluate the zoonotic origins of viruses. As computational tools are much faster than wet-lab experiments, these applications can be important for research especially in times when quickly obtained biological insights can help in speeding up response to pandemics.
Collapse
Affiliation(s)
- Alibek Kruglikov
- Department
of Biology, University of Ottawa, Ottawa, Ontario K1N 6N5, Canada
| | - Mohan Rakesh
- Department
of Biology, University of Ottawa, Ottawa, Ontario K1N 6N5, Canada
| | - Yulong Wei
- Department
of Biology, University of Ottawa, Ottawa, Ontario K1N 6N5, Canada
| | - Xuhua Xia
- Department
of Biology, University of Ottawa, Ottawa, Ontario K1N 6N5, Canada
- Ottawa
Institute of Systems Biology, University
of Ottawa, Ottawa, Ontario K1N 6N5, Canada
| |
Collapse
|
36
|
Laniado J, Meador K, Yeates TO. A fragment-based protein interface design algorithm for symmetric assemblies. Protein Eng Des Sel 2021; 34:gzab008. [PMID: 33955480 PMCID: PMC8101011 DOI: 10.1093/protein/gzab008] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 03/08/2021] [Indexed: 11/13/2022] Open
Abstract
Theoretical and experimental advances in protein engineering have led to the creation of precisely defined, novel protein assemblies of great size and complexity, with diverse applications. One powerful approach involves designing a new attachment or binding interface between two simpler symmetric oligomeric protein components. The required methods of design, which present both similarities and key differences compared to problems in protein docking, remain challenging and are not yet routine. With the aim of more fully enabling this emerging area of protein material engineering, we developed a computer program, nanohedra, to introduce two key advances. First, we encoded in the program the construction rules (i.e. the search space parameters) that underlie all possible symmetric material constructions. Second, we developed algorithms for rapidly identifying favorable docking/interface arrangements based on tabulations of empirical patterns of known protein fragment-pair associations. As a result, the candidate poses that nanohedra generates for subsequent amino acid interface design appear highly native-like (at the protein backbone level), while simultaneously conforming to the exacting requirements for symmetry-based assembly. A retrospective computational analysis of successful vs failed experimental studies supports the expectation that this should improve the success rate for this challenging area of protein engineering.
Collapse
Affiliation(s)
- Joshua Laniado
- UCLA Molecular Biology Institute, Los Angeles, CA 90095, USA
| | - Kyle Meador
- UCLA Department of Chemistry and Biochemistry, Los Angeles, CA 90095, USA
| | - Todd O Yeates
- UCLA Molecular Biology Institute, Los Angeles, CA 90095, USA
- UCLA Department of Chemistry and Biochemistry, Los Angeles, CA 90095, USA
- UCLA DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA
| |
Collapse
|
37
|
ADDRESS: A Database of Disease-associated Human Variants Incorporating Protein Structure and Folding Stabilities. J Mol Biol 2021; 433:166840. [PMID: 33539887 DOI: 10.1016/j.jmb.2021.166840] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 01/17/2021] [Accepted: 01/20/2021] [Indexed: 11/22/2022]
Abstract
Numerous human diseases are caused by mutations in genomic sequences. Since amino acid changes affect protein function through mechanisms often predictable from protein structure, the integration of structural and sequence data enables us to estimate with greater accuracy whether and how a given mutation will lead to disease. Publicly available annotated databases enable hypothesis assessment and benchmarking of prediction tools. However, the results are often presented as summary statistics or black box predictors, without providing full descriptive information. We developed a new semi-manually curated human variant database presenting information on the protein contact-map, sequence-to-structure mapping, amino acid identity change, and stability prediction for the popular UniProt database. We found that the profiles of pathogenic and benign missense polymorphisms can be effectively deduced using decision trees and comparative analyses based on the presented dataset. The database is made publicly available through https://zhanglab.ccmb.med.umich.edu/ADDRESS.
Collapse
|
38
|
Badhe Y, Gupta R, Rai B. In silico design of peptides with binding to the receptor binding domain (RBD) of the SARS-CoV-2 and their utility in bio-sensor development for SARS-CoV-2 detection. RSC Adv 2021; 11:3816-3826. [PMID: 35424358 PMCID: PMC8694220 DOI: 10.1039/d0ra09123e] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Accepted: 01/13/2021] [Indexed: 12/23/2022] Open
Abstract
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has infected millions of people across the globe and created not only a health emergency but also a financial crisis. This virus attacks the angiotensin-converting enzyme 2 (ACE2) receptor situated on the surface of the host cell membrane. The spike protein of the virus binds to this receptor which is a critical step in infection. A molecule which can specifically stop this binding could be a potential therapeutic agent. In this study, we have tested 12 potential peptides which can bind to the receptor binding domain (RBD) of the spike protein of the virus and thus can potentially inhibit the binding of the latter on ACE2 receptors. These peptides are screened based on their binding with the RBD of the spike protein and aqueous stability, obtained using several atomistic molecular dynamic simulations. The potential of mean force calculation of peptides confirmed their binding to the RBD of the spike protein. Furthermore, two potential peptides were tested for use in a biosensing application for SARS-CoV-2 detection. Two types of biosensing platforms, a graphene sheet and a carbon nano tube (CNT) were tested. The peptides were modified in order to functionalize the graphene and CNT. Based on the interaction between the substrate, peptide and spike protein, the utility of the screened peptide for a given bio sensing platform is discussed and recommended. The protocol for peptide design and testing for its usage as a sensor.![]()
Collapse
Affiliation(s)
- Yogesh Badhe
- Physical Science Research Area
- Tata Research Development and Design Centre
- TCS Research
- Tata Consultancy Services
- Pune-411013
| | - Rakesh Gupta
- Physical Science Research Area
- Tata Research Development and Design Centre
- TCS Research
- Tata Consultancy Services
- Pune-411013
| | - Beena Rai
- Physical Science Research Area
- Tata Research Development and Design Centre
- TCS Research
- Tata Consultancy Services
- Pune-411013
| |
Collapse
|
39
|
Ong E, Huang X, Pearce R, Zhang Y, He Y. Computational design of SARS-CoV-2 spike glycoproteins to increase immunogenicity by T cell epitope engineering. Comput Struct Biotechnol J 2020; 19:518-529. [PMID: 33398234 PMCID: PMC7773544 DOI: 10.1016/j.csbj.2020.12.039] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Revised: 12/24/2020] [Accepted: 12/24/2020] [Indexed: 01/12/2023] Open
Abstract
The development of effective and safe vaccines is the ultimate way to efficiently stop the ongoing COVID-19 pandemic, which is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Built on the fact that SARS-CoV-2 utilizes the association of its Spike (S) protein with the human angiotensin-converting enzyme 2 (ACE2) receptor to invade host cells, we computationally redesigned the S protein sequence to improve its immunogenicity and antigenicity. Toward this purpose, we extended an evolutionary protein design algorithm, EvoDesign, to create thousands of stable S protein variants that perturb the core protein sequence but keep the surface conformation and B cell epitopes. The T cell epitope content and similarity scores of the perturbed sequences were calculated and evaluated. Out of 22,914 designs with favorable stability energy, 301 candidates contained at least two pre-existing immunity-related epitopes and had promising immunogenic potential. The benchmark tests showed that, although the epitope restraints were not included in the scoring function of EvoDesign, the top S protein design successfully recovered 31 out of the 32 major histocompatibility complex (MHC)-II T cell promiscuous epitopes in the native S protein, where two epitopes were present in all seven human coronaviruses. Moreover, the newly designed S protein introduced nine new MHC-II T cell promiscuous epitopes that do not exist in the wildtype SARS-CoV-2. These results demonstrated a new and effective avenue to enhance a target protein's immunogenicity using rational protein design, which could be applied for new vaccine design against COVID-19 and other pathogens.
Collapse
Affiliation(s)
- Edison Ong
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Xiaoqiang Huang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yongqun He
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
40
|
Xu G, Wang Q, Ma J. OPUS-Rota3: Improving Protein Side-Chain Modeling by Deep Neural Networks and Ensemble Methods. J Chem Inf Model 2020; 60:6691-6697. [PMID: 33211480 DOI: 10.1021/acs.jcim.0c00951] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Side-chain modeling is critical for protein structure prediction since the uniqueness of the protein structure is largely determined by its side-chain packing conformation. In this paper, differing from most approaches that rely on rotamer library sampling, we first propose a novel side-chain rotamer prediction method based on deep neural networks, named OPUS-RotaNN. Then, on the basis of our previous work OPUS-Rota2, we propose an open-source side-chain modeling framework, OPUS-Rota3, which integrates the results of different methods into its rotamer library as the sampling candidates. By including OPUS-RotaNN into OPUS-Rota3, we conduct our experiments on three native backbone test sets and one non-native backbone test set. On the native backbone test set, CAMEO-Hard61 for example, OPUS-Rota3 successfully predicts 51.14% of all side-chain dihedral angles with a tolerance criterion of 20° and outperforms OSCAR-star (50.87%), SCWRL4 (50.40%), and FASPR (49.85%). On the non-native backbone test set DB379-ITASSER, the accuracy of OPUS-Rota3 is 52.49%, better than OSCAR-star (48.95%), FASPR (48.69%), and SCWRL4 (48.29%). All the source codes including the training codes and the data we used are available at https://github.com/thuxugang/opus_rota3.
Collapse
Affiliation(s)
- Gang Xu
- Multiscale Research Institute of Complex Systems, Fudan University, Shanghai 200433, China
| | - Qinghua Wang
- Verna and Marrs Mclean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, BCM-125, Houston, Texas 77030, United States
| | - Jianpeng Ma
- Multiscale Research Institute of Complex Systems, Fudan University, Shanghai 200433, China.,Verna and Marrs Mclean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, BCM-125, Houston, Texas 77030, United States.,Department of Bioengineering, Rice University, Houston, Texas 77005, United States
| |
Collapse
|
41
|
Huang X, Pearce R, Zhang Y. FASPR: an open-source tool for fast and accurate protein side-chain packing. Bioinformatics 2020; 36:3758-3765. [PMID: 32259206 DOI: 10.1093/bioinformatics/btaa234] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2020] [Revised: 03/30/2020] [Accepted: 04/01/2020] [Indexed: 01/04/2023] Open
Abstract
MOTIVATION Protein structure and function are essentially determined by how the side-chain atoms interact with each other. Thus, accurate protein side-chain packing (PSCP) is a critical step toward protein structure prediction and protein design. Despite the importance of the problem, however, the accuracy and speed of current PSCP programs are still not satisfactory. RESULTS We present FASPR for fast and accurate PSCP by using an optimized scoring function in combination with a deterministic searching algorithm. The performance of FASPR was compared with four state-of-the-art PSCP methods (CISRR, RASP, SCATD and SCWRL4) on both native and non-native protein backbones. For the assessment on native backbones, FASPR achieved a good performance by correctly predicting 69.1% of all the side-chain dihedral angles using a stringent tolerance criterion of 20°, compared favorably with SCWRL4, CISRR, RASP and SCATD which successfully predicted 68.8%, 68.6%, 67.8% and 61.7%, respectively. Additionally, FASPR achieved the highest speed for packing the 379 test protein structures in only 34.3 s, which was significantly faster than the control methods. For the assessment on non-native backbones, FASPR showed an equivalent or better performance on I-TASSER predicted backbones and the backbones perturbed from experimental structures. Detailed analyses showed that the major advantage of FASPR lies in the optimal combination of the dead-end elimination and tree decomposition with a well optimized scoring function, which makes FASPR of practical use for both protein structure modeling and protein design studies. AVAILABILITY AND IMPLEMENTATION The web server, source code and datasets are freely available at https://zhanglab.ccmb.med.umich.edu/FASPR and https://github.com/tommyhuangthu/FASPR. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics.,Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
42
|
Huang X, Zheng W, Pearce R, Zhang Y. SSIPe: accurately estimating protein-protein binding affinity change upon mutations using evolutionary profiles in combination with an optimized physical energy function. Bioinformatics 2020; 36:2429-2437. [PMID: 31830252 DOI: 10.1093/bioinformatics/btz926] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2019] [Revised: 11/08/2019] [Accepted: 12/09/2019] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Most proteins perform their biological functions through interactions with other proteins in cells. Amino acid mutations, especially those occurring at protein interfaces, can change the stability of protein-protein interactions (PPIs) and impact their functions, which may cause various human diseases. Quantitative estimation of the binding affinity changes (ΔΔGbind) caused by mutations can provide critical information for protein function annotation and genetic disease diagnoses. RESULTS We present SSIPe, which combines protein interface profiles, collected from structural and sequence homology searches, with a physics-based energy function for accurate ΔΔGbind estimation. To offset the statistical limits of the PPI structure and sequence databases, amino acid-specific pseudocounts were introduced to enhance the profile accuracy. SSIPe was evaluated on large-scale experimental data containing 2204 mutations from 177 proteins, where training and test datasets were stringently separated with the sequence identity between proteins from the two datasets below 30%. The Pearson correlation coefficient between estimated and experimental ΔΔGbind was 0.61 with a root-mean-square-error of 1.93 kcal/mol, which was significantly better than the other methods. Detailed data analyses revealed that the major advantage of SSIPe over other traditional approaches lies in the novel combination of the physical energy function with the new knowledge-based interface profile. SSIPe also considerably outperformed a former profile-based method (BindProfX) due to the newly introduced sequence profiles and optimized pseudocount technique that allows for consideration of amino acid-specific prior mutation probabilities. AVAILABILITY AND IMPLEMENTATION Web-server/standalone program, source code and datasets are freely available at https://zhanglab.ccmb.med.umich.edu/SSIPe and https://github.com/tommyhuangthu/SSIPe. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Wei Zheng
- Department of Computational Medicine and Bioinformatics
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics.,Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
43
|
Characterization of the adipogenic protein E4orf1 from adenovirus 36 through an in silico approach. J Mol Model 2020; 26:285. [PMID: 32978703 DOI: 10.1007/s00894-020-04531-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2019] [Accepted: 09/03/2020] [Indexed: 10/23/2022]
Abstract
Adenovirus 36 (Ad-36) is related to human obesity due to its adipogenic activity mediated by the early 4 open reading frame 1 (E4orf1) protein. Mechanisms underlying the adipogenic effect of E4orf1 are not completely understood; however, the proliferation and differentiation of fat cells are increased through the activation of the phosphatidyl inositol 3 kinase pathway by binding proteins containing PDZ domain. This study characterized E4orf1 tridimensional structure and analyzed its interactions with PDZ domain-containing proteins in order to provide new information about the behavior of this viral protein and its targets, which could provide an interesting druggable target for obesity-related cardiometabolic alterations. In silico strategies such as homology modeling, docking, and molecular dynamics (MD) were used to study the interaction of E4orf1 with five PDZ domains of disk large homolog 1 (PDZ-1 and PDZ-2), membrane-associated guanylate kinase 1 (PDZ-3), and multi-PDZ domain protein 1 (PDZ-7 and PDZ-10). Mutagenesis analysis of selected residues was performed to evaluate their effects on the stabilization of E4orf1:PDZ complexes. MD simulations showed that the E4orf1:PDZ10 complex was more stable than the others ones. The highly hydrophobic residues at the C-terminal region (114-125) of the E4orf1 are essential in the initial phase stabilization of the complexes. Moreover, the residues 80-85 in the core region contribute to longer stabilization of the E4orf1:PDZ10 complex, a result that was confirmed by in silico mutagenesis. In conclusion, E4orf1 forms a stable complex with PDZ10 domain, and the residues 80-85 are of particular importance. The characterization of E4orf1 interactions with PDZ domains provides an initial approach to discover druggable targets for Ad-36-induced obesity.
Collapse
|
44
|
Huang X, Pearce R, Zhang Y. EvoEF2: accurate and fast energy function for computational protein design. Bioinformatics 2020; 36:1135-1142. [PMID: 31588495 DOI: 10.1093/bioinformatics/btz740] [Citation(s) in RCA: 62] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2019] [Revised: 09/19/2019] [Accepted: 09/25/2019] [Indexed: 01/26/2023] Open
Abstract
MOTIVATION The accuracy and success rate of de novo protein design remain limited, mainly due to the parameter over-fitting of current energy functions and their inability to discriminate incorrect designs from correct designs. RESULTS We developed an extended energy function, EvoEF2, for efficient de novo protein sequence design, based on a previously proposed physical energy function, EvoEF. Remarkably, EvoEF2 recovered 32.5%, 47.9% and 22.3% of all, core and surface residues for 148 test monomers, and was generally applicable to protein-protein interaction design, as it recapitulated 30.9%, 42.4%, 31.3% and 21.4% of all, core, interface and surface residues for 88 test dimers, significantly outperforming EvoEF on the native sequence recapitulation. We further used I-TASSER to evaluate the foldability of the 148 designed monomer sequences, where all of them were predicted to fold into structures with high fold- and atomic-level similarity to their corresponding native structures, as demonstrated by the fact that 87.8% of the predicted structures shared a root-mean-square-deviation less than 2 Å to their native counterparts. The study also demonstrated that the usefulness of physical energy functions is highly correlated with the parameter optimization processes, and EvoEF2, with parameters optimized using sequence recapitulation, is more suitable for computational protein sequence design than EvoEF, which was optimized on thermodynamic mutation data. AVAILABILITY AND IMPLEMENTATION The source code of EvoEF2 and the benchmark datasets are freely available at https://zhanglab.ccmb.med.umich.edu/EvoEF. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Xiaoqiang Huang
- Department of Computational Medicine and Bioinformatics, MI 48109, USA
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics, MI 48109, USA
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, MI 48109, USA.,Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
45
|
Huang Q, Wang K, Li H, Yi S, Zhao X. Enhancing cellulosic ethanol production through coevolution of multiple enzymatic characteristics of β-glucosidase from Penicillium oxalicum 16. Appl Microbiol Biotechnol 2020; 104:8299-8308. [PMID: 32857198 DOI: 10.1007/s00253-020-10858-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Revised: 08/04/2020] [Accepted: 08/23/2020] [Indexed: 01/04/2023]
Abstract
In previous studies, we isolated a novel β-glucosidase from Penicillium oxalicum 16 (16BGL), which is useful for producing cellulosic ethanol. However, 16BGL has a relatively low enzyme activity and product tolerance; besides, a huge gap exists between the optimum temperature of 16BGL (70 °C) and the fermentation temperature for producing cellulosic ethanol (40 °C). Here, we present a directed evolution-based study, which combines one-round error-prone PCR with three rounds of high-throughput screening, for coevolving multiple enzymatic characteristics of 16BGL. We identified an improved variant Y-1-B1 with a triple mutant (G414S/D421V/T441S). Y-1-B1 had an optimum temperature of 50 °C, much closer to the fermentation temperature. The catalytic efficiency of Y-1-B1 for hydrolyzing pNPG was 1355 mM-1 s-1 at 50 °C and pH 5, significantly higher than that of 16BGL (807 mM-1 s-1). Y-1-B1 also achieved a slightly reduced strength of product inhibition of 1.1 at a glucose concentration of 20 mM, compared with the ratio of 1.3 for 16BGL. A maximum titer of 6.9 g/L for ethanol production was achieved in the reaction with Y-1-B1, which was 22% higher than that achieved with 16BGL. Structure modeling revealed that the mutations are distant from the active-site pocket. Therefore, we performed molecular dynamics (MD) simulations to understand why these mutations can improve catalytic efficiency. MD simulation revealed that the nucleophilic residue Asp261 had a much closer contact with the glucosidic center of pNPG in the simulation with Y-1-B1 than that with 16BGL, suggesting that the mutant is more favorable for catalysis. KEY POINTS: • Multiple enzymatic properties of Penicillium oxalicum 16 BGL were coevolved. • A catalytically efficient triple mutant G414S/D421V/T441S was reported. • Molecular dynamics simulation supported the enhanced catalytic activity.
Collapse
Affiliation(s)
- Qiuxia Huang
- College of Life Science, Jiangxi Normal University, Nanchang, 330022, China
| | - Kexin Wang
- College of Life Science, Jiangxi Normal University, Nanchang, 330022, China
| | - Hanxin Li
- College of Life Science, Jiangxi Normal University, Nanchang, 330022, China
| | - Shi Yi
- College of Life Science, Jiangxi Normal University, Nanchang, 330022, China
| | - Xihua Zhao
- College of Life Science, Jiangxi Normal University, Nanchang, 330022, China.
| |
Collapse
|
46
|
Ong E, Huang X, Pearce R, Zhang Y, He Y. Rational Design of SARS-CoV-2 Spike Glycoproteins To Increase Immunogenicity By T Cell Epitope Engineering. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020:2020.08.14.251496. [PMID: 32817949 PMCID: PMC7430581 DOI: 10.1101/2020.08.14.251496] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The current COVID-19 pandemic caused by SARS-CoV-2 has resulted in millions of confirmed cases and thousands of deaths globally. Extensive efforts and progress have been made to develop effective and safe vaccines against COVID-19. A primary target of these vaccines is the SARS-CoV-2 spike (S) protein, and many studies utilized structural vaccinology techniques to either stabilize the protein or fix the receptor-binding domain at certain states. In this study, we extended an evolutionary protein design algorithm, EvoDesign, to create thousands of stable S protein variants without perturbing the surface conformation and B cell epitopes of the S protein. We then evaluated the mutated S protein candidates based on predicted MHC-II T cell promiscuous epitopes as well as the epitopes' similarity to human peptides. The presented strategy aims to improve the S protein's immunogenicity and antigenicity by inducing stronger CD4 T cell response while maintaining the protein's native structure and function. The top EvoDesign S protein candidate (Design-10705) recovered 31 out of 32 MHC-II T cell promiscuous epitopes in the native S protein, in which two epitopes were present in all seven human coronaviruses. This newly designed S protein also introduced nine new MHC-II T cell promiscuous epitopes and showed high structural similarity to its native conformation. The proposed structural vaccinology method provides an avenue to rationally design the antigen's structure with increased immunogenicity, which could be applied to the rational design of new COVID-19 vaccine candidates.
Collapse
Affiliation(s)
- Edison Ong
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Xiaoqiang Huang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yongqun He
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
47
|
Tu Z, Huang X, Fu J, Hu N, Zheng W, Li Y, Zhang Y. Landscape of variable domain of heavy-chain-only antibody repertoire from alpaca. Immunology 2020; 161:53-65. [PMID: 32506493 DOI: 10.1111/imm.13224] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 05/18/2020] [Accepted: 05/19/2020] [Indexed: 01/05/2023] Open
Abstract
Heavy-chain-only antibodies (HCAbs), which are devoid of light chains, have been found naturally occurring in various species including camelids and cartilaginous fish. Because of their high thermostability, refoldability and capacity for cell permeation, the variable regions of the heavy chain of HCAbs (VHHs) have been widely used in diagnosis, bio-imaging, food safety and therapeutics. Most immunogenetic and functional studies of HCAbs are based on case studies or a limited number of low-throughput sequencing data. A complete picture derived from more abundant high-throughput sequencing (HTS) data can help us gain deeper insights. We cloned and sequenced the full-length coding region of VHHs in Alpaca (Vicugna pacos) via HTS in this study. A new pipeline was developed to conduct an in-depth analysis of the HCAb repertoires. Various critical features, including the length distribution of complementarity-determining region 3 (CDR3), V(D)J usage, VJ pairing, germline-specific mutation rate and germline-specific scoring profiles (GSSPs), were systematically characterized. The quantitative data show that V(D)J usage and VHH recombination are highly biased. Interestingly, we found that the average CDR3 length of classical VHHs is longer than that of non-classical ones, whereas the mutation rates are similar in both kinds of VHHs. Finally, GSSPs were built to quantitatively describe and compare sequences that originate from each VJ pair. Overall, this study presents a comprehensive landscape of the HCAb repertoire, which can provide useful guidance for the modeling of somatic hypermutation and the design of novel functional VHHs or VHH repertoires via evolutionary profiles.
Collapse
Affiliation(s)
- Zhui Tu
- State Key Laboratory of Food Science and Technology, Nanchang University, Nanchang, China.,Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.,Department of Biological Chemistry, University of Michigan, Ann Arbor, MI, USA.,Jiangxi Province Key Laboratory of Modern Analytical Science, Nanchang University, Nanchang, China
| | - Xiaoqiang Huang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Jinheng Fu
- State Key Laboratory of Food Science and Technology, Nanchang University, Nanchang, China.,Jiangxi-OAI Joint Research Institution, Nanchang University, Nanchang, China
| | - Na Hu
- State Key Laboratory of Food Science and Technology, Nanchang University, Nanchang, China.,Jiangxi Province Key Laboratory of Modern Analytical Science, Nanchang University, Nanchang, China.,Maternal and Child Medical Research Institute, Shenzhen Maternity and Child Healthcare Hospital, Southern Medical University, Shenzhen, China
| | - Wei Zheng
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Yanping Li
- State Key Laboratory of Food Science and Technology, Nanchang University, Nanchang, China.,Jiangxi Province Key Laboratory of Modern Analytical Science, Nanchang University, Nanchang, China.,Jiangxi-OAI Joint Research Institution, Nanchang University, Nanchang, China
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.,Department of Biological Chemistry, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
48
|
Huang X, Pearce R, Zhang Y. De novo design of protein peptides to block association of the SARS-CoV-2 spike protein with human ACE2. Aging (Albany NY) 2020; 12:11263-11276. [PMID: 32544884 PMCID: PMC7343451 DOI: 10.18632/aging.103416] [Citation(s) in RCA: 79] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Accepted: 05/25/2020] [Indexed: 12/15/2022]
Abstract
The outbreak of COVID-19 has now become a global pandemic that has severely impacted lives and economic stability. There is, however, no effective antiviral drug that can be used to treat COVID-19 to date. Built on the fact that SARS-CoV-2 initiates its entry into human cells by the receptor binding domain (RBD) of its spike protein binding to the angiotensin-converting enzyme 2 (hACE2), we extended a recently developed approach, EvoDesign, to design multiple peptide sequences that can competitively bind to the SARS-CoV-2 RBD to inhibit the virus from entering human cells. The protocol starts with the construction of a hybrid peptidic scaffold by linking two fragments grafted from the interface of the hACE2 protein (a.a. 22-44 and 351-357) with a linker glycine, which is followed by the redesign and refinement simulations of the peptide sequence to optimize its binding affinity to the interface of the SARS-CoV-2 RBD. The binding experiment analyses showed that the designed peptides exhibited a significantly stronger binding potency to hACE2 than the wild-type hACE2 receptor (with -53.35 vs. -46.46 EvoEF2 energy unit scores for the top designed and wild-type peptides, respectively). This study demonstrates a new avenue to utilize computationally designed peptide motifs to treat the COVID-19 disease by blocking the critical spike-RBD and hACE2 interactions.
Collapse
Affiliation(s)
- Xiaoqiang Huang
- Department of Computational Medicine and Bioinformatics, Ann Arbor, MI 48109, USA
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics, Ann Arbor, MI 48109, USA
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, Ann Arbor, MI 48109, USA.,Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
49
|
Kunstmann S, Engström O, Wehle M, Widmalm G, Santer M, Barbirz S. Increasing the Affinity of an O-Antigen Polysaccharide Binding Site in Shigella flexneri Bacteriophage Sf6 Tailspike Protein. Chemistry 2020; 26:7263-7273. [PMID: 32189378 PMCID: PMC7463171 DOI: 10.1002/chem.202000495] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Revised: 03/09/2020] [Indexed: 12/30/2022]
Abstract
Broad and unspecific use of antibiotics accelerates spread of resistances. Sensitive and robust pathogen detection is thus important for a more targeted application. Bacteriophages contain a large repertoire of pathogen-binding proteins. These tailspike proteins (TSP) often bind surface glycans and represent a promising design platform for specific pathogen sensors. We analysed bacteriophage Sf6 TSP that recognizes the O-polysaccharide of dysentery-causing Shigella flexneri to develop variants with increased sensitivity for sensor applications. Ligand polyrhamnose backbone conformations were obtained from 2D 1 H,1 H-trNOESY NMR utilizing methine-methine and methine-methyl correlations. They agreed well with conformations obtained from molecular dynamics (MD), validating the method for further predictions. In a set of mutants, MD predicted ligand flexibilities that were in good correlation with binding strength as confirmed on immobilized S. flexneri O-polysaccharide (PS) with surface plasmon resonance. In silico approaches combined with rapid screening on PS surfaces hence provide valuable strategies for TSP-based pathogen sensor design.
Collapse
Affiliation(s)
- Sonja Kunstmann
- Physikalische BiochemieUniversität PotsdamKarl-Liebknecht-Str. 24–2514476PotsdamGermany
- Theory and BiosystemsMax Planck Institute of Colloids and InterfacesAm Mühlenberg 114476PotsdamGermany
- Current address: Department of Biotechnology and BiomedicineTechnical University of DenmarkSøltofts Plads2800 Kgs.LyngbyDenmark
| | - Olof Engström
- Department of Organic ChemistryArrhenius LaboratoryStockholm University10691StockholmSweden
| | - Marko Wehle
- Theory and BiosystemsMax Planck Institute of Colloids and InterfacesAm Mühlenberg 114476PotsdamGermany
| | - Göran Widmalm
- Department of Organic ChemistryArrhenius LaboratoryStockholm University10691StockholmSweden
| | - Mark Santer
- Theory and BiosystemsMax Planck Institute of Colloids and InterfacesAm Mühlenberg 114476PotsdamGermany
| | - Stefanie Barbirz
- Physikalische BiochemieUniversität PotsdamKarl-Liebknecht-Str. 24–2514476PotsdamGermany
| |
Collapse
|
50
|
Qi Y, Zhang JZH. DenseCPD: Improving the Accuracy of Neural-Network-Based Computational Protein Sequence Design with DenseNet. J Chem Inf Model 2020; 60:1245-1252. [DOI: 10.1021/acs.jcim.0c00043] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Affiliation(s)
- Yifei Qi
- Shanghai Engineering Research Center of Molecular Therapeutics & New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
- NYU−ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| | - John Z. H. Zhang
- Shanghai Engineering Research Center of Molecular Therapeutics & New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
- NYU−ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Department of Chemistry, New York University, New York, New York 10003, United States
| |
Collapse
|