1
|
Lin YJ, Menon AS, Hu Z, Brenner SE. Variant Impact Predictor database (VIPdb), version 2: Trends from 25 years of genetic variant impact predictors. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.25.600283. [PMID: 38979289 PMCID: PMC11230257 DOI: 10.1101/2024.06.25.600283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Background Variant interpretation is essential for identifying patients' disease-causing genetic variants amongst the millions detected in their genomes. Hundreds of Variant Impact Predictors (VIPs), also known as Variant Effect Predictors (VEPs), have been developed for this purpose, with a variety of methodologies and goals. To facilitate the exploration of available VIP options, we have created the Variant Impact Predictor database (VIPdb). Results The Variant Impact Predictor database (VIPdb) version 2 presents a collection of VIPs developed over the past 25 years, summarizing their characteristics, ClinGen calibrated scores, CAGI assessment results, publication details, access information, and citation patterns. We previously summarized 217 VIPs and their features in VIPdb in 2019. Building upon this foundation, we identified and categorized an additional 186 VIPs, resulting in a total of 403 VIPs in VIPdb version 2. The majority of the VIPs have the capacity to predict the impacts of single nucleotide variants and nonsynonymous variants. More VIPs tailored to predict the impacts of insertions and deletions have been developed since the 2010s. In contrast, relatively few VIPs are dedicated to the prediction of splicing, structural, synonymous, and regulatory variants. The increasing rate of citations to VIPs reflects the ongoing growth in their use, and the evolving trends in citations reveal development in the field and individual methods. Conclusions VIPdb version 2 summarizes 403 VIPs and their features, potentially facilitating VIP exploration for various variant interpretation applications. Availability VIPdb version 2 is available at https://genomeinterpretation.org/vipdb.
Collapse
Affiliation(s)
- Yu-Jen Lin
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
- Center for Computational Biology, University of California, Berkeley, California 94720, USA
| | - Arul S. Menon
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
- College of Computing, Data Science, and Society, University of California, Berkeley, California 94720, USA
| | - Zhiqiang Hu
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA
- Currently at: Illumina, Foster City, California 94404, USA
| | - Steven E. Brenner
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
- Center for Computational Biology, University of California, Berkeley, California 94720, USA
- College of Computing, Data Science, and Society, University of California, Berkeley, California 94720, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA
| |
Collapse
|
2
|
Gurusinghe SNS, Wu Y, DeGrado W, Shifman JM. ProBASS - a language model with sequence and structural features for predicting the effect of mutations on binding affinity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.21.600041. [PMID: 38979193 PMCID: PMC11230163 DOI: 10.1101/2024.06.21.600041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Protein-protein interactions (PPIs) govern virtually all cellular processes. Even a single mutation within PPI can significantly influence overall protein functionality and potentially lead to various types of diseases. To date, numerous approaches have emerged for predicting the change in free energy of binding (ΔΔGbind) resulting from mutations, yet the majority of these methods lack precision. In recent years, protein language models (PLMs) have been developed and shown powerful predictive capabilities by leveraging both sequence and structural data from protein-protein complexes. Yet, PLMs have not been optimized specifically for predicting ΔΔGbind. We developed an approach to predict effects of mutations on PPI binding affinity based on two most advanced protein language models ESM2 and ESM-IF1 that incorporate PPI sequence and structural features, respectively. We used the two models to generate embeddings for each PPI mutant and subsequently fine-tuned our model by training on a large dataset of experimental ΔΔGbind values. Our model, ProBASS (Protein Binding Affinity from Structure and Sequence) achieved a correlation with experimental ΔΔGbind values of 0.83 ± 0.05 for single mutations and 0.69 ± 0.04 for double mutations when model training and testing was done on the same PDB. Moreover, ProBASS exhibited very high correlation (0.81 ± 0.02) between prediction and experiment when training and testing was performed on a dataset containing 2325 single mutations in 132 PPIs. ProBASS surpasses the state-of-the-art methods in correlation with experimental data and could be further trained as more experimental data becomes available. Our results demonstrate that the integration of extensive datasets containing ΔΔGbind values across multiple PPIs to refine the pre-trained PLMs represents a successful approach for achieving a precise and broadly applicable model for ΔΔGbind prediction, greatly facilitating future protein engineering and design studies.
Collapse
Affiliation(s)
- Sagara N S Gurusinghe
- Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Yibing Wu
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California San Francisco, CA, USA
| | - William DeGrado
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California San Francisco, CA, USA
| | - Julia M Shifman
- Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| |
Collapse
|
3
|
Wang L, Zhang M, Teng H, Wang Z, Wang S, Li P, Wu J, Yang L, Xu G. Rationally introducing non-canonical amino acids to enhance catalytic activity of LmrR for Henry reaction. BIORESOUR BIOPROCESS 2024; 11:26. [PMID: 38647789 PMCID: PMC10992053 DOI: 10.1186/s40643-024-00744-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Accepted: 02/19/2024] [Indexed: 04/25/2024] Open
Abstract
The use of enzymes to catalyze Henry reaction has advantages of mild reaction conditions and low contamination, but low enzyme activity of promiscuous catalysis limits its application. Here, rational design was first performed to identify the key amino acid residues in Henry reaction catalyzed by Lactococcal multidrug resistance Regulator (LmrR). Further, non-canonical amino acids were introduced into LmrR, successfully obtaining variants that enhanced the catalytic activity of LmrR. The best variant, V15CNF, showed a 184% increase in enzyme activity compared to the wild type, and was 1.92 times more effective than the optimal natural amino acid variant, V15F. Additionally, this variant had a broad substrate spectrum, capable of catalyzing reactions between various aromatic aldehydes and nitromethane, with product yielded ranging from 55 to 99%. This study improved enzymatic catalytic activity by enhancing affinity between the enzyme and substrates, while breaking limited types of natural amino acid residues by introducing non-canonical amino acids into the enzyme, providing strategies for molecular modifications.
Collapse
Affiliation(s)
- Lan Wang
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, Zhejiang, China
| | - Mengting Zhang
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, Zhejiang, China
| | - Haidong Teng
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, Zhejiang, China
| | - Zhe Wang
- Huadong Medicine Co., Ltd, Hangzhou, 310011, Zhejiang, China
| | - Shulin Wang
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, Zhejiang, China
| | - Pengcheng Li
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, Zhejiang, China
| | - Jianping Wu
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, Zhejiang, China
| | - Lirong Yang
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, Zhejiang, China
| | - Gang Xu
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, Zhejiang, China.
| |
Collapse
|
4
|
Shirvanizadeh N, Vihinen M. VariBench, new variation benchmark categories and data sets. FRONTIERS IN BIOINFORMATICS 2023; 3:1248732. [PMID: 37795169 PMCID: PMC10546188 DOI: 10.3389/fbinf.2023.1248732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 09/08/2023] [Indexed: 10/06/2023] Open
Affiliation(s)
| | - Mauno Vihinen
- Department of Experimental Medical Science, Lund University, Lund, Sweden
| |
Collapse
|
5
|
Li J, Kang G, Wang J, Yuan H, Wu Y, Meng S, Wang P, Zhang M, Wang Y, Feng Y, Huang H, de Marco A. Affinity maturation of antibody fragments: A review encompassing the development from random approaches to computational rational optimization. Int J Biol Macromol 2023; 247:125733. [PMID: 37423452 DOI: 10.1016/j.ijbiomac.2023.125733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 07/04/2023] [Accepted: 07/06/2023] [Indexed: 07/11/2023]
Abstract
Routinely screened antibody fragments usually require further in vitro maturation to achieve the desired biophysical properties. Blind in vitro strategies can produce improved ligands by introducing random mutations into the original sequences and selecting the resulting clones under more and more stringent conditions. Rational approaches exploit an alternative perspective that aims first at identifying the specific residues potentially involved in the control of biophysical mechanisms, such as affinity or stability, and then to evaluate what mutations could improve those characteristics. The understanding of the antigen-antibody interactions is instrumental to develop this process the reliability of which, consequently, strongly depends on the quality and completeness of the structural information. Recently, methods based on deep learning approaches critically improved the speed and accuracy of model building and are promising tools for accelerating the docking step. Here, we review the features of the available bioinformatic instruments and analyze the reports illustrating the result obtained with their application to optimize antibody fragments, and nanobodies in particular. Finally, the emerging trends and open questions are summarized.
Collapse
Affiliation(s)
- Jiaqi Li
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China
| | - Guangbo Kang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China
| | - Jiewen Wang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China
| | - Haibin Yuan
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China
| | - Yili Wu
- Zhejiang Provincial Clinical Research Center for Mental Disorders, School of Mental Health and the Affiliated Kangning Hospital, Institute of Aging, Key Laboratory of Alzheimer's Disease of Zhejiang Province, Wenzhou Medical University, Oujiang Laboratory, Wenzhou, Zhejiang 325035, China
| | - Shuxian Meng
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China
| | - Ping Wang
- New Technology R&D Department, Tianjin Modern Innovative TCM Technology Company Limited, Tianjin 300392, China
| | - Miao Zhang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; China Resources Biopharmaceutical Company Limited, Beijing 100029, China
| | - Yuli Wang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Tianjin Pharmaceutical Da Ren Tang Group Corporation Limited, Traditional Chinese Pharmacy Research Institute, Tianjin Key Laboratory of Quality Control in Chinese Medicine, Tianjin 300457, China; State Key Laboratory of Drug Delivery Technology and Pharmacokinetics, Tianjin Institute of Pharmaceutical Research, Tianjin 300193, China
| | - Yuanhang Feng
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China
| | - He Huang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China.
| | - Ario de Marco
- Laboratory for Environmental and Life Sciences, University of Nova Gorica, Nova Gorica, Slovenia.
| |
Collapse
|
6
|
Kumar A K, Rathore RS. Categorization of hotspots into three types - weak, moderate and strong to distinguish protein-protein versus protein-peptide interactions. J Biomol Struct Dyn 2023:1-13. [PMID: 37649387 DOI: 10.1080/07391102.2023.2252077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Accepted: 08/18/2023] [Indexed: 09/01/2023]
Abstract
Protein-protein and protein-peptide interactions (PPI and PPepI) belong to a similar category of interactions, yet seemingly subtle differences exist among them. To characterize differences between protein-protein (PP) and protein-peptide (PPep) interactions, we have focussed on two important classes of residues-hotspot and anchor residues. Using implicit solvation-based free energy calculations, a very large-scale alanine scanning has been performed on benchmark datasets, consisting of over 5700 interface residues. The differences in the two categories are more pronounced, if the data were divided into three distinct types, namely - weak hotspots (having binding free energy loss upon Ala mutation, ΔΔG, ∼2-10 kcal/mol), moderate hotspots (ΔΔG, ∼10-20 kcal/mol) and strong hotspots (ΔΔG ≥ ∼20 kcal/mol). The analysis suggests that for PPI, weak hotspots are predominantly populated by polar and hydrophobic residues. The distribution shifts towards charged and polar residues for moderate hotspot and charged residues (principally Arg) are overwhelmingly present in the strong hotspot. On the other hand, in the PPepI dataset, the distribution shifts from predominantly hydrophobic and polar (in the weak type) to almost similar preference for polar, hydrophobic and charged residues (in moderate type) and finally the charged residue (Arg) and Trp are mostly occupied in the strong type. The preferred anchor residues in both categories are Arg, Tyr and Leu, possessing bulky side chain and which also strike a delicate balance between side chain flexibility and rigidity. The present knowledge should aid in effective design of biologics, by augmentation or disruption of PPIs with peptides or peptidomimetics.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Kiran Kumar A
- Department of Bioinformatics, School of Earth, Biological and Environmental Sciences, Central University of South Bihar, Gaya, India
| | - R S Rathore
- Department of Bioinformatics, School of Earth, Biological and Environmental Sciences, Central University of South Bihar, Gaya, India
| |
Collapse
|
7
|
Pandey P, Panday SK, Rimal P, Ancona N, Alexov E. Predicting the Effect of Single Mutations on Protein Stability and Binding with Respect to Types of Mutations. Int J Mol Sci 2023; 24:12073. [PMID: 37569449 PMCID: PMC10418460 DOI: 10.3390/ijms241512073] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 07/24/2023] [Accepted: 07/26/2023] [Indexed: 08/13/2023] Open
Abstract
The development of methods and algorithms to predict the effect of mutations on protein stability, protein-protein interaction, and protein-DNA/RNA binding is necessitated by the needs of protein engineering and for understanding the molecular mechanism of disease-causing variants. The vast majority of the leading methods require a database of experimentally measured folding and binding free energy changes for training. These databases are collections of experimental data taken from scientific investigations typically aimed at probing the role of particular residues on the above-mentioned thermodynamic characteristics, i.e., the mutations are not introduced at random and do not necessarily represent mutations originating from single nucleotide variants (SNV). Thus, the reported performance of the leading algorithms assessed on these databases or other limited cases may not be applicable for predicting the effect of SNVs seen in the human population. Indeed, we demonstrate that the SNVs and non-SNVs are not equally presented in the corresponding databases, and the distribution of the free energy changes is not the same. It is shown that the Pearson correlation coefficients (PCCs) of folding and binding free energy changes obtained in cases involving SNVs are smaller than for non-SNVs, indicating that caution should be used in applying them to reveal the effect of human SNVs. Furthermore, it is demonstrated that some methods are sensitive to the chemical nature of the mutations, resulting in PCCs that differ by a factor of four across chemically different mutations. All methods are found to underestimate the energy changes by roughly a factor of 2.
Collapse
Affiliation(s)
- Preeti Pandey
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA; (P.P.); (S.K.P.); (P.R.)
| | - Shailesh Kumar Panday
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA; (P.P.); (S.K.P.); (P.R.)
| | - Prawin Rimal
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA; (P.P.); (S.K.P.); (P.R.)
| | - Nicolas Ancona
- Department of Biological Sciences, Clemson University, Clemson, SC 29634, USA;
| | - Emil Alexov
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA; (P.P.); (S.K.P.); (P.R.)
| |
Collapse
|
8
|
Li L, Zhang R, Xu Y, Zhang W. Comprehensive screening strategy coupled with structure-guided engineering of l-threonine aldolase from Pseudomonas putida for enhanced catalytic efficiency towards l- threo-4-methylsulfonylphenylserine. Front Bioeng Biotechnol 2023; 11:1117890. [PMID: 36793440 PMCID: PMC9922994 DOI: 10.3389/fbioe.2023.1117890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 01/19/2023] [Indexed: 01/31/2023] Open
Abstract
l-Threonine aldolases (TAs) can catalyze aldol condensation reactions to form β-hydroxy-α-amino acids, but afford unsatisfactory conversion and poor stereoselectivity at the Cβ position. In this study, a directed evolution coupling high-throughput screening method was developed to screen more efficient l-TA mutants based on their aldol condensation activity. A mutant library with over 4000 l-TA mutants from Pseudomonas putida were obtained by random mutagenesis. About 10% of mutants retained activity toward 4-methylsulfonylbenzaldehyde, with five site mutations (A9L, Y13K, H133N, E147D, and Y312E) showing higher activity. Iterative combinatorial mutant A9V/Y13K/Y312R catalyzed l-threo-4-methylsulfonylphenylserine with a 72% conversion and 86% diastereoselectivity, representing 2.3-fold and 5.1-fold improvements relative to the wild-type. Molecular dynamics simulations illustrated that additional hydrogen bonds, water bridge force, hydrophobic interactions, and π-cation interactions were present in the A9V/Y13K/Y312R mutant compared with the wild-type to reshape the substrate-binding pocket, resulting in a higher conversion and Cβ stereoselectivity. This study provides a useful strategy for engineering TAs to resolve the low Cβ stereoselectivity problem and contributes to the industrial application of TAs.
Collapse
Affiliation(s)
- Lihong Li
- Lab of Brewing Microbiology and Applied Enzymology, School of Biotechnology and Key Laboratory of Industrial Biotechnology of Ministry of Education, Jiangnan University, Wuxi, China
| | - Rongzhen Zhang
- Lab of Brewing Microbiology and Applied Enzymology, School of Biotechnology and Key Laboratory of Industrial Biotechnology of Ministry of Education, Jiangnan University, Wuxi, China,*Correspondence: Rongzhen Zhang,
| | - Yan Xu
- Lab of Brewing Microbiology and Applied Enzymology, School of Biotechnology and Key Laboratory of Industrial Biotechnology of Ministry of Education, Jiangnan University, Wuxi, China
| | - Wenchi Zhang
- Solomon H. Snyder Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, MD, United States
| |
Collapse
|
9
|
Chen J, Qiu Y, Wang R, Wei GW. Persistent Laplacian projected Omicron BA.4 and BA.5 to become new dominating variants. Comput Biol Med 2022; 151:106262. [PMID: 36379191 PMCID: PMC10754203 DOI: 10.1016/j.compbiomed.2022.106262] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 10/21/2022] [Accepted: 10/30/2022] [Indexed: 11/15/2022]
Abstract
Due to its high transmissibility, Omicron BA.1 ousted the Delta variant to become a dominating variant in late 2021 and was replaced by more transmissible Omicron BA.2 in March 2022. An important question is which new variants will dominate in the future. Topology-based deep learning models have had tremendous success in forecasting emerging variants in the past. However, topology is insensitive to homotopic shape evolution in virus-human protein-protein binding, which is crucial to viral evolution and transmission. This challenge is tackled with persistent Laplacian, which is able to capture both the topological change and homotopic shape evolution of data. Persistent Laplacian-based deep learning models are developed to systematically evaluate variant infectivity. Our comparative analysis of Alpha, Beta, Gamma, Delta, Lambda, Mu, and Omicron BA.1, BA.1.1, BA.2, BA.2.11, BA.2.12.1, BA.3, BA.4, and BA.5 unveils that Omicron BA.2.11, BA.2.12.1, BA.3, BA.4, and BA.5 are more contagious than BA.2. In particular, BA.4 and BA.5 are about 36% more infectious than BA.2 and are projected to become new dominant variants by natural selection. Moreover, the proposed models outperform the state-of-the-art methods on three major benchmark datasets for mutation-induced protein-protein binding free energy changes. Our key projection about BA4 and BA.5's dominance made on May 1, 2022 (see arXiv:2205.00532) became a reality in late June 2022.
Collapse
Affiliation(s)
- Jiahui Chen
- Department of Mathematics, Michigan State University, East Lansing, MI 48824, USA
| | - Yuchi Qiu
- Department of Mathematics, Michigan State University, East Lansing, MI 48824, USA
| | - Rui Wang
- Department of Mathematics, Michigan State University, East Lansing, MI 48824, USA
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, East Lansing, MI 48824, USA; Department of Electrical and Computer Engineering, Michigan State University, East Lansing, MI 48824, USA; Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA.
| |
Collapse
|
10
|
Liu J, Xia KL, Wu J, Yau SST, Wei GW. Biomolecular Topology: Modelling and Analysis. ACTA MATHEMATICA SINICA, ENGLISH SERIES 2022; 38:1901-1938. [PMID: 36407804 PMCID: PMC9640850 DOI: 10.1007/s10114-022-2326-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 07/12/2022] [Indexed: 05/25/2023]
Abstract
With the great advancement of experimental tools, a tremendous amount of biomolecular data has been generated and accumulated in various databases. The high dimensionality, structural complexity, the nonlinearity, and entanglements of biomolecular data, ranging from DNA knots, RNA secondary structures, protein folding configurations, chromosomes, DNA origami, molecular assembly, to others at the macromolecular level, pose a severe challenge in their analysis and characterization. In the past few decades, mathematical concepts, models, algorithms, and tools from algebraic topology, combinatorial topology, computational topology, and topological data analysis, have demonstrated great power and begun to play an essential role in tackling the biomolecular data challenge. In this work, we introduce biomolecular topology, which concerns the topological problems and models originated from the biomolecular systems. More specifically, the biomolecular topology encompasses topological structures, properties and relations that are emerged from biomolecular structures, dynamics, interactions, and functions. We discuss the various types of biomolecular topology from structures (of proteins, DNAs, and RNAs), protein folding, and protein assembly. A brief discussion of databanks (and databases), theoretical models, and computational algorithms, is presented. Further, we systematically review related topological models, including graphs, simplicial complexes, persistent homology, persistent Laplacians, de Rham-Hodge theory, Yau-Hausdorff distance, and the topology-based machine learning models.
Collapse
Affiliation(s)
- Jian Liu
- School of Mathematical Sciences, Hebei Normal University, Shijiazhuang, 050024 P. R. China
- Yanqi Lake Beijing Institute of Mathematical Sciences and Applications, Beijing, 101408 P. R. China
| | - Ke-Lin Xia
- School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore, 639798 Singapore
| | - Jie Wu
- Yanqi Lake Beijing Institute of Mathematical Sciences and Applications, Beijing, 101408 P. R. China
- Department of Mathematical Sciences, Tsinghua University, Beijing, 100084 P. R. China
| | - Stephen Shing-Toung Yau
- Yanqi Lake Beijing Institute of Mathematical Sciences and Applications, Beijing, 101408 P. R. China
- Department of Mathematical Sciences, Tsinghua University, Beijing, 100084 P. R. China
| | - Guo-Wei Wei
- Department of Mathematics & Department of Biochemistry and Molecular Biology & Department of Electrical and Computer Engineering, Michigan State University, Wells Hall 619 Red Cedar Road, East Lansing, MI 48824-1027 USA
| |
Collapse
|
11
|
Liu X, Feng H, Wu J, Xia K. Hom-Complex-Based Machine Learning (HCML) for the Prediction of Protein-Protein Binding Affinity Changes upon Mutation. J Chem Inf Model 2022; 62:3961-3969. [PMID: 36040839 DOI: 10.1021/acs.jcim.2c00580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Protein-protein interactions (PPIs) are involved in almost all biological processes in the cell. Understanding protein-protein interactions holds the key for the understanding of biological functions, diseases and the development of therapeutics. Recently, artificial intelligence (AI) models have demonstrated great power in PPIs. However, a key issue for all AI-based PPI models is efficient molecular representations and featurization. Here, we propose Hom-complex-based PPI representation, and Hom-complex-based machine learning models for the prediction of PPI binding affinity changes upon mutation, for the first time. In our model, various Hom complexes Hom(G1, G) can be generated for the graph representation G of protein-protein complex by using different graphs G1, which reveal G1-related inner connections within the graph representation G of protein-protein complex. Further, for a specific graph G1, a series of nested Hom complexes are generated to give a multiscale characterization of the PPIs. Its persistent homology and persistent Euler characteristic are used as molecular descriptors and further combined with the machine learning model, in particular, gradient boosting tree (GBT). We systematically test our model on the two most-commonly used data sets, that is, SKEMPI and AB-Bind. It has been found that our model outperforms all the existing models as far as we know, which demonstrates the great potential of our model for the analysis of PPIs. Our model can be used for the analysis and design of efficient antibodies for SARS-CoV-2.
Collapse
Affiliation(s)
- Xiang Liu
- Chern Institute of Mathematics and LPMC, Nankai University, Tianjin, China, 300071.,Division of Mathematical Sciences, School of Physical and Mathematical Sciences Nanyang Technological University, Singapore 637371
| | - Huitao Feng
- Division of Mathematical Sciences, School of Physical and Mathematical Sciences Nanyang Technological University, Singapore 637371.,Mathematical Science Research Center, Chongqing University of Technology, Chongqing, China, 400054
| | - Jie Wu
- Yanqi Lake Beijing Institute of Mathematical Sciences and Applications (BIMSA), Beijing, China,101408
| | - Kelin Xia
- Division of Mathematical Sciences, School of Physical and Mathematical Sciences Nanyang Technological University, Singapore 637371
| |
Collapse
|
12
|
Probing the Immune System Dynamics of the COVID-19 Disease for Vaccine Designing and Drug Repurposing Using Bioinformatics Tools. IMMUNO 2022. [DOI: 10.3390/immuno2020022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
The pathogenesis of COVID-19 is complicated by immune dysfunction. The impact of immune-based therapy in COVID-19 patients has been well documented, with some notable studies on the use of anti-cytokine medicines. However, the complexity of disease phenotypes, patient heterogeneity and the varying quality of evidence from immunotherapy studies provide problems in clinical decision-making. This review seeks to aid therapeutic decision-making by giving an overview of the immunological responses against COVID-19 disease that may contribute to the severity of the disease. We have extensively discussed theranostic methods for COVID-19 detection. With advancements in technology, bioinformatics has taken studies to a higher level. The paper also discusses the application of bioinformatics and machine learning tools for the diagnosis, vaccine design and drug repurposing against SARS-CoV-2.
Collapse
|
13
|
Lim TS, Choong YS. In silico design of ACE2 mutants for competitive binding of SARS-CoV-2 receptor binding domain with hACE2. PHYSICAL SCIENCES REVIEWS 2022. [DOI: 10.1515/psr-2021-0136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
The receptor binding motif (RBM) within the S-protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has been touted as one of the main targets for vaccine/therapeutic development due to its interaction with the human angiotensin II converting enzyme 2 (hACE2) to facilitate virus entry into the host cell. The mechanism of action is based on the disruption of binding between the RBM and the hACE2 to prevent virus uptake for replication. In this work, we applied in silico approaches to design specific competitive binders for SARS-CoV-2 S-protein receptor binding motif (RBM) by using hACE2 peptidase domain (PD) mutants. Online single point mutation servers were utilised to estimate the effect of PD mutation on the binding affinity with RBM. The PD mutants were then modelled and the binding free energy was calculated. Three PD variants were designed with an increased affinity and interaction with SARS-CoV-2-RBM. It is hope that these designs could serve as the initial work for vaccine/drug development and could eventually interfere the preliminary recognition between SARS-CoV-2 and the host cell.
Collapse
Affiliation(s)
- Theam Soon Lim
- Institute for Research in Molecular Medicine, Universiti Sains Malaysia , Penang , Malaysia
| | - Yee Siew Choong
- Institute for Research in Molecular Medicine, Universiti Sains Malaysia , Penang , Malaysia
| |
Collapse
|
14
|
Wee J, Xia K. Persistent spectral based ensemble learning (PerSpect-EL) for protein-protein binding affinity prediction. Brief Bioinform 2022; 23:6533501. [PMID: 35189639 DOI: 10.1093/bib/bbac024] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 12/30/2021] [Accepted: 01/17/2022] [Indexed: 12/14/2022] Open
Abstract
Protein-protein interactions (PPIs) play a significant role in nearly all cellular and biological activities. Data-driven machine learning models have demonstrated great power in PPIs. However, the design of efficient molecular featurization poses a great challenge for all learning models for PPIs. Here, we propose persistent spectral (PerSpect) based PPI representation and featurization, and PerSpect-based ensemble learning (PerSpect-EL) models for PPI binding affinity prediction, for the first time. In our model, a sequence of Hodge (or combinatorial) Laplacian (HL) matrices at various different scales are generated from a specially designed filtration process. PerSpect attributes, which are statistical and combinatorial properties of spectrum information from these HL matrices, are used as features for PPI characterization. Each PerSpect attribute is input into a 1D convolutional neural network (CNN), and these CNN networks are stacked together in our PerSpect-based ensemble learning models. We systematically test our model on the two most commonly used datasets, i.e. SKEMPI and AB-Bind. It has been found that our model can achieve state-of-the-art results and outperform all existing models to the best of our knowledge.
Collapse
Affiliation(s)
- JunJie Wee
- Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore 637371
| | - Kelin Xia
- Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore 637371
| |
Collapse
|
15
|
Sun T, Chen Y, Wen Y, Zhu Z, Li M. PremPLI: a machine learning model for predicting the effects of missense mutations on protein-ligand interactions. Commun Biol 2021; 4:1311. [PMID: 34799678 PMCID: PMC8604987 DOI: 10.1038/s42003-021-02826-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Accepted: 10/26/2021] [Indexed: 02/07/2023] Open
Abstract
Resistance to small-molecule drugs is the main cause of the failure of therapeutic drugs in clinical practice. Missense mutations altering the binding of ligands to proteins are one of the critical mechanisms that result in genetic disease and drug resistance. Computational methods have made a lot of progress for predicting binding affinity changes and identifying resistance mutations, but their prediction accuracy and speed are still not satisfied and need to be further improved. To address these issues, we introduce a structure-based machine learning method for quantitatively estimating the effects of single mutations on ligand binding affinity changes (named as PremPLI). A comprehensive comparison of the predictive performance of PremPLI with other available methods on two benchmark datasets confirms that our approach performs robustly and presents similar or even higher predictive accuracy than the approaches relying on first-principle statistical mechanics and mixed physics- and knowledge-based potentials while requires much less computational resources. PremPLI can be used for guiding the design of ligand-binding proteins, identifying and understanding disease driver mutations, and finding potential resistance mutations for different drugs. PremPLI is freely available at https://lilab.jysw.suda.edu.cn/research/PremPLI/ and allows to do large-scale mutational scanning. Sun et al. present PremPLI, a machine learning approach and web tool to predict how missense mutations in a drug’s target protein will affect the drug’s binding affinity. PremPLI can be applied to identify drug resistance mechanisms in cancer cells and microorganisms and develop drugs to combat drug resistance.
Collapse
Affiliation(s)
- Tingting Sun
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, 215123, Suzhou, China
| | - Yuting Chen
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, 215123, Suzhou, China
| | - Yuhao Wen
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, 215123, Suzhou, China
| | - Zefeng Zhu
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, 215123, Suzhou, China
| | - Minghui Li
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, 215123, Suzhou, China.
| |
Collapse
|
16
|
Chen C, Boorla VS, Banerjee D, Chowdhury R, Cavener VS, Nissly RH, Gontu A, Boyle NR, Vandegrift K, Nair MS, Kuchipudi SV, Maranas CD. Computational prediction of the effect of amino acid changes on the binding affinity between SARS-CoV-2 spike RBD and human ACE2. Proc Natl Acad Sci U S A 2021; 118:e2106480118. [PMID: 34588290 PMCID: PMC8594574 DOI: 10.1073/pnas.2106480118] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/23/2021] [Indexed: 01/22/2023] Open
Abstract
The association of the receptor binding domain (RBD) of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike protein with human angiotensin-converting enzyme 2 (hACE2) represents the first required step for cellular entry. SARS-CoV-2 has continued to evolve with the emergence of several novel variants, and amino acid changes in the RBD have been implicated with increased fitness and potential for immune evasion. Reliably predicting the effect of amino acid changes on the ability of the RBD to interact more strongly with the hACE2 can help assess the implications for public health and the potential for spillover and adaptation into other animals. Here, we introduce a two-step framework that first relies on 48 independent 4-ns molecular dynamics (MD) trajectories of RBD-hACE2 variants to collect binding energy terms decomposed into Coulombic, covalent, van der Waals, lipophilic, generalized Born solvation, hydrogen bonding, π-π packing, and self-contact correction terms. The second step implements a neural network to classify and quantitatively predict binding affinity changes using the decomposed energy terms as descriptors. The computational base achieves a validation accuracy of 82.8% for classifying single-amino acid substitution variants of the RBD as worsening or improving binding affinity for hACE2 and a correlation coefficient of 0.73 between predicted and experimentally calculated changes in binding affinities. Both metrics are calculated using a fivefold cross-validation test. Our method thus sets up a framework for screening binding affinity changes caused by unknown single- and multiple-amino acid changes offering a valuable tool to predict host adaptation of SARS-CoV-2 variants toward tighter hACE2 binding.
Collapse
Affiliation(s)
- Chen Chen
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802
| | - Veda Sheersh Boorla
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802
| | - Deepro Banerjee
- The Bioinformatics and Genomics Program, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802
| | - Ratul Chowdhury
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802
| | - Victoria S Cavener
- Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802
| | - Ruth H Nissly
- Animal Diagnostic Laboratory, Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802
| | - Abhinay Gontu
- Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802
| | - Nina R Boyle
- Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802
| | - Kurt Vandegrift
- Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park, PA 16802
| | - Meera Surendran Nair
- Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802
- Animal Diagnostic Laboratory, Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802
| | - Suresh V Kuchipudi
- Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802;
- Animal Diagnostic Laboratory, Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802
- Center for Infectious Disease Dynamics, The Pennsylvania State University, University Park, PA 16802
| | - Costas D Maranas
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802;
| |
Collapse
|
17
|
Liu X, Luo Y, Li P, Song S, Peng J. Deep geometric representations for modeling effects of mutations on protein-protein binding affinity. PLoS Comput Biol 2021; 17:e1009284. [PMID: 34347784 PMCID: PMC8366979 DOI: 10.1371/journal.pcbi.1009284] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Revised: 08/16/2021] [Accepted: 07/17/2021] [Indexed: 11/19/2022] Open
Abstract
Modeling the impact of amino acid mutations on protein-protein interaction plays a crucial role in protein engineering and drug design. In this study, we develop GeoPPI, a novel structure-based deep-learning framework to predict the change of binding affinity upon mutations. Based on the three-dimensional structure of a protein, GeoPPI first learns a geometric representation that encodes topology features of the protein structure via a self-supervised learning scheme. These representations are then used as features for training gradient-boosting trees to predict the changes of protein-protein binding affinity upon mutations. We find that GeoPPI is able to learn meaningful features that characterize interactions between atoms in protein structures. In addition, through extensive experiments, we show that GeoPPI achieves new state-of-the-art performance in predicting the binding affinity changes upon both single- and multi-point mutations on six benchmark datasets. Moreover, we show that GeoPPI can accurately estimate the difference of binding affinities between a few recently identified SARS-CoV-2 antibodies and the receptor-binding domain (RBD) of the S protein. These results demonstrate the potential of GeoPPI as a powerful and useful computational tool in protein design and engineering. Our code and datasets are available at: https://github.com/Liuxg16/GeoPPI. Estimating the binding affinities of protein-protein interactions (PPIs) is crucial to understand protein function and design new functional proteins. Since the experimental measurement in wet-labs is labor-intensive and time-consuming, fast and accurate in silico approaches have received much attention. Although considerable efforts have been made in this direction, predicting the effects of mutations on the protein-protein binding affinity is still a challenging research problem. In this work, we introduce GeoPPI, a novel computational approach that uses deep geometric representations of protein complexes to predict the effects of mutations on the binding affinity. The geometric representations are first learned via a self-supervised learning scheme and then integrated with gradient-boosting trees to accomplish the prediction. We find that the learned representations encode meaningful patterns underlying the interactions between atoms in protein structures. Also, extensive tests on major benchmark datasets show that GeoPPI has made an important improvement over the existing methods in predicting the effects of mutations on the binding affinity.
Collapse
Affiliation(s)
- Xianggen Liu
- Laboratory for Brain and Intelligence and Department of Biomedical Engineering, Tsinghua University, Beijing, China
- School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, China
- Beijing Innovation Center for Future Chip, Tsinghua University, Beijing, China
| | - Yunan Luo
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Pengyong Li
- Laboratory for Brain and Intelligence and Department of Biomedical Engineering, Tsinghua University, Beijing, China
- Beijing Innovation Center for Future Chip, Tsinghua University, Beijing, China
| | - Sen Song
- Laboratory for Brain and Intelligence and Department of Biomedical Engineering, Tsinghua University, Beijing, China
- Beijing Innovation Center for Future Chip, Tsinghua University, Beijing, China
- * E-mail: (JP); (SS)
| | - Jian Peng
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- * E-mail: (JP); (SS)
| |
Collapse
|
18
|
Li G, Pahari S, Murthy AK, Liang S, Fragoza R, Yu H, Alexov E. SAAMBE-SEQ: a sequence-based method for predicting mutation effect on protein-protein binding affinity. Bioinformatics 2021; 37:992-999. [PMID: 32866236 PMCID: PMC8128451 DOI: 10.1093/bioinformatics/btaa761] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2020] [Revised: 08/17/2020] [Accepted: 08/24/2020] [Indexed: 01/04/2023] Open
Abstract
MOTIVATION Vast majority of human genetic disorders are associated with mutations that affect protein-protein interactions by altering wild-type binding affinity. Therefore, it is extremely important to assess the effect of mutations on protein-protein binding free energy to assist the development of therapeutic solutions. Currently, the most popular approaches use structural information to deliver the predictions, which precludes them to be applicable on genome-scale investigations. Indeed, with the progress of genomic sequencing, researchers are frequently dealing with assessing effect of mutations for which there is no structure available. RESULTS Here, we report a Gradient Boosting Decision Tree machine learning algorithm, the SAAMBE-SEQ, which is completely sequence-based and does not require structural information at all. SAAMBE-SEQ utilizes 80 features representing evolutionary information, sequence-based features and change of physical properties upon mutation at the mutation site. The approach is shown to achieve Pearson correlation coefficient (PCC) of 0.83 in 5-fold cross validation in a benchmarking test against experimentally determined binding free energy change (ΔΔG). Further, a blind test (no-STRUC) is compiled collecting experimental ΔΔG upon mutation for protein complexes for which structure is not available and used to benchmark SAAMBE-SEQ resulting in PCC in the range of 0.37-0.46. The accuracy of SAAMBE-SEQ method is found to be either better or comparable to most advanced structure-based methods. SAAMBE-SEQ is very fast, available as webserver and stand-alone code, and indeed utilizes only sequence information, and thus it is applicable for genome-scale investigations to study the effect of mutations on protein-protein interactions. AVAILABILITY AND IMPLEMENTATION SAAMBE-SEQ is available at http://compbio.clemson.edu/saambe_webserver/indexSEQ.php#started. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Gen Li
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA
| | - Swagata Pahari
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA
| | | | - Siqi Liang
- Department of Computational Biology, Cornell University, Ithaca, NY 14850, USA
| | - Robert Fragoza
- Department of Computational Biology, Cornell University, Ithaca, NY 14850, USA
| | - Haiyuan Yu
- Department of Computational Biology, Cornell University, Ithaca, NY 14850, USA
| | - Emil Alexov
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA
| |
Collapse
|
19
|
Zheng W, Yu H, Fang S, Chen K, Wang Z, Cheng X, Xu G, Yang L, Wu J. Directed Evolution of l-Threonine Aldolase for the Diastereoselective Synthesis of β-Hydroxy-α-amino Acids. ACS Catal 2021. [DOI: 10.1021/acscatal.0c04949] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Affiliation(s)
- Wenlong Zheng
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Hangzhou Global Scientific and Technological Innovation Centre, Zhejiang University, Hangzhou 311200, China
| | - Haoran Yu
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Hangzhou Global Scientific and Technological Innovation Centre, Zhejiang University, Hangzhou 311200, China
| | - Sai Fang
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Kaitong Chen
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Zhe Wang
- Huadong Medicine Co., Ltd, Hangzhou 310011, China
| | - Xiuli Cheng
- Huadong Medicine Co., Ltd, Hangzhou 310011, China
| | - Gang Xu
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Lirong Yang
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Hangzhou Global Scientific and Technological Innovation Centre, Zhejiang University, Hangzhou 311200, China
| | - Jianping Wu
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Hangzhou Global Scientific and Technological Innovation Centre, Zhejiang University, Hangzhou 311200, China
| |
Collapse
|
20
|
Meseguer A, Dominguez L, Bota PM, Aguirre‐Plans J, Bonet J, Fernandez‐Fuentes N, Oliva B. Using collections of structural models to predict changes of binding affinity caused by mutations in protein-protein interactions. Protein Sci 2020; 29:2112-2130. [PMID: 32797645 PMCID: PMC7513729 DOI: 10.1002/pro.3930] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 08/04/2020] [Accepted: 08/05/2020] [Indexed: 12/24/2022]
Abstract
Protein-protein interactions (PPIs) in all the molecular aspects that take place both inside and outside cells. However, determining experimentally the structure and affinity of PPIs is expensive and time consuming. Therefore, the development of computational tools, as a complement to experimental methods, is fundamental. Here, we present a computational suite: MODPIN, to model and predict the changes of binding affinity of PPIs. In this approach we use homology modeling to derive the structures of PPIs and score them using state-of-the-art scoring functions. We explore the conformational space of PPIs by generating not a single structural model but a collection of structural models with different conformations based on several templates. We apply the approach to predict the changes in free energy upon mutations and splicing variants of large datasets of PPIs to statistically quantify the quality and accuracy of the predictions. As an example, we use MODPIN to study the effect of mutations in the interaction between colicin endonuclease 9 and colicin endonuclease 2 immune protein from Escherichia coli. Finally, we have compared our results with other state-of-art methods.
Collapse
Affiliation(s)
- Alberto Meseguer
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
| | - Lluis Dominguez
- Integrative Biomedical Informatics Group (GRIB‐IMIM). Department of Experimental and Life SciencesUniversitat Pompeu FabraBarcelonaCataloniaSpain
| | - Patricia M. Bota
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
- Department of BiosciencesUniversitat de Vic‐Universitat Central de CatalunyaVicCataloniaSpain
| | - Joaquim Aguirre‐Plans
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
| | - Jaume Bonet
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
| | - Narcis Fernandez‐Fuentes
- Department of BiosciencesUniversitat de Vic‐Universitat Central de CatalunyaVicCataloniaSpain
- Institute of Biological, Environmental and Rural SciencesAberystwyth UniversityAberystwythUK
| | - Baldo Oliva
- Structural Bioinformatics Group, Research Programme on Biomedical Informatics, Department of Experimental and Health ScienceUniversitat Pompeu FabraBarcelonaCataloniaSpain
| |
Collapse
|
21
|
Chen J, Wang R, Wang M, Wei GW. Mutations Strengthened SARS-CoV-2 Infectivity. J Mol Biol 2020; 432:5212-5226. [PMID: 32710986 PMCID: PMC7375973 DOI: 10.1016/j.jmb.2020.07.009] [Citation(s) in RCA: 326] [Impact Index Per Article: 81.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 07/09/2020] [Accepted: 07/17/2020] [Indexed: 12/12/2022]
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infectivity is a major concern in coronavirus disease 2019 (COVID-19) prevention and economic reopening. However, rigorous determination of SARS-CoV-2 infectivity is very difficult owing to its continuous evolution with over 10,000 single nucleotide polymorphisms (SNP) variants in many subtypes. We employ an algebraic topology-based machine learning model to quantitatively evaluate the binding free energy changes of SARS-CoV-2 spike glycoprotein (S protein) and host angiotensin-converting enzyme 2 receptor following mutations. We reveal that the SARS-CoV-2 virus becomes more infectious. Three out of six SARS-CoV-2 subtypes have become slightly more infectious, while the other three subtypes have significantly strengthened their infectivity. We also find that SARS-CoV-2 is slightly more infectious than SARS-CoV according to computed S protein-angiotensin-converting enzyme 2 binding free energy changes. Based on a systematic evaluation of all possible 3686 future mutations on the S protein receptor-binding domain, we show that most likely future mutations will make SARS-CoV-2 more infectious. Combining sequence alignment, probability analysis, and binding free energy calculation, we predict that a few residues on the receptor-binding motif, i.e., 452, 489, 500, 501, and 505, have high chances to mutate into significantly more infectious COVID-19 strains.
Collapse
Affiliation(s)
- Jiahui Chen
- Department of Mathematics, Michigan State University, MI 48824, USA
| | - Rui Wang
- Department of Mathematics, Michigan State University, MI 48824, USA
| | - Menglun Wang
- Department of Mathematics, Michigan State University, MI 48824, USA
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, MI 48824, USA; Department of Electrical and Computer Engineering, Michigan State University, MI 48824, USA; Department of Biochemistry and Molecular Biology, Michigan State University, MI 48824, USA.
| |
Collapse
|
22
|
Mahase V, Sobitan A, Johnson C, Cooper F, Xie Y, Li L, Teng S. Computational analysis of hereditary spastic paraplegia mutations in the kinesin motor domains of KIF1A and KIF5A. JOURNAL OF THEORETICAL & COMPUTATIONAL CHEMISTRY 2020. [DOI: 10.1142/s0219633620410035] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Hereditary spastic paraplegias (HSPs) are a genetically heterogeneous collection of neurodegenerative disorders categorized by progressive lower-limb spasticity and frailty. The complex HSP forms are characterized by various neurological features including progressive spastic weakness, urinary sphincter dysfunction, extra pyramidal signs and intellectual disability (ID). The kinesin superfamily proteins (KIFs) are microtubule-dependent molecular motors involved in intracellular transport. Kinesins directionally transport membrane vesicles, protein complexes, and mRNAs along neurites, thus playing important roles in neuronal development and function. Recent genetic studies have identified kinesin mutations in patients with HSPs. In this study, we used the computational approaches to investigate the 40 missense mutations associated with HSP and ID in KIF1A and KIF5A. We performed homology modeling to construct the structures of kinesin–microtubule binding domain and kinesin–tubulin complex. We applied structure-based energy calculation methods to determine the effects of missense mutations on protein stability and protein–protein interaction. The results revealed that the most of disease-causing mutations could change the folding free energy of kinesin motor domain and the binding free energy of kinesin–tubulin complex. We found that E253K associated with ID in KIF1A decrease the protein stability of kinesin motor domains. We showed that the HSP mutations located in kinesin–tubulin complex interface, such as K253N and R280C in KIF5A, can destabilize the kinesin–tubulin complex. The computational analysis provides useful information for understanding the roles of kinesin mutations in the development of ID and HSPs.
Collapse
Affiliation(s)
- Vidhyanand Mahase
- Department of Biology, Howard University, Washington, D.C., 20059 USA
| | - Adebiyi Sobitan
- Department of Biology, Howard University, Washington, D.C., 20059 USA
| | - Christina Johnson
- Department of Biology, Howard University, Washington, D.C., 20059 USA
| | - Farion Cooper
- Department of Biology, Howard University, Washington, D.C., 20059 USA
| | - Yixin Xie
- Computational Science Program, University of Texas at El Paso, El Paso, Texas 79902, USA
| | - Lin Li
- Computational Science Program, University of Texas at El Paso, El Paso, Texas 79902, USA
- Department of Physics, University of Texas at El Paso, El Paso, Texas 79902, USA
| | - Shaolei Teng
- Department of Biology, Howard University, Washington, D.C., 20059 USA
| |
Collapse
|
23
|
Zhang N, Lu H, Chen Y, Zhu Z, Yang Q, Wang S, Li M. PremPRI: Predicting the Effects of Missense Mutations on Protein-RNA Interactions. Int J Mol Sci 2020; 21:ijms21155560. [PMID: 32756481 PMCID: PMC7432928 DOI: 10.3390/ijms21155560] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Revised: 07/28/2020] [Accepted: 07/30/2020] [Indexed: 12/23/2022] Open
Abstract
Protein–RNA interactions are crucial for many cellular processes, such as protein synthesis and regulation of gene expression. Missense mutations that alter protein–RNA interaction may contribute to the pathogenesis of many diseases. Here, we introduce a new computational method PremPRI, which predicts the effects of single mutations occurring in RNA binding proteins on the protein–RNA interactions by calculating the binding affinity changes quantitatively. The multiple linear regression scoring function of PremPRI is composed of three sequence- and eight structure-based features, and is parameterized on 248 mutations from 50 protein–RNA complexes. Our model shows a good agreement between calculated and experimental values of binding affinity changes with a Pearson correlation coefficient of 0.72 and the corresponding root-mean-square error of 0.76 kcal·mol−1, outperforming three other available methods. PremPRI can be used for finding functionally important variants, understanding the molecular mechanisms, and designing new protein–RNA interaction inhibitors.
Collapse
|
24
|
Zheng W, Chen K, Wang Z, Cheng X, Xu G, Yang L, Wu J. Construction of a Highly Diastereoselective Aldol Reaction System with l-Threonine Aldolase by Computer-Assisted Rational Molecular Modification and Medium Engineering. Org Lett 2020; 22:5763-5767. [DOI: 10.1021/acs.orglett.0c01792] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- Wenlong Zheng
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Kaitong Chen
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Zhe Wang
- Huadong Medicine Co Ltd., Hangzhou, Zhejiang 310011, China
| | - Xiuli Cheng
- Huadong Medicine Co Ltd., Hangzhou, Zhejiang 310011, China
| | - Gang Xu
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Lirong Yang
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 310027, China
| | - Jianping Wu
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 310027, China
| |
Collapse
|
25
|
Ganakammal SR, Koirala M, Wu B, Alexov E. In-silico analysis to identify the role of MEN1 missense mutations in breast cancer. JOURNAL OF THEORETICAL & COMPUTATIONAL CHEMISTRY 2020. [DOI: 10.1142/s0219633620410023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Background: The multiple endocrine neoplasia type 1 (MEN1) gene located on chromosome 11q13 encodes menin protein. Previously reported mutations were thought to result in loss of function of menin protein and that they are associated with multiple endocrine neoplasia 1 disorder. However, recently menin has also been characterized as an oncosuppressor protein and it was suggested that mutations in it are associated with various other tumors. Studies indicate that the menin protein stimulates the estrogen receptor (ER) that in turn increases the predisposition for inherited breast cancer. Methods: Here, we used our supervised in-house combinatory in-silico predictor method to investigate the impact of unclassified missense mutations in MEN1 gene found in breast cancer tissue. We also examined the biophysical and biochemical properties to predict the effects of these missense variants on the menin protein stability and interactions. The results are compared with the effects of known pathogenic mutations in menin causing neoplasia. Results: Our analysis indicates that some of the variants found in breast cancer tissue show similar pattern of destabilizing the menin protein and its interactions as the pathogenic variants associated with neoplasia. Taking together with the results of our in-silico consensus predictor, we classify missense mutations in menin protein found in breast cancer tissue into pathogenic and benign, and thus, suggesting as an indicator for early detection of elevated breast cancer risk.
Collapse
Affiliation(s)
| | - Mahesh Koirala
- Department of Physics, Clemson University, Clemson SC, USA
| | - Bohua Wu
- Department of Physics, Clemson University, Clemson SC, USA
| | - Emil Alexov
- Department of Healthcare Genetics, School of Nursing, Clemson University, Clemson SC, USA
- Department of Physics, Clemson University, Clemson SC, USA
| |
Collapse
|
26
|
Pahari S, Li G, Murthy AK, Liang S, Fragoza R, Yu H, Alexov E. SAAMBE-3D: Predicting Effect of Mutations on Protein-Protein Interactions. Int J Mol Sci 2020; 21:E2563. [PMID: 32272725 PMCID: PMC7177817 DOI: 10.3390/ijms21072563] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Revised: 04/04/2020] [Accepted: 04/05/2020] [Indexed: 12/26/2022] Open
Abstract
Maintaining wild type protein-protein interactions is essential for the normal function of cell and any mutation that alter their characteristics can cause disease. Therefore, the ability to correctly and quickly predict the effect of amino acid mutations is crucial for understanding disease effects and to be able to carry out genome-wide studies. Here, we report a new development of the SAAMBE method, SAAMBE-3D, which is a machine learning-based approach, resulting in accurate predictions and is extremely fast. It achieves the Pearson correlation coefficient ranging from 0.78 to 0.82 depending on the training protocol in benchmarking five-fold validation test against the SKEMPI v2.0 database and outperforms currently existing algorithms on various blind-tests. Furthermore, optimized and tested via five-fold cross-validation on the Cornell University dataset, the SAAMBE-3D achieves AUC of 1.0 and 0.96 on a homo and hereto-dimer test datasets. Another important feature of SAAMBE-3D is that it is very fast, it takes less than a fraction of a second to complete a prediction. SAAMBE-3D is available as a web server and as well as a stand-alone code, the last one being another important feature allowing other researchers to directly download the code and run it on their local computer. Combined all together, SAAMBE-3D is an accurate and fast software applicable for genome-wide studies to assess the effect of amino acid mutations on protein-protein interactions. The webserver and the stand-alone codes (SAAMBE-3D for predicting the change of binding free energy and SAAMBE-3D-DN for predicting if the mutation is disruptive or non-disruptive) are available.
Collapse
Affiliation(s)
- Swagata Pahari
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA; (S.P.); (G.L.); (A.K.M.)
| | - Gen Li
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA; (S.P.); (G.L.); (A.K.M.)
| | - Adithya Krishna Murthy
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA; (S.P.); (G.L.); (A.K.M.)
| | - Siqi Liang
- Department of Computational Biology, Cornell University, Ithaca, NY 14850, USA; (S.L.); (R.F.); (H.Y.)
| | - Robert Fragoza
- Department of Computational Biology, Cornell University, Ithaca, NY 14850, USA; (S.L.); (R.F.); (H.Y.)
| | - Haiyuan Yu
- Department of Computational Biology, Cornell University, Ithaca, NY 14850, USA; (S.L.); (R.F.); (H.Y.)
| | - Emil Alexov
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA; (S.P.); (G.L.); (A.K.M.)
| |
Collapse
|
27
|
Zhang N, Chen Y, Lu H, Zhao F, Alvarez RV, Goncearenco A, Panchenko AR, Li M. MutaBind2: Predicting the Impacts of Single and Multiple Mutations on Protein-Protein Interactions. iScience 2020; 23:100939. [PMID: 32169820 PMCID: PMC7068639 DOI: 10.1016/j.isci.2020.100939] [Citation(s) in RCA: 80] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Revised: 11/21/2019] [Accepted: 02/20/2020] [Indexed: 01/17/2023] Open
Abstract
Missense mutations may affect proteostasis by destabilizing or over-stabilizing protein complexes and changing the pathway flux. Predicting the effects of stabilizing mutations on protein-protein interactions is notoriously difficult because existing experimental sets are skewed toward mutations reducing protein-protein binding affinity and many computational methods fail to correctly evaluate their effects. To address this issue, we developed a method MutaBind2, which estimates the impacts of single as well as multiple mutations on protein-protein interactions. MutaBind2 employs only seven features, and the most important of them describe interactions of proteins with the solvent, evolutionary conservation of the site, and thermodynamic stability of the complex and each monomer. This approach shows a distinct improvement especially in evaluating the effects of mutations increasing binding affinity. MutaBind2 can be used for finding disease driver mutations, designing stable protein complexes, and discovering new protein-protein interaction inhibitors.
Collapse
Affiliation(s)
- Ning Zhang
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Yuting Chen
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Haoyu Lu
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Feiyang Zhao
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Roberto Vera Alvarez
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA
| | - Alexander Goncearenco
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA
| | - Anna R Panchenko
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA.
| | - Minghui Li
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China.
| |
Collapse
|
28
|
A topology-based network tree for the prediction of protein-protein binding affinity changes following mutation. NAT MACH INTELL 2020; 2:116-123. [PMID: 34170981 PMCID: PMC7223817 DOI: 10.1038/s42256-020-0149-6] [Citation(s) in RCA: 83] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 01/10/2020] [Indexed: 12/14/2022]
Abstract
The ability to predict protein-protein interactions is crucial to our understanding of a wide range of biological activities and functions in the human body, and for guiding drug discovery. Despite considerable efforts to develop suitable computational methods, predicting protein-protein interaction binding affinity changes following mutation (ΔΔG) remains a severe challenge. Algebraic topology, a champion in recent worldwide competitions for protein-ligand binding affinity predictions, is a promising approach to simplifying the complexity of biological structures. Here we introduce element- and site-specific persistent homology (a new branch of algebraic topology) to simplify the structural complexity of protein-protein complexes and embed crucial biological information into topological invariants. We also propose a new deep learning algorithm called NetTree to take advantage of convolutional neural networks and gradient-boosting trees. A topology-based network tree is constructed by integrating the topological representation and NetTree for predicting protein-protein interaction ΔΔG. Tests on major benchmark datasets indicate that the proposed topology-based network tree is an important improvement over the current state of the art in predicting ΔΔG.
Collapse
|
29
|
Generating quantitative binding landscapes through fractional binding selections combined with deep sequencing and data normalization. Nat Commun 2020; 11:297. [PMID: 31941882 PMCID: PMC6962383 DOI: 10.1038/s41467-019-13895-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Accepted: 11/28/2019] [Indexed: 11/09/2022] Open
Abstract
Quantifying the effects of various mutations on binding free energy is crucial for understanding the evolution of protein-protein interactions and would greatly facilitate protein engineering studies. Yet, measuring changes in binding free energy (ΔΔGbind) remains a tedious task that requires expression of each mutant, its purification, and affinity measurements. We developed an attractive approach that allows us to quantify ΔΔGbind for thousands of protein mutants in one experiment. Our protocol combines protein randomization, Yeast Surface Display technology, deep sequencing, and a few experimental ΔΔGbind data points on purified proteins to generate ΔΔGbind values for the remaining numerous mutants of the same protein complex. Using this methodology, we comprehensively map the single-mutant binding landscape of one of the highest-affinity interaction between BPTI and Bovine Trypsin (BT). We show that ΔΔGbind for this interaction could be quantified with high accuracy over the range of 12 kcal mol−1 displayed by various BPTI single mutants. Quantifying the effect of mutations on binding free energy is important to understand protein-protein interaction (PPI). Here the authors develop a method based on yeast display and next-generation sequencing to generate quantitative binding landscapes for any PPI regardless of their Kd value.
Collapse
|
30
|
Goswami AM. Computational analyses prioritize and reveal the deleterious nsSNPs in human angiotensinogen gene. Comput Biol Chem 2020; 84:107199. [PMID: 31931433 DOI: 10.1016/j.compbiolchem.2019.107199] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Revised: 12/26/2019] [Accepted: 12/30/2019] [Indexed: 12/27/2022]
Abstract
Angiotensinogen (AGT) is a key component of renin-angiotensin-aldosterone system (RAAS), which plays central role in blood pressure homeostasis. Association of AGT polymorphisms have been investigated in different ethnic populations in variety of cardiovascular and non-cardiovascular conditions. In this study, 354 non-synonymous SNPs (nsSNPs) of AGT were evaluated to predict damaging and structurally important variants. Majority of the deleterious nsSNPs occurred in the evolutionary conserved regions. Several of these nsSNPs were found to affect post-translational modifications like methylation, glycosylation, phosphorylation, ubiquitination etc. Structural evaluations predicted 19 variants as destabilizing and some of them were also predicted to destabilize the renin-AGT interaction. Therefore, the present computational investigation predicted pathogenic and functionally important variants of human AGT gene. The study has also shown that AGT deregulation is associated with survival outcome in patients with gastric and breast cancer, using microarray gene expression profile. Furthermore, the computationally screened nsSNPs can be analyzed in population based genotyping studies and may help futuristic drug development in the area of AGT pharmacogenomics.
Collapse
Affiliation(s)
- Achintya Mohan Goswami
- Department of Physiology, Krishnagar Govt. College, Krishnagar, Nadia, West Bengal, 741101, India.
| |
Collapse
|
31
|
Lim YY, Lim TS, Choong YS. Human IgG1 Fc pH-dependent optimization from a constant pH molecular dynamics simulation analysis. RSC Adv 2020; 10:13066-13075. [PMID: 35492131 PMCID: PMC9051383 DOI: 10.1039/c9ra10712f] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 03/11/2020] [Indexed: 11/21/2022] Open
Abstract
The binding of IgG Fc with FcRn enables the long circulating half-life of IgG, where the Fc–FcRn complex interacts in a pH-dependent manner. This complex shows stronger interaction at pH ≤ 6.5 and weaker interaction at pH ≥ 7.4. The Fc–FcRn binding mechanism that promotes the long circulating half-life of IgG has prompted several IgG Fc-related mutational studies to focus on the pH-dependent Fc–FcRn complex interactions in order to improve the pharmacokinetic properties of Fc. Hence, in this study, we applied the in silico constant pH molecular dynamics (CpHMD) simulation approach to evaluate the human Fc–FcRn complex binding (pH 6.0) and dissociating (pH 7.5) mechanism at the molecular level. The analysis showed that the protonated state of the titratable residues changes from pH 6.0 to pH 7.5, where the disrupting energy for Fc–FcRn complex formation was found to be due to the electrostatic repulsion between the complex. According to the analysis, an Fc variant was computationally designed with an improved binding affinity at pH 6.0, which is still able to dissociate at pH 7.5 with FcRn at the in silico level. The binding free energy calculation via the MMPB/GBSA approach showed that the designed Fc mutant (MutM4) has increased binding affinity only at pH 6.0 compared with the reported mutant (YTE) Fc. This work demonstrates an alternative Fc design with better binding properties for FcRn, which can be useful for future experimental evaluation and validation. An in silico IgG-Fc variant with better affinity at pH 6.0 but retained the dissociation at pH 7.5 was designed.![]()
Collapse
Affiliation(s)
- Yee Ying Lim
- Institute for Research in Molecular Medicine (INFORMM)
- Universiti Sains Malaysia
- 11800 Minden
- Malaysia
| | - Theam Soon Lim
- Institute for Research in Molecular Medicine (INFORMM)
- Universiti Sains Malaysia
- 11800 Minden
- Malaysia
| | - Yee Siew Choong
- Institute for Research in Molecular Medicine (INFORMM)
- Universiti Sains Malaysia
- 11800 Minden
- Malaysia
| |
Collapse
|
32
|
Li C, Jia Z, Chakravorty A, Pahari S, Peng Y, Basu S, Koirala M, Panday SK, Petukh M, Li L, Alexov E. DelPhi Suite: New Developments and Review of Functionalities. J Comput Chem 2019; 40:2502-2508. [PMID: 31237360 PMCID: PMC6771749 DOI: 10.1002/jcc.26006] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Revised: 05/07/2019] [Accepted: 06/09/2019] [Indexed: 12/25/2022]
Abstract
Electrostatic potential, energies, and forces affect virtually any process in molecular biology, however, computing these quantities is a difficult task due to irregularly shaped macromolecules and the presence of water. Here, we report a new edition of the popular software package DelPhi along with describing its functionalities. The new DelPhi is a C++ object-oriented package supporting various levels of multiprocessing and memory distribution. It is demonstrated that multiprocessing results in significant improvement of computational time. Furthermore, for computations requiring large grid size (large macromolecular assemblages), the approach of memory distribution is shown to reduce the requirement of RAM and thus permitting large-scale modeling to be done on Linux clusters with moderate architecture. The new release comes with new features, whose functionalities and applications are described as well. © 2019 The Authors. Journal of Computational Chemistry published by Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Chuan Li
- Department of MathematicsWest Chester University of PennsylvaniaWest ChesterPennsylvania19383
| | - Zhe Jia
- Department of Physics and AstronomyClemson UniversityClemsonSouth Carolina29634
| | - Arghya Chakravorty
- Department of Physics and AstronomyClemson UniversityClemsonSouth Carolina29634
| | - Swagata Pahari
- Department of Physics and AstronomyClemson UniversityClemsonSouth Carolina29634
| | - Yunhui Peng
- Department of Physics and AstronomyClemson UniversityClemsonSouth Carolina29634
| | - Sankar Basu
- Department of Physics and AstronomyClemson UniversityClemsonSouth Carolina29634
| | - Mahesh Koirala
- Department of Physics and AstronomyClemson UniversityClemsonSouth Carolina29634
| | | | - Marharyta Petukh
- Department of BiologyPresbyterian CollegeClintonSouth Carolina29325
| | - Lin Li
- Department of PhysicsUniversity of Texas at EI PasoTexas79968
| | - Emil Alexov
- Department of Physics and AstronomyClemson UniversityClemsonSouth Carolina29634
| |
Collapse
|
33
|
Novel Genetic Markers for Early Detection of Elevated Breast Cancer Risk in Women. Int J Mol Sci 2019; 20:ijms20194828. [PMID: 31569399 PMCID: PMC6801521 DOI: 10.3390/ijms20194828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2019] [Revised: 09/20/2019] [Accepted: 09/25/2019] [Indexed: 12/25/2022] Open
Abstract
This study suggests that two newly discovered variants in the MSH2 gene, which codes for a DNA mismatch repair (MMR) protein, can be associated with a high risk of breast cancer. While variants in the MSH2 gene are known to be linked with an elevated cancer risk, the MSH2 gene is not a part of the standard kit for testing patients for elevated breast cancer risk. Here we used the results of genetic testing of women diagnosed with breast cancer, but who did not have variants in BRCA1 and BRCA2 genes. Instead, the test identified four variants with unknown significance (VUS) in the MSH2 gene. Here, we carried in silico analysis to develop a classifier that can distinguish pathogenic from benign mutations in MSH2 genes taken from ClinVar. The classifier was then used to classify VUS in MSH2 genes, and two of them, p.Ala272Val and p.Met592Val, were predicted to be pathogenic mutations. These two mutations were found in women with breast cancer who did not have mutations in BRCA1 and BRCA2 genes, and thus they are suggested to be considered as new bio-markers for the early detection of elevated breast cancer risk. However, before this is done, an in vitro validation of mutation pathogenicity is needed and, moreover, the presence of these mutations should be demonstrated in a higher number of patients or in families with breast cancer history.
Collapse
|
34
|
Rodrigues CHM, Myung Y, Pires DEV, Ascher DB. mCSM-PPI2: predicting the effects of mutations on protein-protein interactions. Nucleic Acids Res 2019; 47:W338-W344. [PMID: 31114883 PMCID: PMC6602427 DOI: 10.1093/nar/gkz383] [Citation(s) in RCA: 200] [Impact Index Per Article: 40.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Revised: 04/30/2019] [Accepted: 05/20/2019] [Indexed: 12/13/2022] Open
Abstract
Protein-protein Interactions are involved in most fundamental biological processes, with disease causing mutations enriched at their interfaces. Here we present mCSM-PPI2, a novel machine learning computational tool designed to more accurately predict the effects of missense mutations on protein-protein interaction binding affinity. mCSM-PPI2 uses graph-based structural signatures to model effects of variations on the inter-residue interaction network, evolutionary information, complex network metrics and energetic terms to generate an optimised predictor. We demonstrate that our method outperforms previous methods, ranking first among 26 others on CAPRI blind tests. mCSM-PPI2 is freely available as a user friendly webserver at http://biosig.unimelb.edu.au/mcsm_ppi2/.
Collapse
Affiliation(s)
- Carlos H M Rodrigues
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Australia
- ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, Australia
- Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Australia
| | - Yoochan Myung
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Australia
- ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, Australia
- Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Australia
| | - Douglas E V Pires
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Australia
- ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, Australia
- Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Australia
| | - David B Ascher
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Australia
- ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, Australia
- Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Australia
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| |
Collapse
|
35
|
Spellicy CJ, Peng Y, Olewiler L, Cathey SS, Rogers RC, Bartholomew D, Johnson J, Alexov E, Lee JA, Friez MJ, Jones JR. Three additional patients with EED-associated overgrowth: potential mutation hotspots identified? J Hum Genet 2019; 64:561-572. [PMID: 30858506 DOI: 10.1038/s10038-019-0585-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2018] [Revised: 02/12/2019] [Accepted: 02/13/2019] [Indexed: 12/25/2022]
Abstract
Variants have been identified in the embryonic ectoderm development (EED) gene in seven patients with syndromic overgrowth similar to that observed in Weaver syndrome. Here, we present three additional patients with missense variants in the EED gene. All the missense variants reported to date (including the three presented here) have localized to one of seven WD40 domains of the EED protein, which are necessary for interaction with enhancer of zeste 2 polycomb repressive complex 2 subunit (EZH2). In addition, among the seven patients reported in the literature and the three new patients presented here, all of the reported pathogenic variants except one occurred at one of four amino acid residues in the EED protein. The recurrence of pathogenic variation at these loci suggests that these residues are functionally important (mutation hotspots). In silico modeling and calculations of the free energy changes resulting from these variants suggested that they not only destabilize the EED protein structure but also adversely affect interactions between EED, EZH2, and/or H3K27me3. These cases help demonstrate the mechanism(s) by which apparently deleterious variants in the EED gene might cause overgrowth and lend further support that amino acid residues in the WD40 domain region may be mutation hotspots.
Collapse
Affiliation(s)
| | - Yunhui Peng
- Computational Biophysics and Bioinformatics laboratory, Clemson University, Clemson, SC, 29634, USA
| | - Leah Olewiler
- Genetics, Nationwide Children's Hospital, Columbus, OH, 43205, USA
| | - Sara S Cathey
- Greenwood Genetic Center, Greenwood, SC, 29646, USA
- Clinical Genetics, Greenwood Genetic Center, Greenwood, SC, 29646, USA
| | - R Curtis Rogers
- Greenwood Genetic Center, Greenwood, SC, 29646, USA
- Clinical Genetics, Greenwood Genetic Center, Greenwood, SC, 29646, USA
| | | | | | - Emil Alexov
- Computational Biophysics and Bioinformatics laboratory, Clemson University, Clemson, SC, 29634, USA
| | | | | | - Julie R Jones
- Greenwood Genetic Center, Greenwood, SC, 29646, USA.
| |
Collapse
|
36
|
Chakravorty A, Gallicchio E, Alexov E. A grid-based algorithm in conjunction with a gaussian-based model of atoms for describing molecular geometry. J Comput Chem 2019; 40:1290-1304. [PMID: 30698861 PMCID: PMC6506848 DOI: 10.1002/jcc.25786] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2018] [Revised: 12/12/2018] [Accepted: 01/06/2019] [Indexed: 11/06/2022]
Abstract
A novel grid-based method is presented, which in conjunction with a smooth Gaussian-based model of atoms, is used to compute molecular volume (MV) and surface area (MSA). The MV and MSA are essential for computing nonpolar component of free energies. The objective of our grid-based approach is to identify solute atom pairs that share overlapping volumes in space. Once completed, this information is used to construct a rooted tree using depth-first method to yield the final volume and SA by using the formulations of the Gaussian model described by Grant and Pickup (J. Phys Chem, 1995, 99, 3503). The method is designed to function uninterruptedly with the grid-based finite-difference method implemented in Delphi, a popular and open-source package used for solving the Poisson-Boltzmann equation (PBE). We demonstrate the time efficacy of the method while also validating its performance in terms of the effect of grid-resolution, positioning of the solute within the grid-map and accuracy in identification of overlapping atom pairs. We also explore and discuss different aspects of the Gaussian model with key emphasis on its physical meaningfulness. This development and its future release with the Delphi package are intended to provide a physically meaningful, fast, robust and comprehensive tool for MM/PBSA based free energy calculations. © 2019 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Arghya Chakravorty
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634
| | | | - Emil Alexov
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634
| |
Collapse
|
37
|
Yang YP, He LP, Bao JX, Qi YF, Zhang JZH. Computational analysis for residue-specific CDK2-inhibitor bindings. CHINESE J CHEM PHYS 2019. [DOI: 10.1063/1674-0068/cjcp1901012] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Affiliation(s)
- Yun-peng Yang
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
| | - Li-ping He
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
| | - Jing-xiao Bao
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
| | - Yi-fei Qi
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| | - John Z. H. Zhang
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Department of Chemistry, New York University, NY, NY 10003, USA
| |
Collapse
|
38
|
Peng Y, Alexov E, Basu S. Structural Perspective on Revealing and Altering Molecular Functions of Genetic Variants Linked with Diseases. Int J Mol Sci 2019; 20:ijms20030548. [PMID: 30696058 PMCID: PMC6386852 DOI: 10.3390/ijms20030548] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2018] [Revised: 01/25/2019] [Accepted: 01/26/2019] [Indexed: 12/25/2022] Open
Abstract
Structural information of biological macromolecules is crucial and necessary to deliver predictions about the effects of mutations-whether polymorphic or deleterious (i.e., disease causing), wherein, thermodynamic parameters, namely, folding and binding free energies potentially serve as effective biomarkers. It may be emphasized that the effect of a mutation depends on various factors, including the type of protein (globular, membrane or intrinsically disordered protein) and the structural context in which it occurs. Such information may positively aid drug-design. Furthermore, due to the intrinsic plasticity of proteins, even mutations involving radical change of the structural and physico⁻chemical properties of the amino acids (native vs. mutant) can still have minimal effects on protein thermodynamics. However, if a mutation causes significant perturbation by either folding or binding free energies, it is quite likely to be deleterious. Mitigating such effects is a promising alternative to the traditional approaches of designing inhibitors. This can be done by structure-based in silico screening of small molecules for which binding to the dysfunctional protein restores its wild type thermodynamics. In this review we emphasize the effects of mutations on two important biophysical properties, stability and binding affinity, and how structures can be used for structure-based drug design to mitigate the effects of disease-causing variants on the above biophysical properties.
Collapse
Affiliation(s)
- Yunhui Peng
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA.
| | - Emil Alexov
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA.
| | - Sankar Basu
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA.
| |
Collapse
|
39
|
Geng C, Xue LC, Roel‐Touris J, Bonvin AMJJ. Finding the ΔΔ
G
spot: Are predictors of binding affinity changes upon mutations in protein–protein interactions ready for it? WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2019. [DOI: 10.1002/wcms.1410] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Affiliation(s)
- Cunliang Geng
- Bijvoet Center for Biomolecular Research, Faculty of Science—Chemistry Utrecht University Utrecht The Netherlands
| | - Li C. Xue
- Bijvoet Center for Biomolecular Research, Faculty of Science—Chemistry Utrecht University Utrecht The Netherlands
| | - Jorge Roel‐Touris
- Bijvoet Center for Biomolecular Research, Faculty of Science—Chemistry Utrecht University Utrecht The Netherlands
| | - Alexandre M. J. J. Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science—Chemistry Utrecht University Utrecht The Netherlands
| |
Collapse
|
40
|
Valdebenito-Maturana B, Reyes-Suarez JA, Henriquez J, Holmes DS, Quatrini R, Pohl E, Arenas-Salinas M. Mutantelec: An In Silico mutation simulation platform for comparative electrostatic potential profiling of proteins. J Comput Chem 2018; 38:467-474. [PMID: 28114729 DOI: 10.1002/jcc.24712] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2016] [Revised: 12/06/2016] [Accepted: 12/07/2016] [Indexed: 11/07/2022]
Abstract
The electrostatic potential plays a key role in many biological processes like determining the affinity of a ligand to a given protein target, and they are responsible for the catalytic activity of many enzymes. Understanding the effect that amino acid mutations will have on the electrostatic potential of a protein, will allow a thorough understanding of which residues are the most important in a protein. MutantElec, is a friendly web application for in silico generation of site-directed mutagenesis of proteins and the comparison of electrostatic potential between the wild type protein and the mutant(s), based on the three-dimensional structure of the protein. The effect of the mutation is evaluated using different approach to the traditional surface map. MutantElec provides a graphical display of the results that allows the visualization of changes occurring at close distance from the mutation and thus uncovers the local and global impact of a specific change. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Braulio Valdebenito-Maturana
- Centro de Bioinformática y Simulación Molecular, Facultad de Ingeniería, Universidad de Talca, Talca, 346 5548, Chile
| | - Jose Antonio Reyes-Suarez
- Centro de Bioinformática y Simulación Molecular, Facultad de Ingeniería, Universidad de Talca, Talca, 346 5548, Chile
| | - Jaime Henriquez
- Centro de Bioinformática y Simulación Molecular, Facultad de Ingeniería, Universidad de Talca, Talca, 346 5548, Chile
| | - David S Holmes
- Fundación Ciencia & Vida, Santiago, 778 0272, Chile.,Facultad de Ciencias Biologicas, Universidad Andres Bello, Santiago, Chile
| | | | - Ehmke Pohl
- Department of Chemistry, Durham University, Durham, DH1 3LE, United Kingdom.,Department of Biosciences, Durham University, Durham, DH1 3LE, United Kingdom.,Biophysical Sciences Institute, Durham University, Durham, DH1 3LE, United Kingdom
| | - Mauricio Arenas-Salinas
- Centro de Bioinformática y Simulación Molecular, Facultad de Ingeniería, Universidad de Talca, Talca, 346 5548, Chile
| |
Collapse
|
41
|
Prasad R, Banerjee S, Sen P. Contribution of allosteric disulfide in the structural regulation of membrane-bound tissue factor-factor VIIa binary complex. J Biomol Struct Dyn 2018; 37:3707-3720. [PMID: 30238846 DOI: 10.1080/07391102.2018.1526118] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Two distinct populations, active and cryptic forms of tissue factor (TF), reside on the cell surface. Apart from phospholipid contribution, various models have been introduced to explain decryption/encryption of TF. The proposed model, the switching of Cys186-Cys209 bond of TF, has become the matter of controversy. However, it is well accepted that this disulfide has an immense influence upon ligand factor VIIa (FVIIa) for its binding. However, molecular level understanding for this remains unveiled due to lack of detailed structural information. In this regard, we have performed the molecular dynamic study of membrane-bound TF/TF-FVIIa in both the forms (±Cys186-Cys209 allosteric disulfide bond), individually. Dynamic study depicts that disulfide bond provides structural rigidity of TF in both free and ligand-bound forms. This disulfide bond also governs the conformation of FVIIa structure as well as the binding affinity of FVIIa toward TF. Significant differences in lipid-protein interaction profiles of both the forms of TF in the complex were observed. Two forms of TF, oxidized and reduced, have different structural conformation and behave differentially toward its ligand FVIIa. This disulfide bond not only alters the conformation of GLA domain of FVIIa in the vicinity but allosterically regulates the conformation of the distantly located FVIIa protease domain. We suggest that the redox status of the disulfide bond also governs the lipid-mediated interactions with both TF and FVIIa. Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Ramesh Prasad
- a Department of Biological Chemistry , Indian Association for the Cultivation of Science , Jadavpur , Kolkata , India
| | - Suparna Banerjee
- a Department of Biological Chemistry , Indian Association for the Cultivation of Science , Jadavpur , Kolkata , India
| | - Prosenjit Sen
- a Department of Biological Chemistry , Indian Association for the Cultivation of Science , Jadavpur , Kolkata , India
| |
Collapse
|
42
|
Hodges AM, Fenton AW, Dougherty LL, Overholt AC, Swint-Kruse L. RheoScale: A tool to aggregate and quantify experimentally determined substitution outcomes for multiple variants at individual protein positions. Hum Mutat 2018; 39:1814-1826. [PMID: 30117637 DOI: 10.1002/humu.23616] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2018] [Revised: 07/31/2018] [Accepted: 08/13/2018] [Indexed: 12/25/2022]
Abstract
Human mutations often cause amino acid changes (variants) that can alter protein function or stability. Some variants fall at protein positions that experimentally exhibit "rheostatic" mutation outcomes (different amino acid substitutions lead to a range of functional outcomes). In ongoing studies of rheostat positions, we encountered the need to aggregate experimental results from multiple variants, to describe the overall roles of individual positions. Here, we present "RheoScale" which generates quantitative scores to discriminate rheostat positions from those with "toggle" (most substitutions abolish function) or "neutral" (most substitutions have wild-type function) outcomes. RheoScale scores facilitate correlations of experimental data (such as binding affinity or stability) with structural and bioinformatic analyses. The RheoScale calculator is encoded into a Microsoft Excel workbook and an R script. Example analyses are shown for three model protein systems, including one assessed via deep mutational scanning. The RheoScale calculator quickly and efficiently provided quantitative descriptions that were in good agreement with prior qualitative observations. As an example application, scores were compared to the example proteins' structures; strong rheostat positions tended to occur in dynamic locations. In the future, RheoScale scores can be easily integrated into computational studies to facilitate improved algorithms for predicting outcomes of human variants.
Collapse
Affiliation(s)
- Abby M Hodges
- Department of Natural, Health, and Mathematical Sciences, MidAmerica Nazarene University, Olathe, Kansas, USA
| | - Aron W Fenton
- The Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Larissa L Dougherty
- The Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Andrew C Overholt
- Department of Natural, Health, and Mathematical Sciences, MidAmerica Nazarene University, Olathe, Kansas, USA
| | - Liskin Swint-Kruse
- The Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| |
Collapse
|
43
|
Computational Approaches to Prioritize Cancer Driver Missense Mutations. Int J Mol Sci 2018; 19:ijms19072113. [PMID: 30037003 PMCID: PMC6073793 DOI: 10.3390/ijms19072113] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Revised: 07/02/2018] [Accepted: 07/05/2018] [Indexed: 12/31/2022] Open
Abstract
Cancer is a complex disease that is driven by genetic alterations. There has been a rapid development of genome-wide techniques during the last decade along with a significant lowering of the cost of gene sequencing, which has generated widely available cancer genomic data. However, the interpretation of genomic data and the prediction of the association of genetic variations with cancer and disease phenotypes still requires significant improvement. Missense mutations, which can render proteins non-functional and provide a selective growth advantage to cancer cells, are frequently detected in cancer. Effects caused by missense mutations can be pinpointed by in silico modeling, which makes it more feasible to find a treatment and reverse the effect. Specific human phenotypes are largely determined by stability, activity, and interactions between proteins and other biomolecules that work together to execute specific cellular functions. Therefore, analysis of missense mutations’ effects on proteins and their complexes would provide important clues for identifying functionally important missense mutations, understanding the molecular mechanisms of cancer progression and facilitating treatment and prevention. Herein, we summarize the major computational approaches and tools that provide not only the classification of missense mutations as cancer drivers or passengers but also the molecular mechanisms induced by driver mutations. This review focuses on the discussion of annotation and prediction methods based on structural and biophysical data, analysis of somatic cancer missense mutations in 3D structures of proteins and their complexes, predictions of the effects of missense mutations on protein stability, protein-protein and protein-nucleic acid interactions, and assessment of conformational changes in protein conformations induced by mutations.
Collapse
|
44
|
Jemimah S, Yugandhar K, Michael Gromiha M. PROXiMATE: a database of mutant protein-protein complex thermodynamics and kinetics. Bioinformatics 2018; 33:2787-2788. [PMID: 28498885 DOI: 10.1093/bioinformatics/btx312] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2016] [Accepted: 05/10/2017] [Indexed: 11/13/2022] Open
Abstract
Summary We have developed PROXiMATE, a database of thermodynamic data for more than 6000 missense mutations in 174 heterodimeric protein-protein complexes, supplemented with interaction network data from STRING database, solvent accessibility, sequence, structural and functional information, experimental conditions and literature information. Additional features include complex structure visualization, search and display options, download options and a provision for users to upload their data. Availability and implementation The database is freely available at http://www.iitm.ac.in/bioinfo/PROXiMATE/ . The website is implemented in Python, and supports recent versions of major browsers such as IE10, Firefox, Chrome and Opera. Contact gromiha@iitm.ac.in. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sherlyn Jemimah
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
| | - K Yugandhar
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India
| |
Collapse
|
45
|
Chakravorty A, Jia Z, Li L, Zhao S, Alexov E. Reproducing the Ensemble Average Polar Solvation Energy of a Protein from a Single Structure: Gaussian-Based Smooth Dielectric Function for Macromolecular Modeling. J Chem Theory Comput 2018; 14:1020-1032. [PMID: 29350933 PMCID: PMC9885857 DOI: 10.1021/acs.jctc.7b00756] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Typically, the ensemble average polar component of solvation energy (ΔGpolarsolv) of a macromolecule is computed using molecular dynamics (MD) or Monte Carlo (MC) simulations to generate conformational ensemble and then single/rigid conformation solvation energy calculation is performed on each snapshot. The primary objective of this work is to demonstrate that Poisson-Boltzmann (PB)-based approach using a Gaussian-based smooth dielectric function for macromolecular modeling previously developed by us (Li et al. J. Chem. Theory Comput. 2013, 9 (4), 2126-2136) can reproduce that ensemble average (ΔGpolarsolv) of a protein from a single structure. We show that the Gaussian-based dielectric model reproduces the ensemble average ΔGpolarsolv(⟨ΔGpolarsolv⟩) from an energy-minimized structure of a protein regardless of the minimization environment (structure minimized in vacuo, implicit or explicit waters, or crystal structure); the best case, however, is when it is paired with an in vacuo-minimized structure. In other minimization environments (implicit or explicit waters or crystal structure), the traditional two-dielectric model can still be selected with which the model produces correct solvation energies. Our observations from this work reflect how the ability to appropriately mimic the motion of residues, especially the salt bridge residues, influences a dielectric model's ability to reproduce the ensemble average value of polar solvation free energy from a single in vacuo-minimized structure.
Collapse
Affiliation(s)
- Arghya Chakravorty
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA
| | - Zhe Jia
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA
| | - Lin Li
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA
| | - Shan Zhao
- Departement of Mathematics, College of Arts and Sciences, University of Alabama, Tuscaloosa, Alabama 35487, USA
| | - Emil Alexov
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA.,Corresponding Author Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA.
| |
Collapse
|
46
|
Liu X, Peng L, Zhou Y, Zhang Y, Zhang JZH. Computational Alanine Scanning with Interaction Entropy for Protein–Ligand Binding Free Energies. J Chem Theory Comput 2018; 14:1772-1780. [DOI: 10.1021/acs.jctc.7b01295] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Xiao Liu
- State Key Laboratory for Precision Spectroscopy, Shanghai Engineering Research Center of Molecular Therapeutics & New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
| | - Long Peng
- State Key Laboratory for Precision Spectroscopy, Shanghai Engineering Research Center of Molecular Therapeutics & New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
| | - Yifan Zhou
- State Key Laboratory for Precision Spectroscopy, Shanghai Engineering Research Center of Molecular Therapeutics & New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
| | - Youzhi Zhang
- State Key Laboratory for Precision Spectroscopy, Shanghai Engineering Research Center of Molecular Therapeutics & New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
| | - John Z. H. Zhang
- State Key Laboratory for Precision Spectroscopy, Shanghai Engineering Research Center of Molecular Therapeutics & New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Department of Chemistry, New York University, New York, New York 10003, United States
- Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan, Shanxi 030006, China
| |
Collapse
|
47
|
Wang C, Greene D, Xiao L, Qi R, Luo R. Recent Developments and Applications of the MMPBSA Method. Front Mol Biosci 2018; 4:87. [PMID: 29367919 PMCID: PMC5768160 DOI: 10.3389/fmolb.2017.00087] [Citation(s) in RCA: 325] [Impact Index Per Article: 54.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Accepted: 11/30/2017] [Indexed: 12/23/2022] Open
Abstract
The Molecular Mechanics Poisson-Boltzmann Surface Area (MMPBSA) approach has been widely applied as an efficient and reliable free energy simulation method to model molecular recognition, such as for protein-ligand binding interactions. In this review, we focus on recent developments and applications of the MMPBSA method. The methodology review covers solvation terms, the entropy term, extensions to membrane proteins and high-speed screening, and new automation toolkits. Recent applications in various important biomedical and chemical fields are also reviewed. We conclude with a few future directions aimed at making MMPBSA a more robust and efficient method.
Collapse
Affiliation(s)
- Changhao Wang
- Chemical and Materials Physics Graduate Program, University of California, Irvine, Irvine, CA, United States
- Department of Molecular Biology and Biochemistry, University of California, Irvine, Irvine, CA, United States
- Department of Physics and Astronomy, University of California, Irvine, Irvine, CA, United States
| | - D'Artagnan Greene
- Department of Molecular Biology and Biochemistry, University of California, Irvine, Irvine, CA, United States
| | - Li Xiao
- Department of Molecular Biology and Biochemistry, University of California, Irvine, Irvine, CA, United States
- Department of Biomedical Engineering, University of California, Irvine, Irvine, CA, United States
| | - Ruxi Qi
- Department of Molecular Biology and Biochemistry, University of California, Irvine, Irvine, CA, United States
| | - Ray Luo
- Chemical and Materials Physics Graduate Program, University of California, Irvine, Irvine, CA, United States
- Department of Molecular Biology and Biochemistry, University of California, Irvine, Irvine, CA, United States
- Department of Biomedical Engineering, University of California, Irvine, Irvine, CA, United States
- Department of Chemical Engineering and Materials Science, University of California, Irvine, Irvine, CA, United States
| |
Collapse
|
48
|
Glusman G, Rose PW, Prlić A, Dougherty J, Duarte JM, Hoffman AS, Barton GJ, Bendixen E, Bergquist T, Bock C, Brunk E, Buljan M, Burley SK, Cai B, Carter H, Gao J, Godzik A, Heuer M, Hicks M, Hrabe T, Karchin R, Leman JK, Lane L, Masica DL, Mooney SD, Moult J, Omenn GS, Pearl F, Pejaver V, Reynolds SM, Rokem A, Schwede T, Song S, Tilgner H, Valasatava Y, Zhang Y, Deutsch EW. Mapping genetic variations to three-dimensional protein structures to enhance variant interpretation: a proposed framework. Genome Med 2017; 9:113. [PMID: 29254494 PMCID: PMC5735928 DOI: 10.1186/s13073-017-0509-y] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
The translation of personal genomics to precision medicine depends on the accurate interpretation of the multitude of genetic variants observed for each individual. However, even when genetic variants are predicted to modify a protein, their functional implications may be unclear. Many diseases are caused by genetic variants affecting important protein features, such as enzyme active sites or interaction interfaces. The scientific community has catalogued millions of genetic variants in genomic databases and thousands of protein structures in the Protein Data Bank. Mapping mutations onto three-dimensional (3D) structures enables atomic-level analyses of protein positions that may be important for the stability or formation of interactions; these may explain the effect of mutations and in some cases even open a path for targeted drug development. To accelerate progress in the integration of these data types, we held a two-day Gene Variation to 3D (GVto3D) workshop to report on the latest advances and to discuss unmet needs. The overarching goal of the workshop was to address the question: what can be done together as a community to advance the integration of genetic variants and 3D protein structures that could not be done by a single investigator or laboratory? Here we describe the workshop outcomes, review the state of the field, and propose the development of a framework with which to promote progress in this arena. The framework will include a set of standard formats, common ontologies, a common application programming interface to enable interoperation of the resources, and a Tool Registry to make it easy to find and apply the tools to specific analysis problems. Interoperability will enable integration of diverse data sources and tools and collaborative development of variant effect prediction methods.
Collapse
Affiliation(s)
| | - Peter W Rose
- San Diego Supercomputer Center, University of California San Diego, La Jolla, CA, 98093, USA
| | - Andreas Prlić
- San Diego Supercomputer Center, University of California San Diego, La Jolla, CA, 98093, USA.,RCSB Protein Data Bank, University of California San Diego, La Jolla, CA, 98093, USA
| | | | - José M Duarte
- RCSB Protein Data Bank, University of California San Diego, La Jolla, CA, 98093, USA
| | - Andrew S Hoffman
- Human Centered Design & Engineering, University of Washington, Seattle, WA, 98195, USA
| | - Geoffrey J Barton
- Division of Computational Biology, School of Life Sciences, University of Dundee, Dundee, DD1 5EH, UK
| | - Emøke Bendixen
- Department of Molecular Biology and Genetics, Aarhus University, 8000, Aarhus, Denmark
| | - Timothy Bergquist
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, 98109, USA
| | - Christian Bock
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, 98109, USA
| | - Elizabeth Brunk
- University of California San Diego, La Jolla, CA, 92093, USA
| | - Marija Buljan
- Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Zurich, Switzerland
| | - Stephen K Burley
- San Diego Supercomputer Center, University of California San Diego, La Jolla, CA, 98093, USA.,RCSB Protein Data Bank, University of California San Diego, La Jolla, CA, 98093, USA.,Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA
| | - Binghuang Cai
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, 98109, USA
| | - Hannah Carter
- University of California San Diego, La Jolla, CA, 92093, USA
| | - JianJiong Gao
- Kravis Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| | - Adam Godzik
- SBP Medical Discovery Institute, La Jolla, CA, 92037, USA
| | - Michael Heuer
- AMPLab, University of California, Berkeley, CA, 94720, USA
| | | | - Thomas Hrabe
- SBP Medical Discovery Institute, La Jolla, CA, 92037, USA
| | - Rachel Karchin
- Department of Biomedical Engineering, Institute for Computational Medicine, Johns Hopkins University, Baltimore, MD, 21218, USA.,Department of Oncology, Johns Hopkins Medicine, Baltimore, MD, 21287, USA
| | - Julia Koehler Leman
- Flatiron Institute, Center for Computational Biology, Simons Foundation, New York, NY, 10010, USA.,Department of Biology and Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - Lydie Lane
- SIB Swiss Institute of Bioinformatics and University of Geneva, CH-1211, Geneva, Switzerland
| | - David L Masica
- Department of Biomedical Engineering, Institute for Computational Medicine, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Sean D Mooney
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, 98109, USA
| | - John Moult
- Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD, 20850, USA.,Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, 20742, USA
| | - Gilbert S Omenn
- Institute for Systems Biology, Seattle, WA, 98109, USA.,Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI, 48109-2218, USA
| | - Frances Pearl
- School of Life Sciences, University of Sussex, Brighton, BN1 9QG, UK
| | - Vikas Pejaver
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, 98109, USA.,The University of Washington eScience Institute, Seattle, WA, 98195, USA
| | | | - Ariel Rokem
- The University of Washington eScience Institute, Seattle, WA, 98195, USA
| | - Torsten Schwede
- SIB Swiss Institute of Bioinformatics and Biozentrum University of Basel, CH-4056, Basel, Switzerland
| | - Sicheng Song
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, 98109, USA
| | - Hagen Tilgner
- Brain and Mind Research Institute, Weill Cornell Medicine, New York City, NY, 10021, USA
| | - Yana Valasatava
- RCSB Protein Data Bank, University of California San Diego, La Jolla, CA, 98093, USA
| | - Yang Zhang
- Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI, 48109-2218, USA
| | | |
Collapse
|
49
|
Exploring Protein-Protein Interactions as Drug Targets for Anti-cancer Therapy with In Silico Workflows. Methods Mol Biol 2017; 1647:221-236. [PMID: 28809006 DOI: 10.1007/978-1-4939-7201-2_15] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
We describe a computational protocol to aid the design of small molecule and peptide drugs that target protein-protein interactions, particularly for anti-cancer therapy. To achieve this goal, we explore multiple strategies, including finding binding hot spots, incorporating chemical similarity and bioactivity data, and sampling similar binding sites from homologous protein complexes. We demonstrate how to combine existing interdisciplinary resources with examples of semi-automated workflows. Finally, we discuss several major problems, including the occurrence of drug-resistant mutations, drug promiscuity, and the design of dual-effect inhibitors.
Collapse
|
50
|
Gromiha MM, Yugandhar K, Jemimah S. Protein-protein interactions: scoring schemes and binding affinity. Curr Opin Struct Biol 2016; 44:31-38. [PMID: 27866112 DOI: 10.1016/j.sbi.2016.10.016] [Citation(s) in RCA: 80] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Revised: 09/30/2016] [Accepted: 10/25/2016] [Indexed: 01/16/2023]
Abstract
Protein-protein interactions mediate several cellular functions, which can be understood from the information obtained using the three-dimensional structures of protein-protein complexes and binding affinity data. This review focuses on computational aspects of predicting the best native-like complex structure and binding affinities. The first part covers the prediction of protein-protein complex structures and the advantages of conformational searching and scoring functions in protein-protein docking. The second part is devoted to various aspects of protein-protein interaction thermodynamics, such as databases for binding affinities and other thermodynamic parameters, computational methods to predict the binding affinity using either the three-dimensional structures of complexes or amino acid sequences, and change in binding affinities of the complexes upon mutations. We provide the latest developments on protein-protein docking and binding affinity studies along with a list of available computational resources for understanding protein-protein interactions.
Collapse
Affiliation(s)
- M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India.
| | - K Yugandhar
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
| | - Sherlyn Jemimah
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
| |
Collapse
|