1
|
Ullah SA, Yang X, Jones B, Zhao S, Geng W, Wei GW. Bridging Eulerian and Lagrangian Poisson-Boltzmann solvers by ESES. J Comput Chem 2024; 45:306-320. [PMID: 37830273 PMCID: PMC10993026 DOI: 10.1002/jcc.27239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 08/08/2023] [Accepted: 09/24/2023] [Indexed: 10/14/2023]
Abstract
The Poisson-Boltzmann (PB) model is a widely used electrostatic model for biomolecular solvation analysis. Formulated as an elliptic interface problem, the PB model can be numerically solved on either Eulerian meshes using finite difference/finite element methods or Lagrangian meshes using boundary element methods. Molecular surface generators, which produce the discretized dielectric interfaces between solutes and solvents, are critical factors in determining the accuracy and efficiency of the PB solvers. In this work, we investigate the utility of the Eulerian Solvent Excluded Surface (ESES) software for rendering conjugated Eulerian and Lagrangian surface representations, which enables us to numerically validate and compare the quality of Eulerian PB solvers, such as the MIBPB solver, and the Lagrangian PB solvers, such as the TABI-PB solver. Furthermore, with the ESES software and its associated PB solvers, we are able to numerically validate an interesting and useful but often neglected source-target symmetric property associated with the linearized PB model.
Collapse
Affiliation(s)
| | - Xin Yang
- Department of Mathematics, Southern Methodist University, Dallas, Texas, USA
| | - Ben Jones
- Department of Mathematics, Michigan State University, East Lansing, Michigan, USA
| | - Shan Zhao
- Department of Mathematics, University of Alabama, Tuscaloosa, Alabama, USA
| | - Weihua Geng
- Department of Mathematics, Southern Methodist University, Dallas, Texas, USA
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, East Lansing, Michigan, USA
| |
Collapse
|
2
|
Liao J, Wu M, Gao J, Chen C. Calculation of solvation force in molecular dynamics simulation by deep-learning method. Biophys J 2024:S0006-3495(24)00163-2. [PMID: 38444159 DOI: 10.1016/j.bpj.2024.02.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 01/31/2024] [Accepted: 02/29/2024] [Indexed: 03/07/2024] Open
Abstract
Electrostatic calculations are generally used in studying the thermodynamics and kinetics of biomolecules in solvent. Generally, this is performed by solving the Poisson-Boltzmann equation on a large grid system, a process known to be time consuming. In this study, we developed a deep neural network to predict the decomposed solvation free energies and forces of all atoms in a molecule. To train the network, the internal coordinates of the molecule were used as the input data, and the solvation free energies along with transformed atomic forces from the Poisson-Boltzmann equation were used as labels. Both the training and prediction tasks were accelerated on GPU. Formal tests demonstrated that our method can provide reasonable predictions for small molecules when the network is well-trained with its simulation data. This method is suitable for processing lots of snapshots of molecules in a long trajectory. Moreover, we applied this method in the molecular dynamics simulation with enhanced sampling. The calculated free energy landscape closely resembled that obtained from explicit solvent simulations.
Collapse
Affiliation(s)
- Jun Liao
- Biomolecular Physics and Modeling Group, School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Mincong Wu
- Biomolecular Physics and Modeling Group, School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Junyong Gao
- Biomolecular Physics and Modeling Group, School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Changjun Chen
- Biomolecular Physics and Modeling Group, School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China.
| |
Collapse
|
3
|
Qiu Y, Wei GW. Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models. Brief Bioinform 2023; 24:bbad289. [PMID: 37580175 PMCID: PMC10516362 DOI: 10.1093/bib/bbad289] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 07/14/2023] [Accepted: 07/26/2023] [Indexed: 08/16/2023] Open
Abstract
Protein engineering is an emerging field in biotechnology that has the potential to revolutionize various areas, such as antibody design, drug discovery, food security, ecology, and more. However, the mutational space involved is too vast to be handled through experimental means alone. Leveraging accumulative protein databases, machine learning (ML) models, particularly those based on natural language processing (NLP), have considerably expedited protein engineering. Moreover, advances in topological data analysis (TDA) and artificial intelligence-based protein structure prediction, such as AlphaFold2, have made more powerful structure-based ML-assisted protein engineering strategies possible. This review aims to offer a comprehensive, systematic, and indispensable set of methodological components, including TDA and NLP, for protein engineering and to facilitate their future development.
Collapse
Affiliation(s)
- Yuchi Qiu
- Department of Mathematics, Michigan State University, East Lansing, 48824 MI, USA
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, East Lansing, 48824 MI, USA
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, 48824 MI, USA
- Department of Electrical and Computer Engineering, Michigan State University, East Lansing, 48824 MI, USA
| |
Collapse
|
4
|
Liao J, Shu Z, Gao J, Wu M, Chen C. SurfPB: A GPU-Accelerated Electrostatic Calculation and Visualization Tool for Biomolecules. J Chem Inf Model 2023; 63:4490-4496. [PMID: 37500509 DOI: 10.1021/acs.jcim.3c00745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
In this work, we present SurfPB as a useful tool for the study of biomolecules. It can do many typical calculations, including the molecular surface, electrostatic potential, solvation free energy, entropy, and binding free energy. Among all of the calculations, the entropy calculation is the most time-consuming one. In SurfPB, the calculation can be performed in a vacuum or implicit solvent and accelerated on GPU. The Poisson-Boltzmann equation solver is accelerated on GPU as well. Moreover, we developed a graphical user interface for SurfPB. It allows users to input the parameters and complete the whole calculation in a visual way. The calculated electrostatic potentials are shown on the molecular surface in a three-dimensional scene.
Collapse
Affiliation(s)
- Jun Liao
- Biomolecular Physics and Modeling Group, School of Physics Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Zirui Shu
- Biomolecular Physics and Modeling Group, School of Physics Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Junyong Gao
- Biomolecular Physics and Modeling Group, School of Physics Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Mincong Wu
- Biomolecular Physics and Modeling Group, School of Physics Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Changjun Chen
- Biomolecular Physics and Modeling Group, School of Physics Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| |
Collapse
|
5
|
Qiu Y, Wei GW. Artificial intelligence-aided protein engineering: from topological data analysis to deep protein language models. ARXIV 2023:arXiv:2307.14587v1. [PMID: 37547662 PMCID: PMC10402185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
Protein engineering is an emerging field in biotechnology that has the potential to revolutionize various areas, such as antibody design, drug discovery, food security, ecology, and more. However, the mutational space involved is too vast to be handled through experimental means alone. Leveraging accumulative protein databases, machine learning (ML) models, particularly those based on natural language processing (NLP), have considerably expedited protein engineering. Moreover, advances in topological data analysis (TDA) and artificial intelligence-based protein structure prediction, such as AlphaFold2, have made more powerful structure-based ML-assisted protein engineering strategies possible. This review aims to offer a comprehensive, systematic, and indispensable set of methodological components, including TDA and NLP, for protein engineering and to facilitate their future development.
Collapse
Affiliation(s)
- Yuchi Qiu
- Department of Mathematics, Michigan State University, East Lansing, 48824, MI, USA
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, East Lansing, 48824, MI, USA
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, 48824, MI, USA
- Department of Electrical and Computer Engineering, Michigan State University, East Lansing, 48824, MI, USA
| |
Collapse
|
6
|
Chemistry-informed molecular graph as reaction descriptor for machine-learned retrosynthesis planning. Proc Natl Acad Sci U S A 2022; 119:e2212711119. [PMID: 36191228 PMCID: PMC9564830 DOI: 10.1073/pnas.2212711119] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Infusing "chemical wisdom" should improve the data-driven approaches that rely exclusively on historical synthetic data for automatic retrosynthesis planning. For this purpose, we designed a chemistry-informed molecular graph (CIMG) to describe chemical reactions. A collection of key information that is most relevant to chemical reactions is integrated in CIMG:NMR chemical shifts as vertex features, bond dissociation energies as edge features, and solvent/catalyst information as global features. For any given compound as a target, a product CIMG is generated and exploited by a graph neural network (GNN) model to choose reaction template(s) leading to this product. A reactant CIMG is then inferred and used in two GNN models to select appropriate catalyst and solvent, respectively. Finally, a fourth GNN model compares the two CIMG descriptors to check the plausibility of the proposed reaction. A reaction vector is obtained for every molecule in training these models. The chemical wisdom of reaction propensity contained in the pretrained reaction vectors is exploited to autocategorize molecules/reactions and to accelerate Monte Carlo tree search (MCTS) for multistep retrosynthesis planning. Full synthetic routes with recommended catalysts/solvents are predicted efficiently using this CIMG-based approach.
Collapse
|
7
|
Reis PBPS, Bertolini M, Montanari F, Rocchia W, Machuqueiro M, Clevert DA. A Fast and Interpretable Deep Learning Approach for Accurate Electrostatics-Driven p Ka Predictions in Proteins. J Chem Theory Comput 2022; 18:5068-5078. [PMID: 35837736 DOI: 10.1021/acs.jctc.2c00308] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Existing computational methods for estimating pKa values in proteins rely on theoretical approximations and lengthy computations. In this work, we use a data set of 6 million theoretically determined pKa shifts to train deep learning models, which are shown to rival the physics-based predictors. These neural networks managed to infer the electrostatic contributions of different chemical groups and learned the importance of solvent exposure and close interactions, including hydrogen bonds. Although trained only using theoretical data, our pKAI+ model displayed the best accuracy in a test set of ∼750 experimental values. Inference times allow speedups of more than 1000× compared to physics-based methods. By combining speed, accuracy, and a reasonable understanding of the underlying physics, our models provide a game-changing solution for fast estimations of macroscopic pKa values from ensembles of microscopic values as well as for many downstream applications such as molecular docking and constant-pH molecular dynamics simulations.
Collapse
Affiliation(s)
| | - Marco Bertolini
- Machine Learning Research, Bayer A.G., Berlin 13353, Germany
| | | | - Walter Rocchia
- CONCEPT Lab, Istituto Italiano di Tecnologia (IIT), Via Melen 83, B Block, Genoa 16152, Italy
| | - Miguel Machuqueiro
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, Campo Grande, Lisboa 1749-016, Portugal
| | | |
Collapse
|
8
|
Gao K, Wang R, Chen J, Cheng L, Frishcosy J, Huzumi Y, Qiu Y, Schluckbier T, Wei X, Wei GW. Methodology-Centered Review of Molecular Modeling, Simulation, and Prediction of SARS-CoV-2. Chem Rev 2022; 122:11287-11368. [PMID: 35594413 PMCID: PMC9159519 DOI: 10.1021/acs.chemrev.1c00965] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Despite tremendous efforts in the past two years, our understanding of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), virus-host interactions, immune response, virulence, transmission, and evolution is still very limited. This limitation calls for further in-depth investigation. Computational studies have become an indispensable component in combating coronavirus disease 2019 (COVID-19) due to their low cost, their efficiency, and the fact that they are free from safety and ethical constraints. Additionally, the mechanism that governs the global evolution and transmission of SARS-CoV-2 cannot be revealed from individual experiments and was discovered by integrating genotyping of massive viral sequences, biophysical modeling of protein-protein interactions, deep mutational data, deep learning, and advanced mathematics. There exists a tsunami of literature on the molecular modeling, simulations, and predictions of SARS-CoV-2 and related developments of drugs, vaccines, antibodies, and diagnostics. To provide readers with a quick update about this literature, we present a comprehensive and systematic methodology-centered review. Aspects such as molecular biophysics, bioinformatics, cheminformatics, machine learning, and mathematics are discussed. This review will be beneficial to researchers who are looking for ways to contribute to SARS-CoV-2 studies and those who are interested in the status of the field.
Collapse
Affiliation(s)
- Kaifu Gao
- Department
of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Rui Wang
- Department
of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Jiahui Chen
- Department
of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Limei Cheng
- Clinical
Pharmacology and Pharmacometrics, Bristol
Myers Squibb, Princeton, New Jersey 08536, United States
| | - Jaclyn Frishcosy
- Department
of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Yuta Huzumi
- Department
of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Yuchi Qiu
- Department
of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Tom Schluckbier
- Department
of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Xiaoqi Wei
- Department
of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Guo-Wei Wei
- Department
of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
- Department
of Electrical and Computer Engineering, Michigan State University, East Lansing, Michigan 48824, United States
- Department
of Biochemistry and Molecular Biology, Michigan
State University, East Lansing, Michigan 48824, United States
| |
Collapse
|