1
|
Vornweg J, Jacob CR. Protein-Ligand Interaction Energies from Quantum-Chemical Fragmentation Methods: Upgrading the MFCC-Scheme with Many-Body Contributions. J Phys Chem B 2024; 128:11597-11606. [PMID: 39550698 PMCID: PMC11613497 DOI: 10.1021/acs.jpcb.4c05645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2024] [Revised: 10/26/2024] [Accepted: 11/08/2024] [Indexed: 11/18/2024]
Abstract
Quantum-chemical fragmentation methods offer an attractive approach for the accurate calculation of protein-ligand interaction energies. While the molecular fractionation with conjugate caps (MFCC) scheme offers a rather straightforward approach for this purpose, its accuracy is often not sufficient. Here, we upgrade the MFCC scheme for the calculation of protein-ligand interactions by including many-body contributions. The resulting fragmentation scheme is an extension of our previously developed MFCC-MBE(2) scheme [J. Comput. Chem. 2023, 44, 1634-1644]. For a diverse test set of protein-ligand complexes, we demonstrate that by upgrading the MFCC scheme with many-body contributions, the error in protein-ligand interaction energies can be reduced significantly, and one generally achieves errors below 20 kJ/mol. Our scheme allows for systematically reducing these errors by including higher-order many-body contributions. As it combines the use of single amino acid fragments with high accuracy, our scheme provides an ideal starting point for the parametrization of accurate machine learning potentials for proteins and protein-ligand interactions.
Collapse
Affiliation(s)
- Johannes
R. Vornweg
- Institute of Physical and Theoretical
Chemistry, Technische Universität
Braunschweig, Gaußstr.
17, Braunschweig 38106, Germany
| | - Christoph R. Jacob
- Institute of Physical and Theoretical
Chemistry, Technische Universität
Braunschweig, Gaußstr.
17, Braunschweig 38106, Germany
| |
Collapse
|
2
|
Wang Y, Teng C, Begin E, Bussiere M, Bao JL. PW-SMD: A Plane-Wave Implicit Solvation Model Based on Electron Density for Surface Chemistry and Crystalline Systems in Aqueous Solution. J Chem Theory Comput 2024. [PMID: 39024317 DOI: 10.1021/acs.jctc.4c00594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Electron density-based implicit solvation models are a class of techniques for quantifying solvation effects and calculating free energies of solvation without an explicit representation of solvent molecules. Integral to the accuracy of solvation modeling is the proper definition of the solvation shell separating the solute molecule from the solvent environment, allowing for a physical partitioning of the free energies of solvation. Unlike state-of-the-art implicit solvation models for molecular quantum chemistry calculations, e.g., the solvation model based on solute electron density (SMD), solvation models for systems under periodic boundary conditions with plane-wave (PW) basis sets have been limited in their accuracy. Furthermore, a unified implicit solvation model with both homogeneous solution-phase and heterogeneous interfacial structures treated on equal footing is needed. In order to address this challenge, we developed a high-accuracy solvation model for periodic PW calculations that is applicable to molecular, ionic, interfacial, and bulk-phase chemistry. Our model, PW-SMD, is an extension of the SMD molecular solvation model to periodic systems in water. The free energy of solvation is partitioned into the electrostatic and cavity-dispersion-solvent structure (CDS) contributions. The electrostatic contributions of the solvation shell surrounding solute structures are parametrized based on their geometric and physical properties. In addition, the nonelectrostatic contribution to the solvation energy is accounted for by extending the CDS formalism of SMD to incorporate periodic boundary conditions. We validate the accuracy and robustness of our solvation model by comparing predicted solvation free energies against experimental data for molecular and ionic systems, carved-cluster composite energetic models of solvated reaction energies and barriers on surface systems, and deep-learning-accelerated ab initio molecular dynamics (AIMD). Our developed periodic implicit solvation model shows significantly improved accuracy compared to previous work (namely, solvation models in aqueous solution) and can be applied to simulate solvent effects in a wide range of surface and crystalline materials.
Collapse
Affiliation(s)
- Yang Wang
- Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
| | - Chong Teng
- Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
| | - Elijah Begin
- Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
| | - Mason Bussiere
- Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
| | - Junwei Lucas Bao
- Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
| |
Collapse
|
3
|
Wen M, Chang X, Xu Y, Chen D, Chu Q. Determining the mechanical and decomposition properties of high energetic materials (α-RDX, β-HMX, and ε-CL-20) using a neural network potential. Phys Chem Chem Phys 2024; 26:9984-9997. [PMID: 38477375 DOI: 10.1039/d4cp00017j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/14/2024]
Abstract
Molecular simulations of high energetic materials (HEMs) are limited by efficiency and accuracy. Recently, neural network potential (NNP) models have achieved molecular simulations of millions of atoms while maintaining the accuracy of density functional theory (DFT) levels. Herein, an NNP model covering typical HEMs containing C, H, N, and O elements is developed. The mechanical and decomposition properties of 1,3,5-trinitroperhydro-1,3,5-triazine (RDX), hexahydro-1,3,5-trinitro-1,3,5-triazine (HMX), and 2,4,6,8,10,12-hexanitrohexaazaisowurtzitane (CL-20) are determined by employing the molecular dynamics (MD) simulations based on the NNP model. The calculated results show that the mechanical properties of α-RDX, β-HMX, and ε-CL-20 agree with previous experiments and theoretical results, including cell parameters, equations of state, and elastic constants. In the thermal decomposition simulations, it is also found that the initial decomposition reactions of the three crystals are N-NO2 homolysis, corresponding radical intermediates formation, and NO2-induced reactions. This decomposition trajectory is mainly divided into two stages separating from the peak of NO2: pyrolysis and oxidation. Overall, the NNP model for C/H/N/O elements in this work is an alternative reactive force field for RDX, HMX, and CL-20 HEMs, and it opens up new potential for future kinetic study of nitramine explosives.
Collapse
Affiliation(s)
- Mingjie Wen
- State Key Laboratory of Explosion Science and Safety Protection, Beijing Institute of Technology, Beijing 100081, P. R. China.
| | - Xiaoya Chang
- State Key Laboratory of Explosion Science and Safety Protection, Beijing Institute of Technology, Beijing 100081, P. R. China.
| | - Yabei Xu
- State Key Laboratory of Explosion Science and Safety Protection, Beijing Institute of Technology, Beijing 100081, P. R. China.
| | - Dongping Chen
- State Key Laboratory of Explosion Science and Safety Protection, Beijing Institute of Technology, Beijing 100081, P. R. China.
| | - Qingzhao Chu
- State Key Laboratory of Explosion Science and Safety Protection, Beijing Institute of Technology, Beijing 100081, P. R. China.
| |
Collapse
|
4
|
Wang Z, Chen A, Tao K, Han Y, Li J. MatGPT: A Vane of Materials Informatics from Past, Present, to Future. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2306733. [PMID: 37813548 DOI: 10.1002/adma.202306733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 09/05/2023] [Indexed: 10/17/2023]
Abstract
Combining materials science, artificial intelligence (AI), physical chemistry, and other disciplines, materials informatics is continuously accelerating the vigorous development of new materials. The emergence of "GPT (Generative Pre-trained Transformer) AI" shows that the scientific research field has entered the era of intelligent civilization with "data" as the basic factor and "algorithm + computing power" as the core productivity. The continuous innovation of AI will impact the cognitive laws and scientific methods, and reconstruct the knowledge and wisdom system. This leads to think more about materials informatics. Here, a comprehensive discussion of AI models and materials infrastructures is provided, and the advances in the discovery and design of new materials are reviewed. With the rise of new research paradigms triggered by "AI for Science", the vane of materials informatics: "MatGPT", is proposed and the technical path planning from the aspects of data, descriptors, generative models, pretraining models, directed design models, collaborative training, experimental robots, as well as the efforts and preparations needed to develop a new generation of materials informatics, is carried out. Finally, the challenges and constraints faced by materials informatics are discussed, in order to achieve a more digital, intelligent, and automated construction of materials informatics with the joint efforts of more interdisciplinary scientists.
Collapse
Affiliation(s)
- Zhilong Wang
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - An Chen
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Kehao Tao
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Yanqiang Han
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Jinjin Li
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| |
Collapse
|
5
|
Lima Costa AH, Bezerra KS, de Lima Neto JX, Oliveira JIN, Galvão DS, Fulco UL. Deciphering Interactions between Potential Inhibitors and the Plasmodium falciparum DHODH Enzyme: A Computational Perspective. J Phys Chem B 2023; 127:9461-9475. [PMID: 37897437 DOI: 10.1021/acs.jpcb.3c05738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/30/2023]
Abstract
Malaria is a parasitic disease that, in its most severe form, can even lead to death. Insect-resistant vectors, insufficiently effective vaccines, and drugs that cannot stop parasitic infestations are making the fight against the disease increasingly difficult. It is known that the enzyme dihydroorotate dehydrogenase (DHODH) is of paramount importance for the synthesis of pyrimidine from the Plasmodium precursor, that is, for its growth and reproduction. Therefore, its blockade can lead to disruption of the parasite's life cycle in the vertebrate host. In this scenario, PfDHODH inhibitors have been considered candidates for a new therapy to stop the parasitic energy source. Given what is known, in this work, we applied molecular fractionation with conjugated caps (MFCC) in the framework of the quantum formalism of density functional theory (DFT) to evaluate the energies of the interactions between the enzyme and the different triazolopyrimidines (DSM483, DMS557, and DSM1), including a complex carrying the mutation C276F. From these results, it was possible to identify the main features of each system, focusing on the wild-type and mutant PfDHODH and examining the major amino acid residues that are part of the four complexes. Our analysis provides new information that can be used to develop new drugs that could prove to be more effective alternatives to present antimalarial drugs.
Collapse
Affiliation(s)
- Aranthya Hevelly Lima Costa
- Departamento de Biofísica e Farmacologia, Universidade Federal do Rio Grande do Norte, 59072-970 Natal-RN, Brazil
| | - Katyanna Sales Bezerra
- Departamento de Biofísica e Farmacologia, Universidade Federal do Rio Grande do Norte, 59072-970 Natal-RN, Brazil
- Applied Physics Department, University of Campinas, 130838-59 Campinas, São Paulo, Brazil
| | - José Xavier de Lima Neto
- Departamento de Biofísica e Farmacologia, Universidade Federal do Rio Grande do Norte, 59072-970 Natal-RN, Brazil
| | - Jonas Ivan Nobre Oliveira
- Departamento de Biofísica e Farmacologia, Universidade Federal do Rio Grande do Norte, 59072-970 Natal-RN, Brazil
| | - Douglas Soares Galvão
- Applied Physics Department, University of Campinas, 130838-59 Campinas, São Paulo, Brazil
| | - Umberto Laino Fulco
- Departamento de Biofísica e Farmacologia, Universidade Federal do Rio Grande do Norte, 59072-970 Natal-RN, Brazil
| |
Collapse
|
6
|
Tu H, Han Y, Wang Z, Chen A, Tao K, Ye S, Wang S, Wei Z, Li J. RotNet: A Rotationally Invariant Graph Neural Network for Quantum Mechanical Calculations. SMALL METHODS 2023:e2300534. [PMID: 37727096 DOI: 10.1002/smtd.202300534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Revised: 08/28/2023] [Indexed: 09/21/2023]
Abstract
Deep learning has proven promising in biological and chemical applications, aiding in accurate predictions of properties such as atomic forces, energies, and material band gaps. Traditional methods with rotational invariance, one of the most crucial physical laws for predictions made by machine learning, have relied on Fourier transforms or specialized convolution filters, leading to complex model design and reduced accuracy and efficiency. However, models without rotational invariance exhibit poor generalization ability across datasets. Addressing this contradiction, this work proposes a rotationally invariant graph neural network, named RotNet, for accurate and accelerated quantum mechanical calculations that can overcome the generalization deficiency caused by rotations of molecules. RotNet ensures rotational invariance through an effective transformation and learns distance and angular information from atomic coordinates. Benchmark experiments on three datasets (protein fragments, electronic materials, and QM9) demonstrate that the proposed RotNet framework outperforms popular baselines and generalizes well to spatial data with varying rotations. The high accuracy, efficiency, and fast convergence of RotNet suggest that it has tremendous potential to significantly facilitate studies of protein dynamics simulation and materials engineering while maintaining physical plausibility.
Collapse
Affiliation(s)
- Hongwei Tu
- Shanghai Key Laboratory of Maternal Fetal Medicine, Shanghai Institute of Maternal-Fetal Medicine and Gynecologic Oncology, Shanghai First Maternity and Infant Hospital, School of Medicine, Tongji University, Shanghai, 200092, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Yanqiang Han
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Zhilong Wang
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - An Chen
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Kehao Tao
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Simin Ye
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Shiwei Wang
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Zhiyun Wei
- Shanghai Key Laboratory of Maternal Fetal Medicine, Shanghai Institute of Maternal-Fetal Medicine and Gynecologic Oncology, Shanghai First Maternity and Infant Hospital, School of Medicine, Tongji University, Shanghai, 200092, China
| | - Jinjin Li
- Shanghai Key Laboratory of Maternal Fetal Medicine, Shanghai Institute of Maternal-Fetal Medicine and Gynecologic Oncology, Shanghai First Maternity and Infant Hospital, School of Medicine, Tongji University, Shanghai, 200092, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| |
Collapse
|
7
|
Zeng J, Zhang D, Lu D, Mo P, Li Z, Chen Y, Rynik M, Huang L, Li Z, Shi S, Wang Y, Ye H, Tuo P, Yang J, Ding Y, Li Y, Tisi D, Zeng Q, Bao H, Xia Y, Huang J, Muraoka K, Wang Y, Chang J, Yuan F, Bore SL, Cai C, Lin Y, Wang B, Xu J, Zhu JX, Luo C, Zhang Y, Goodall REA, Liang W, Singh AK, Yao S, Zhang J, Wentzcovitch R, Han J, Liu J, Jia W, York DM, E W, Car R, Zhang L, Wang H. DeePMD-kit v2: A software package for deep potential models. J Chem Phys 2023; 159:054801. [PMID: 37526163 PMCID: PMC10445636 DOI: 10.1063/5.0155600] [Citation(s) in RCA: 63] [Impact Index Per Article: 31.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 07/03/2023] [Indexed: 08/02/2023] Open
Abstract
DeePMD-kit is a powerful open-source software package that facilitates molecular dynamics simulations using machine learning potentials known as Deep Potential (DP) models. This package, which was released in 2017, has been widely used in the fields of physics, chemistry, biology, and material science for studying atomistic systems. The current version of DeePMD-kit offers numerous advanced features, such as DeepPot-SE, attention-based and hybrid descriptors, the ability to fit tensile properties, type embedding, model deviation, DP-range correction, DP long range, graphics processing unit support for customized operators, model compression, non-von Neumann molecular dynamics, and improved usability, including documentation, compiled binary packages, graphical user interfaces, and application programming interfaces. This article presents an overview of the current major version of the DeePMD-kit package, highlighting its features and technical details. Additionally, this article presents a comprehensive procedure for conducting molecular dynamics as a representative application, benchmarks the accuracy and efficiency of different models, and discusses ongoing developments.
Collapse
Affiliation(s)
- Jinzhe Zeng
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | | | - Denghui Lu
- HEDPS, CAPT, College of Engineering, Peking University, Beijing 100871, People’s Republic of China
| | - Pinghui Mo
- College of Electrical and Information Engineering, Hunan University, Changsha, People’s Republic of China
| | - Zeyu Li
- Yuanpei College, Peking University, Beijing 100871, People’s Republic of China
| | - Yixiao Chen
- Program in Applied and Computational Mathematics, Princeton University, Princeton, New Jersey 08540, USA
| | - Marián Rynik
- Department of Experimental Physics, Comenius University, Mlynská Dolina F2, 842 48 Bratislava, Slovakia
| | - Li’ang Huang
- Center for Quantum Information, Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, People’s Republic of China
| | | | - Shaochen Shi
- ByteDance Research, Zhonghang Plaza, No. 43, North 3rd Ring West Road, Haidian District, Beijing, People’s Republic of China
| | | | - Haotian Ye
- Yuanpei College, Peking University, Beijing 100871, People’s Republic of China
| | - Ping Tuo
- AI for Science Institute, Beijing 100080, People’s Republic of China
| | - Jiabin Yang
- Baidu, Inc., Beijing, People’s Republic of China
| | | | - Yifan Li
- Department of Chemistry, Princeton University, Princeton, New Jersey 08544, USA
| | | | - Qiyu Zeng
- Department of Physics, National University of Defense Technology, Changsha, Hunan 410073, People’s Republic of China
| | | | - Yu Xia
- ByteDance Research, Zhonghang Plaza, No. 43, North 3rd Ring West Road, Haidian District, Beijing, People’s Republic of China
| | | | - Koki Muraoka
- Department of Chemical System Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
| | - Yibo Wang
- DP Technology, Beijing 100080, People’s Republic of China
| | | | - Fengbo Yuan
- DP Technology, Beijing 100080, People’s Republic of China
| | - Sigbjørn Løland Bore
- Hylleraas Centre for Quantum Molecular Sciences and Department of Chemistry, University of Oslo, P.O. Box 1033 Blindern, 0315 Oslo, Norway
| | | | - Yinnian Lin
- Wangxuan Institute of Computer Technology, Peking University, Beijing 100871, People’s Republic of China
| | - Bo Wang
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, Shanghai Key Laboratory of Green Chemistry and Chemical Process, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, People’s Republic of China
| | - Jiayan Xu
- School of Chemistry and Chemical Engineering, Queen’s University Belfast, Belfast BT9 5AG, United Kingdom
| | - Jia-Xin Zhu
- State Key Laboratory of Physical Chemistry of Solid Surfaces, iChEM, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, People’s Republic of China
| | - Chenxing Luo
- Department of Applied Physics and Applied Mathematics, Columbia University, New York, New York 10027, USA
| | - Yuzhi Zhang
- DP Technology, Beijing 100080, People’s Republic of China
| | | | - Wenshuo Liang
- DP Technology, Beijing 100080, People’s Republic of China
| | - Anurag Kumar Singh
- Department of Data Science, Indian Institute of Technology, Palakkad, Kerala, India
| | - Sikai Yao
- DP Technology, Beijing 100080, People’s Republic of China
| | - Jingchao Zhang
- NVIDIA AI Technology Center (NVAITC), Santa Clara, California 95051, USA
| | | | - Jiequn Han
- Center for Computational Mathematics, Flatiron Institute, New York, New York 10010, USA
| | - Jie Liu
- College of Electrical and Information Engineering, Hunan University, Changsha, People’s Republic of China
| | | | - Darrin M. York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | | | - Roberto Car
- Department of Chemistry, Princeton University, Princeton, New Jersey 08544, USA
| | | | - Han Wang
- Author to whom correspondence should be addressed:
| |
Collapse
|
8
|
Sahu N, Khire SS, Gadre SR. Combining fragmentation method and high-performance computing: Geometry optimization and vibrational spectra of proteins. J Chem Phys 2023; 159:044309. [PMID: 37522406 DOI: 10.1063/5.0149572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 07/12/2023] [Indexed: 08/01/2023] Open
Abstract
Exploring the structures and spectral features of proteins with advanced quantum chemical methods is an uphill task. In this work, a fragment-based molecular tailoring approach (MTA) is appraised for the CAM-B3LYP/aug-cc-pVDZ-level geometry optimization and vibrational infrared (IR) spectra calculation of ten real proteins containing up to 407 atoms and 6617 basis functions. The use of MTA and the inherently parallel nature of the fragment calculations enables a rapid and accurate calculation of the IR spectrum. The applicability of MTA to optimize the protein geometry and evaluate its IR spectrum employing a polarizable continuum model with water as a solvent is also showcased. The typical errors in the total energy and IR frequencies computed by MTA vis-à-vis their full calculation (FC) counterparts for the studied protein are 5-10 millihartrees and 5 cm-1, respectively. Moreover, due to the independent execution of the fragments, large-scale parallelization can also be achieved. With increasing size and level of theory, MTA shows an appreciable advantage in computer time as well as memory and disk space requirement over the corresponding FCs. The present study suggests that the geometry optimization and IR computations on the biomolecules containing ∼1000 atoms and/or ∼15 000 basis functions using MTA and HPC facility can be clearly envisioned in the near future.
Collapse
Affiliation(s)
- Nityananda Sahu
- Theoretische Chemie, Philipps-Universität Marburg, 35032 Marburg, Germany
| | - Subodh S Khire
- RIKEN Center for Computational Science, Kobe 650-0047, Japan
| | - Shridhar R Gadre
- Departments of Scientific Computing, Modelling & Simulation and Chemistry, Savitribai Phule Pune University, Pune 411007, India
| |
Collapse
|
9
|
Zhang P, Yang W. Toward a general neural network force field for protein simulations: Refining the intramolecular interaction in protein. J Chem Phys 2023; 159:024118. [PMID: 37431910 PMCID: PMC10481389 DOI: 10.1063/5.0142280] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 06/22/2023] [Indexed: 07/12/2023] Open
Abstract
Molecular dynamics (MD) is an extremely powerful, highly effective, and widely used approach to understanding the nature of chemical processes in atomic details for proteins. The accuracy of results from MD simulations is highly dependent on force fields. Currently, molecular mechanical (MM) force fields are mainly utilized in MD simulations because of their low computational cost. Quantum mechanical (QM) calculation has high accuracy, but it is exceedingly time consuming for protein simulations. Machine learning (ML) provides the capability for generating accurate potential at the QM level without increasing much computational effort for specific systems that can be studied at the QM level. However, the construction of general machine learned force fields, needed for broad applications and large and complex systems, is still challenging. Here, general and transferable neural network (NN) force fields based on CHARMM force fields, named CHARMM-NN, are constructed for proteins by training NN models on 27 fragments partitioned from the residue-based systematic molecular fragmentation (rSMF) method. The NN for each fragment is based on atom types and uses new input features that are similar to MM inputs, including bonds, angles, dihedrals, and non-bonded terms, which enhance the compatibility of CHARMM-NN to MM MD and enable the implementation of CHARMM-NN force fields in different MD programs. While the main part of the energy of the protein is based on rSMF and NN, the nonbonded interactions between the fragments and with water are taken from the CHARMM force field through mechanical embedding. The validations of the method for dipeptides on geometric data, relative potential energies, and structural reorganization energies demonstrate that the CHARMM-NN local minima on the potential energy surface are very accurate approximations to QM, showing the success of CHARMM-NN for bonded interactions. However, the MD simulations on peptides and proteins indicate that more accurate methods to represent protein-water interactions in fragments and non-bonded interactions between fragments should be considered in the future improvement of CHARMM-NN, which can increase the accuracy of approximation beyond the current mechanical embedding QM/MM level.
Collapse
Affiliation(s)
- Pan Zhang
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| | - Weitao Yang
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| |
Collapse
|
10
|
Vornweg JR, Wolter M, Jacob CR. A simple and consistent quantum-chemical fragmentation scheme for proteins that includes two-body contributions. J Comput Chem 2023; 44:1634-1644. [PMID: 37171574 DOI: 10.1002/jcc.27114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 03/28/2023] [Accepted: 03/30/2023] [Indexed: 05/13/2023]
Abstract
The Molecular Fractionation with Conjugate Caps (MFCC) method is a popular fragmentation method for the quantum-chemical treatment of proteins. However, it does not account for interactions between the amino acid fragments, such as intramolecular hydrogen bonding. Here, we present a combination of the MFCC fragmentation scheme with a second-order many-body expansion (MBE) that consistently accounts for all fragment-fragment, fragment-cap, and cap-cap interactions, while retaining the overall simplicity of the MFCC scheme with its chemically meaningful fragments. We show that with the resulting MFCC-MBE(2) scheme, the errors in the total energies of selected polypeptides and proteins can be reduced by up to one order of magnitude and relative energies of different protein conformers can be predicted accurately.
Collapse
Affiliation(s)
- Johannes R Vornweg
- Institute of Physical and Theoretical Chemistry, Technische Universität Braunschweig, Braunschweig, Germany
| | - Mario Wolter
- Institute of Physical and Theoretical Chemistry, Technische Universität Braunschweig, Braunschweig, Germany
| | - Christoph R Jacob
- Institute of Physical and Theoretical Chemistry, Technische Universität Braunschweig, Braunschweig, Germany
| |
Collapse
|
11
|
Zhu Q, Ge Y, Li W, Ma J. Treating Polarization Effects in Charged and Polar Bio-Molecules Through Variable Electrostatic Parameters. J Chem Theory Comput 2023; 19:396-411. [PMID: 36592097 DOI: 10.1021/acs.jctc.2c01130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Polarization plays important roles in charged and hydrogen bonding containing systems. Much effort ranging from the construction of physics-based models to quantum mechanism (QM)-based and machine learning (ML)-assisted models have been devoted to incorporating the polarization effect into the conventional force fields at different levels, such as atomic and coarse grained (CG). The application of polarizable force fields or polarization models was limited by two aspects, namely, computational cost and transferability. Different from physics-based models, no predetermining parameters were required in the QM-based approaches. Taking advantage of both the accuracy of QM calculations and efficiency of molecular mechanism (MM) and ML, polarization effects could be treated more efficiently while maintaining the QM accuracy. The computational cost could be reduced with variable electrostatic parameters, such as the charge, dipole, and electronic dielectric constant with the help of linear scaling fragmentation-based QM calculations and ML models. Polarization and entropy effects on the prediction of partition coefficient of druglike molecules are demonstrated by using both explicit or implicit all-atom molecular dynamics simulations and machine learning-assisted models. Directions and challenges for future development are also envisioned.
Collapse
Affiliation(s)
- Qiang Zhu
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing210023, P. R. China
| | - Yang Ge
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing210023, P. R. China
| | - Wei Li
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing210023, P. R. China
| | - Jing Ma
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing210023, P. R. China
| |
Collapse
|
12
|
Liu J, He X. Recent advances in quantum fragmentation approaches to complex molecular and condensed‐phase systems. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Jinfeng Liu
- Department of Basic Medicine and Clinical Pharmacy China Pharmaceutical University Nanjing China
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, Shanghai Frontiers Science Center of Molecule Intelligent Syntheses, School of Chemistry and Molecular Engineering East China Normal University Shanghai China
| | - Xiao He
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, Shanghai Frontiers Science Center of Molecule Intelligent Syntheses, School of Chemistry and Molecular Engineering East China Normal University Shanghai China
- New York University‐East China Normal University Center for Computational Chemistry New York University Shanghai Shanghai China
| |
Collapse
|
13
|
Liao K, Dong S, Cheng Z, Li W, Li S. Combined fragment-based machine learning force field with classical force field and its application in the NMR calculations of macromolecules in solutions. Phys Chem Chem Phys 2022; 24:18559-18567. [PMID: 35916054 DOI: 10.1039/d2cp02192g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
We have developed a combined fragment-based machine learning (ML) force field and molecular mechanics (MM) force field for simulating the structures of macromolecules in solutions, and then compute its NMR chemical shifts with the generalized energy-based fragmentation (GEBF) approach at the level of density functional theory (DFT). In this work, we first construct Gaussian approximation potential based on GEBF subsystems of macromolecules for MD simulations and then a GEBF-based neural network (GEBF-NN) with deep potential model for the studied macromolecule. Then, we develop a GEBF-NN/MM force field for macromolecules in solutions by combining the GEBF-NN force field for the solute molecule and ff14SB force field for solvent molecules. Using the GEBF-NN/MM MD simulation to generate snapshot structures of solute/solvent clusters, we then perform the NMR calculations with the GEBF approach at the DFT level to calculate NMR chemical shifts of the solute molecule. Taking a heptamer of oligopyridine-dicarboxamides in chloroform solution as an example, our results show that the GEBF-NN force field is quite accurate for this heptamer by comparing with the reference DFT results. For this heptamer in chloroform solution, both the GEBF-NN/MM and classical MD simulations could lead to helical structures from the same initial extended structure. The GEBF-DFT NMR results indicate that the GEBF-NN/MM force field could lead to more accurate NMR chemical shifts on hydrogen atoms by comparing with the experimental NMR results. Therefore, the GEBF-NN/MM force field could be employed for predicting more accurate dynamical behaviors than the classical force field for complex systems in solutions.
Collapse
Affiliation(s)
- Kang Liao
- School of Chemistry and Chemical Engineering, Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, Nanjing University, Nanjing, 210023, P. R. China.
| | - Shiyu Dong
- School of Chemistry and Chemical Engineering, Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, Nanjing University, Nanjing, 210023, P. R. China.
| | - Zheng Cheng
- School of Chemistry and Chemical Engineering, Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, Nanjing University, Nanjing, 210023, P. R. China.
| | - Wei Li
- School of Chemistry and Chemical Engineering, Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, Nanjing University, Nanjing, 210023, P. R. China.
| | - Shuhua Li
- School of Chemistry and Chemical Engineering, Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, Nanjing University, Nanjing, 210023, P. R. China.
| |
Collapse
|
14
|
Tzeli D, Xantheas SS. Breaking covalent bonds in the context of the many-body expansion (MBE). I. The purported "first row anomaly" in XH n (X = C, Si, Ge, Sn; n = 1-4). J Chem Phys 2022; 156:244303. [PMID: 35778077 DOI: 10.1063/5.0095329] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We present a new, novel implementation of the Many-Body Expansion (MBE) to account for the breaking of covalent bonds, thus extending the range of applications from its previous popular usage in the breaking of hydrogen bonds in clusters to molecules. A central concept of the new implementation is the in situ atomic electronic state of an atom in a molecule that casts the one-body term as the energy required to promote it to that state from its ground state. The rest of the terms correspond to the individual diatomic, triatomic, etc., fragments. Its application to the atomization energies of the XHn series, X = C, Si, Ge, Sn and n = 1-4, suggests that the (negative, stabilizing) 2-B is by far the largest term in the MBE with the higher order terms oscillating between positive and negative values and decreasing dramatically in size with increasing rank of the expansion. The analysis offers an alternative explanation for the purported "first row anomaly" in the incremental Hn-1X-H bond energies seen when these energies are evaluated with respect to the lowest energy among the states of the XHn molecules. Due to the "flipping" of the ground/first excited state between CH2 (3B1 ground state, 1A1 first excited state) and XH2, X = Si, Ge, Sn (1A1 ground state, 3B1 first excited state), the overall picture does not exhibit a "first row anomaly" when the incremental bond energies are evaluated with respect to the molecular states having the same in situ atomic states.
Collapse
Affiliation(s)
- Demeter Tzeli
- Laboratory of Physical Chemistry, Department of Chemistry, National and Kapodistrian University of Athens, Panepistimiopolis Zografou, Athens 15784, Greece
| | - Sotiris S Xantheas
- Advanced Computing, Mathematics and Data Division, Pacific Northwest National Laboratory, 902 Battelle Boulevard, P.O. Box 999, Mississippi K1-83, Richland, Washington 99352, USA
| |
Collapse
|
15
|
Abstract
Monte Carlo (MC) methods are important computational tools for molecular structure optimizations and predictions. When solvent effects are explicitly considered, MC methods become very expensive due to the large degree of freedom associated with the water molecules and mobile ions. Alternatively implicit-solvent MC can largely reduce the computational cost by applying a mean field approximation to solvent effects and meanwhile maintains the atomic detail of the target molecule. The two most popular implicit-solvent models are the Poisson-Boltzmann (PB) model and the Generalized Born (GB) model in a way such that the GB model is an approximation to the PB model but is much faster in simulation time. In this work, we develop a machine learning-based implicit-solvent Monte Carlo (MLIMC) method by combining the advantages of both implicit solvent models in accuracy and efficiency. Specifically, the MLIMC method uses a fast and accurate PB-based machine learning (PBML) scheme to compute the electrostatic solvation free energy at each step. We validate our MLIMC method by using a benzene-water system and a protein-water system. We show that the proposed MLIMC method has great advantages in speed and accuracy for molecular structure optimization and prediction.
Collapse
Affiliation(s)
- Jiahui Chen
- Department of Mathematics, Michigan State University, MI 48824, USA
| | - Weihua Geng
- Department of Mathematics, Southern Methodist University, Dallas, TX 75275, USA
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, MI 48824, USA
- Department of Biochemistry and Molecular Biology, Michigan State University, MI 48824, USA
| |
Collapse
|
16
|
Cheng Z, Du J, Zhang L, Ma J, Li W, Li S. Building quantum mechanics quality force fields of proteins with the generalized energy-based fragmentation approach and machine learning. Phys Chem Chem Phys 2021; 24:1326-1337. [PMID: 34718360 DOI: 10.1039/d1cp03934b] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
We combined our generalized energy-based fragmentation (GEBF) approach and machine learning (ML) technique to construct quantum mechanics (QM) quality force fields for proteins. In our scheme, the training sets for a protein are only constructed from its small subsystems, which capture all short-range interactions in the target system. The energy of a given protein is expressed as the summation of atomic contributions from QM calculations of various subsystems, corrected by long-range Coulomb and van der Waals interactions. With the Gaussian approximation potential (GAP) method, our protocol can automatically generate training sets with high efficiency. To facilitate the construction of training sets for proteins, we store all trained subsystem data in a library. If subsystems in the library are detected in a new protein, corresponding datasets can be directly reused as a part of the training set on this new protein. With two polypeptides, 4ZNN and 1XQ8 segment, as examples, the energies and forces predicted by GEBF-GAP are in good agreement with those from conventional QM calculations, and dihedral angle distributions from GEBF-GAP molecular dynamics (MD) simulations can also well reproduce those from ab initio MD simulations. In addition, with the training set generated from GEBF-GAP, we also demonstrate that GEBF-ML force fields constructed by neural network (NN) methods can also show QM quality. Therefore, the present work provides an efficient and systematic way to build QM quality force fields for biological systems.
Collapse
Affiliation(s)
- Zheng Cheng
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, P. R. China.
| | - Jiahui Du
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, P. R. China.
| | - Lei Zhang
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, P. R. China.
| | - Jing Ma
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, P. R. China.
| | - Wei Li
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, P. R. China.
| | - Shuhua Li
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, P. R. China.
| |
Collapse
|
17
|
Shen C, Jin X, Glover WJ, He X. Accurate Prediction of Absorption Spectral Shifts of Proteorhodopsin Using a Fragment-Based Quantum Mechanical Method. Molecules 2021; 26:4486. [PMID: 34361639 PMCID: PMC8347797 DOI: 10.3390/molecules26154486] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Revised: 07/19/2021] [Accepted: 07/20/2021] [Indexed: 11/17/2022] Open
Abstract
Many experiments have been carried out to display different colors of Proteorhodopsin (PR) and its mutants, but the mechanism of color tuning of PR was not fully elucidated. In this study, we applied the Electrostatically Embedded Generalized Molecular Fractionation with Conjugate Caps (EE-GMFCC) method to the prediction of excitation energies of PRs. Excitation energies of 10 variants of Blue Proteorhodopsin (BPR-PR105Q) in residue 105GLN were calculated with the EE-GMFCC method at the TD-B3LYP/6-31G* level. The calculated results show good correlation with the experimental values of absorption wavelengths, although the experimental wavelength range among these systems is less than 50 nm. The ensemble-averaged electric fields along the polyene chain of retinal correlated well with EE-GMFCC calculated excitation energies for these 10 PRs, suggesting that electrostatic interactions from nearby residues are responsible for the color tuning. We also utilized the GMFCC method to decompose the excitation energy contribution per residue surrounding the chromophore. Our results show that residues ASP97 and ASP227 have the largest contribution to the absorption spectral shift of PR among the nearby residues of retinal. This work demonstrates that the EE-GMFCC method can be applied to accurately predict the absorption spectral shifts for biomacromolecules.
Collapse
Affiliation(s)
- Chenfei Shen
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China; (C.S.); (X.J.)
| | - Xinsheng Jin
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China; (C.S.); (X.J.)
| | - William J. Glover
- NYU Shanghai, 1555 Century Avenue, Shanghai 200122, China;
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Department of Chemistry, New York University, New York, NY 10003, USA
| | - Xiao He
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China; (C.S.); (X.J.)
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| |
Collapse
|
18
|
Morawietz T, Artrith N. Machine learning-accelerated quantum mechanics-based atomistic simulations for industrial applications. J Comput Aided Mol Des 2021; 35:557-586. [PMID: 33034008 PMCID: PMC8018928 DOI: 10.1007/s10822-020-00346-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 09/26/2020] [Indexed: 01/13/2023]
Abstract
Atomistic simulations have become an invaluable tool for industrial applications ranging from the optimization of protein-ligand interactions for drug discovery to the design of new materials for energy applications. Here we review recent advances in the use of machine learning (ML) methods for accelerated simulations based on a quantum mechanical (QM) description of the system. We show how recent progress in ML methods has dramatically extended the applicability range of conventional QM-based simulations, allowing to calculate industrially relevant properties with enhanced accuracy, at reduced computational cost, and for length and time scales that would have otherwise not been accessible. We illustrate the benefits of ML-accelerated atomistic simulations for industrial R&D processes by showcasing relevant applications from two very different areas, drug discovery (pharmaceuticals) and energy materials. Writing from the perspective of both a molecular and a materials modeling scientist, this review aims to provide a unified picture of the impact of ML-accelerated atomistic simulations on the pharmaceutical, chemical, and materials industries and gives an outlook on the exciting opportunities that could emerge in the future.
Collapse
Affiliation(s)
- Tobias Morawietz
- Bayer AG, Pharmaceuticals, R&D, Digital Technologies, Computational Molecular Design, 42096 Wuppertal, Germany
| | - Nongnuch Artrith
- Department of Chemical Engineering, Columbia University, New York, NY 10027 USA
| |
Collapse
|
19
|
Ma S, Ma Y, Zhang B, Tian Y, Jin Z. Forecasting System of Computational Time of DFT/TDDFT Calculations under the Multiverse Ansatz via Machine Learning and Cheminformatics. ACS OMEGA 2021; 6:2001-2024. [PMID: 33521440 PMCID: PMC7841786 DOI: 10.1021/acsomega.0c04981] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Accepted: 01/04/2021] [Indexed: 06/12/2023]
Abstract
With the view of achieving a better performance in task assignment and load-balancing, a top-level designed forecasting system for predicting computational times of density-functional theory (DFT)/time-dependent DFT (TDDFT) calculations is presented. The computational time is assumed as the intrinsic property for the molecule. Based on this assumption, the forecasting system is established using the "reinforced concrete", which combines the cheminformatics, several machine-learning (ML) models, and the framework of many-world interpretation (MWI) in multiverse ansatz. Herein, the cheminformatics is used to recognize the topological structure of molecules, the ML models are used to build the relationships between topology and computational cost, and the MWI framework is used to hold various combinations of DFT functionals and basis sets in DFT/TDDFT calculations. Calculated results of molecules from the DrugBank dataset show that (1) it can give quantitative predictions of computational costs, typical mean relative errors can be less than 0.2 for DFT/TDDFT calculations with derivations of ±25% using the exactly pretrained ML models and (2) it can also be employed to various combinations of DFT functional and basis set cases without exactly pretrained ML models, while only slightly enlarge predicting errors.
Collapse
Affiliation(s)
- Shuo Ma
- Computer
Network Information Center, Chinese Academy
of Sciences, Beijing 100190, China
- School
of Computer Science and Technology, University
of Chinese Academy of Sciences, Beijing 101408, China
| | - Yingjin Ma
- Computer
Network Information Center, Chinese Academy
of Sciences, Beijing 100190, China
- Center
of Scientific Computing Applications & Research, Chinese Academy of Sciences, Beijing 100190, China
| | - Baohua Zhang
- Computer
Network Information Center, Chinese Academy
of Sciences, Beijing 100190, China
- Center
of Scientific Computing Applications & Research, Chinese Academy of Sciences, Beijing 100190, China
| | - Yingqi Tian
- Computer
Network Information Center, Chinese Academy
of Sciences, Beijing 100190, China
- School
of Computer Science and Technology, University
of Chinese Academy of Sciences, Beijing 101408, China
| | - Zhong Jin
- Computer
Network Information Center, Chinese Academy
of Sciences, Beijing 100190, China
- Center
of Scientific Computing Applications & Research, Chinese Academy of Sciences, Beijing 100190, China
| |
Collapse
|
20
|
Han Y, Wang Z, Li J. Neural Networks Accelerate the Ab Initio Prediction of Solid-Solid Phase Transitions at High Pressures. J Phys Chem Lett 2021; 12:132-137. [PMID: 33314933 DOI: 10.1021/acs.jpclett.0c03101] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
High-level ab initio chemical calculations, such as second-order Møller-Plesset perturbation (MP2), are highly accurate but time-consuming, making it inefficient to apply to macromolecular systems. Here, we propose a newly efficient approach based on the neural network and fragment method to predict the Gibbs free energy, structural characteristics, and thus phase transition of solid crystal structures. The proposed approach has the same prediction accuracy as the MP2 calculation but is hundreds of times faster than the MP2. The predicted structures and phase transitions of two selected ice phases (IX and XV) under extreme conditions are in excellent agreement with the MP2 calculations and experimental results but with an extremely low computational cost. It not only predicts the high-pressure structures and phase diagrams of solid systems accurately and efficiently but also solves the problem of extreme calculation cost during a high-precision theoretical study on high-pressure molecular crystals with potentially essential applications.
Collapse
|
21
|
Gao W, Mahajan SP, Sulam J, Gray JJ. Deep Learning in Protein Structural Modeling and Design. PATTERNS (NEW YORK, N.Y.) 2020; 1:100142. [PMID: 33336200 PMCID: PMC7733882 DOI: 10.1016/j.patter.2020.100142] [Citation(s) in RCA: 95] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Deep learning is catalyzing a scientific revolution fueled by big data, accessible toolkits, and powerful computational resources, impacting many fields, including protein structural modeling. Protein structural modeling, such as predicting structure from amino acid sequence and evolutionary information, designing proteins toward desirable functionality, or predicting properties or behavior of a protein, is critical to understand and engineer biological systems at the molecular level. In this review, we summarize the recent advances in applying deep learning techniques to tackle problems in protein structural modeling and design. We dissect the emerging approaches using deep learning techniques for protein structural modeling and discuss advances and challenges that must be addressed. We argue for the central importance of structure, following the "sequence → structure → function" paradigm. This review is directed to help both computational biologists to gain familiarity with the deep learning methods applied in protein modeling, and computer scientists to gain perspective on the biologically meaningful problems that may benefit from deep learning techniques.
Collapse
Affiliation(s)
- Wenhao Gao
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Sai Pooja Mahajan
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jeremias Sulam
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jeffrey J. Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| |
Collapse
|
22
|
Barry MC, Wise KE, Kalidindi SR, Kumar S. Voxelized Atomic Structure Potentials: Predicting Atomic Forces with the Accuracy of Quantum Mechanics Using Convolutional Neural Networks. J Phys Chem Lett 2020; 11:9093-9099. [PMID: 32985196 DOI: 10.1021/acs.jpclett.0c02271] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
This paper introduces voxelized atomic structure (VASt) potentials as a machine learning (ML) framework for developing interatomic potentials. The VASt framework utilizes a voxelized representation of the atomic structure directly as the input to a convolutional neural network (CNN). This allows for high-fidelity representations of highly complex and diverse spatial arrangements of the atomic environments of interest. The CNN implicitly establishes the low-dimensional features needed to correlate each atomic neighborhood to its net atomic force. The selection of the salient features of the atomic structure (i.e., feature engineering) in the VASt framework is implicit, comprehensive, automated, scalable, and highly efficient. The calibrated convolutional layers learn the complex spatial relationships and multibody interactions that govern the physics of atomic systems with remarkable fidelity. We show that VASt potentials predict highly accurate forces on two phases of silicon carbide and the thermal conductivity of silicon over a range of isotropic strain.
Collapse
Affiliation(s)
- Matthew C Barry
- G.W. Woodruff School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Kristopher E Wise
- Advanced Materials and Processing Branch, NASA Langley Research Center, Hampton, Virginia 23681, United States
| | - Surya R Kalidindi
- G.W. Woodruff School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Satish Kumar
- G.W. Woodruff School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| |
Collapse
|
23
|
Reinholdt P, Jørgensen FK, Kongsted J, Olsen JMH. Polarizable Density Embedding for Large Biomolecular Systems. J Chem Theory Comput 2020; 16:5999-6006. [PMID: 32991163 DOI: 10.1021/acs.jctc.0c00763] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
We present an efficient and robust fragment-based quantum-classical embedding model capable of accurately capturing effects from complex environments such as proteins and nucleic acids. This is realized by combining the molecular fractionation with conjugate caps (MFCC) procedure with the polarizable density embedding (PDE) model at the level of Fock matrix construction. The PDE contributions to the Fock matrix of the core region are constructed using the local molecular basis of the individual fragments rather than the supermolecular basis of the entire system. Thereby, we avoid complications associated with the application of the MFCC procedure on environment quantities such as electronic densities and molecular-orbital energies. Moreover, the computational cost associated with solving self-consistent field (SCF) equations of the core region remains unchanged from that of purely classical polarized embedding models. We analyze the performance of the resulting model in terms of the reproduction of the electrostatic potential of an insulin monomer protein and further in the context of solving problems related to electron spill-out. Finally, we showcase the model for the calculation of one- and two-photon properties of the Nile red molecule in a protein environment. Based on our analyses, we find that the combination of the MFCC approach with the PDE model is an efficient, yet accurate approach for calculating molecular properties of molecules embedded in structured biomolecular environments.
Collapse
Affiliation(s)
- Peter Reinholdt
- Department of Physics, Chemistry and Pharmacy, University of Southern Denmark, DK-5230 Odense M, Denmark
| | - Frederik Kamper Jørgensen
- Department of Physics, Chemistry and Pharmacy, University of Southern Denmark, DK-5230 Odense M, Denmark
| | - Jacob Kongsted
- Department of Physics, Chemistry and Pharmacy, University of Southern Denmark, DK-5230 Odense M, Denmark
| | - Jógvan Magnus Haugaard Olsen
- Hylleraas Centre for Quantum Molecular Sciences, Department of Chemistry, UiT The Arctic University of Norway, N-9037 Tromsø, Norway.,Department of Chemistry, Aarhus University, DK-8000 Aarhus C, Denmark
| |
Collapse
|
24
|
Cheng Z, Zhao D, Ma J, Li W, Li S. An On-the-Fly Approach to Construct Generalized Energy-Based Fragmentation Machine Learning Force Fields of Complex Systems. J Phys Chem A 2020; 124:5007-5014. [DOI: 10.1021/acs.jpca.0c04526] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Affiliation(s)
- Zheng Cheng
- Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, People’s Republic of China
| | - Dongbo Zhao
- Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, People’s Republic of China
- Kuang Yaming Honors School, Nanjing University, Nanjing 210023, People’s Republic of China
| | - Jing Ma
- Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, People’s Republic of China
| | - Wei Li
- Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, People’s Republic of China
| | - Shuhua Li
- Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, People’s Republic of China
| |
Collapse
|