1
|
Shirani H, Hashemianzadeh SM. Quantum-level machine learning calculations of Levodopa. Comput Biol Chem 2024; 112:108146. [PMID: 39067350 DOI: 10.1016/j.compbiolchem.2024.108146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 06/20/2024] [Accepted: 07/08/2024] [Indexed: 07/30/2024]
Abstract
Many drug molecules contain functional groups, resulting in a torsional barrier corresponding to rotation around the bond linking the fragments. In medicinal chemistry and pharmaceutical sciences, inclusive of drug design studies, the exact calculation of the potential energy surface (PES) of these molecular torsions is extremely important and precious. Machine learning (ML), including deep learning (DL), is currently one of the most rapidly evolving tools in computer-aided drug discovery and molecular simulations. In this work, we used ANI-1x neural network potential as a quantum-level ML to predict the PESs of the L-3,4-dihydroxyphenylalanine (Levodopa) antiparkinsonian drug molecule. The electronic energies and structural parameters calculated by density functional theory (DFT) using the wB97X method and all possible Pople's basis sets indicated the 6-31G(d) basis set, when used with the wB97X functional, exhibits behavior similar to that of the ANI-1x model. The vibrational frequencies investigation showed a linear correlation between DFT and ML data. All ANI-1x calculations were completed quickly in a very short computing time. From this perspective, we expect the ANI-1x dataset applied in this work to be appreciably efficient and effective in computational structure-based drug design studies.
Collapse
Affiliation(s)
- Hossein Shirani
- Molecular Simulation Research Laboratory, Department of Chemistry, Iran University of Science and Technology, P.O. Box 16846-13114, Tehran, Iran.
| | - Seyed Majid Hashemianzadeh
- Molecular Simulation Research Laboratory, Department of Chemistry, Iran University of Science and Technology, P.O. Box 16846-13114, Tehran, Iran.
| |
Collapse
|
2
|
Williams CD, Kalayan J, Burton NA, Bryce RA. Stable and accurate atomistic simulations of flexible molecules using conformationally generalisable machine learned potentials. Chem Sci 2024; 15:12780-12795. [PMID: 39148799 PMCID: PMC11323334 DOI: 10.1039/d4sc01109k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Accepted: 07/07/2024] [Indexed: 08/17/2024] Open
Abstract
Computational simulation methods based on machine learned potentials (MLPs) promise to revolutionise shape prediction of flexible molecules in solution, but their widespread adoption has been limited by the way in which training data is generated. Here, we present an approach which allows the key conformational degrees of freedom to be properly represented in reference molecular datasets. MLPs trained on these datasets using a global descriptor scheme are generalisable in conformational space, providing quantum chemical accuracy for all conformers. These MLPs are capable of propagating long, stable molecular dynamics trajectories, an attribute that has remained a challenge. We deploy the MLPs in obtaining converged conformational free energy surfaces for flexible molecules via well-tempered metadynamics simulations; this approach provides a hitherto inaccessible route to accurately computing the structural, dynamical and thermodynamical properties of a wide variety of flexible molecular systems. It is further demonstrated that MLPs must be trained on reference datasets with complete coverage of conformational space, including in barrier regions, to achieve stable molecular dynamics trajectories.
Collapse
Affiliation(s)
- Christopher D Williams
- Division of Pharmacy and Optometry, School of Health Sciences, Faculty of Biology, Medicine and Health, The University of Manchester Oxford Road Manchester M13 9PL UK
| | - Jas Kalayan
- Science and Technologies Facilities Council (STFC), Daresbury Laboratory Keckwick Lane, Daresbury Warrington WA4 4AD UK
| | - Neil A Burton
- Department of Chemistry, School of Natural Sciences, Faculty of Science and Engineering, The University of Manchester Oxford Road Manchester M13 9PL UK
| | - Richard A Bryce
- Division of Pharmacy and Optometry, School of Health Sciences, Faculty of Biology, Medicine and Health, The University of Manchester Oxford Road Manchester M13 9PL UK
| |
Collapse
|
3
|
Plé T, Adjoua O, Lagardère L, Piquemal JP. FeNNol: An efficient and flexible library for building force-field-enhanced neural network potentials. J Chem Phys 2024; 161:042502. [PMID: 39051830 DOI: 10.1063/5.0217688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Accepted: 06/28/2024] [Indexed: 07/27/2024] Open
Abstract
Neural network interatomic potentials (NNPs) have recently proven to be powerful tools to accurately model complex molecular systems while bypassing the high numerical cost of ab initio molecular dynamics simulations. In recent years, numerous advances in model architectures as well as the development of hybrid models combining machine-learning (ML) with more traditional, physically motivated, force-field interactions have considerably increased the design space of ML potentials. In this paper, we present FeNNol, a new library for building, training, and running force-field-enhanced neural network potentials. It provides a flexible and modular system for building hybrid models, allowing us to easily combine state-of-the-art embeddings with ML-parameterized physical interaction terms without the need for explicit programming. Furthermore, FeNNol leverages the automatic differentiation and just-in-time compilation features of the Jax Python library to enable fast evaluation of NNPs, shrinking the performance gap between ML potentials and standard force-fields. This is demonstrated with the popular ANI-2x model reaching simulation speeds nearly on par with the AMOEBA polarizable force-field on commodity GPUs (graphics processing units). We hope that FeNNol will facilitate the development and application of new hybrid NNP architectures for a wide range of molecular simulation problems.
Collapse
Affiliation(s)
- Thomas Plé
- Sorbonne Université, LCT, UMR 7616 CNRS, 75005 Paris, France
| | - Olivier Adjoua
- Sorbonne Université, LCT, UMR 7616 CNRS, 75005 Paris, France
| | - Louis Lagardère
- Sorbonne Université, LCT, UMR 7616 CNRS, 75005 Paris, France
| | | |
Collapse
|
4
|
Kalayan J, Ramzan I, Williams CD, Bryce RA, Burton NA. A neural network potential based on pairwise resolved atomic forces and energies. J Comput Chem 2024; 45:1143-1151. [PMID: 38284556 DOI: 10.1002/jcc.27313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/23/2023] [Accepted: 01/05/2024] [Indexed: 01/30/2024]
Abstract
Molecular simulations have become a key tool in molecular and materials design. Machine learning (ML)-based potential energy functions offer the prospect of simulating complex molecular systems efficiently at quantum chemical accuracy. In previous work, we have introduced the ML-based PairF-Net approach to neural network potentials, that adopts a pairwise interatomic scheme to predicting forces within a molecular system. Here, we further develop the PairF-Net model to intrinsically incorporate energy conservation and couple the model to a molecular mechanical (MM) environment within the OpenMM package. The updated PairF-Net model yields energy and force predictions and dynamical distributions in good agreement with the rMD17 dataset of ten small organic molecules in the gas-phase. We further show that these in vacuo ML models of small molecules can be applied to force predictions in aqueous solution via hybrid ML/MM simulations. We present a new benchmark dataset for these ten molecules in solution, obtained from QM/MM simulations, which we denote as rMD17-aq (https://zenodo.org/records/10048644); and assess the ability of PairF-Net to reproduce the molecular energy, atomic forces and dynamical distributions of these solution conformations via ML/MM simulations.
Collapse
Affiliation(s)
- Jas Kalayan
- Division of Pharmacy and Optometry, School of Health Sciences, University of Manchester, Manchester, UK
| | - Ismaeel Ramzan
- Division of Pharmacy and Optometry, School of Health Sciences, University of Manchester, Manchester, UK
- Neural Circuits and Computations Unit, RIKEN Center for Brain Science, Wako, Japan
| | - Christopher D Williams
- Division of Pharmacy and Optometry, School of Health Sciences, University of Manchester, Manchester, UK
| | - Richard A Bryce
- Division of Pharmacy and Optometry, School of Health Sciences, University of Manchester, Manchester, UK
| | - Neil A Burton
- Department of Chemistry, University of Manchester, Manchester, UK
| |
Collapse
|
5
|
Wan K, He J, Shi X. Construction of High Accuracy Machine Learning Interatomic Potential for Surface/Interface of Nanomaterials-A Review. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2305758. [PMID: 37640376 DOI: 10.1002/adma.202305758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 08/24/2023] [Indexed: 08/31/2023]
Abstract
The inherent discontinuity and unique dimensional attributes of nanomaterial surfaces and interfaces bestow them with various exceptional properties. These properties, however, also introduce difficulties for both experimental and computational studies. The advent of machine learning interatomic potential (MLIP) addresses some of the limitations associated with empirical force fields, presenting a valuable avenue for accurate simulations of these surfaces/interfaces of nanomaterials. Central to this approach is the idea of capturing the relationship between system configuration and potential energy, leveraging the proficiency of machine learning (ML) to precisely approximate high-dimensional functions. This review offers an in-depth examination of MLIP principles and their execution and elaborates on their applications in the realm of nanomaterial surface and interface systems. The prevailing challenges faced by this potent methodology are also discussed.
Collapse
Affiliation(s)
- Kaiwei Wan
- Laboratory of Theoretical and Computational Nanoscience, National Center for Nanoscience and Technology, Chinese Academy of Sciences, Beijing, 100190, China
- University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| | - Jianxin He
- Laboratory of Theoretical and Computational Nanoscience, National Center for Nanoscience and Technology, Chinese Academy of Sciences, Beijing, 100190, China
- University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| | - Xinghua Shi
- Laboratory of Theoretical and Computational Nanoscience, National Center for Nanoscience and Technology, Chinese Academy of Sciences, Beijing, 100190, China
- University of Chinese Academy of Sciences, No. 19A Yuquan Road, Beijing, 100049, China
| |
Collapse
|
6
|
Chen M, Jiang X, Zhang L, Chen X, Wen Y, Gu Z, Li X, Zheng M. The emergence of machine learning force fields in drug design. Med Res Rev 2024; 44:1147-1182. [PMID: 38173298 DOI: 10.1002/med.22008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2023] [Revised: 11/29/2023] [Accepted: 12/05/2023] [Indexed: 01/05/2024]
Abstract
In the field of molecular simulation for drug design, traditional molecular mechanic force fields and quantum chemical theories have been instrumental but limited in terms of scalability and computational efficiency. To overcome these limitations, machine learning force fields (MLFFs) have emerged as a powerful tool capable of balancing accuracy with efficiency. MLFFs rely on the relationship between molecular structures and potential energy, bypassing the need for a preconceived notion of interaction representations. Their accuracy depends on the machine learning models used, and the quality and volume of training data sets. With recent advances in equivariant neural networks and high-quality datasets, MLFFs have significantly improved their performance. This review explores MLFFs, emphasizing their potential in drug design. It elucidates MLFF principles, provides development and validation guidelines, and highlights successful MLFF implementations. It also addresses potential challenges in developing and applying MLFFs. The review concludes by illuminating the path ahead for MLFFs, outlining the challenges to be overcome and the opportunities to be harnessed. This inspires researchers to embrace MLFFs in their investigations as a new tool to perform molecular simulations in drug design.
Collapse
Affiliation(s)
- Mingan Chen
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
- School of Physical Science and Technology, ShanghaiTech University, Shanghai, China
- Lingang Laboratory, Shanghai, China
| | - Xinyu Jiang
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing, China
| | - Lehan Zhang
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing, China
| | - Xiaoxu Chen
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing, China
- School of Pharmaceutical Science and Technology, Hangzhou Institute for Advanced Study, UCAS, Hangzhou, China
| | - Yiming Wen
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing, China
- School of Pharmaceutical Science and Technology, Hangzhou Institute for Advanced Study, UCAS, Hangzhou, China
| | - Zhiyong Gu
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing, China
- School of Pharmaceutical Science and Technology, Hangzhou Institute for Advanced Study, UCAS, Hangzhou, China
| | - Xutong Li
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing, China
| | - Mingyue Zheng
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing, China
- School of Pharmaceutical Science and Technology, Hangzhou Institute for Advanced Study, UCAS, Hangzhou, China
| |
Collapse
|
7
|
Wang Y, Inizan TJ, Liu C, Piquemal JP, Ren P. Incorporating Neural Networks into the AMOEBA Polarizable Force Field. J Phys Chem B 2024; 128:2381-2388. [PMID: 38445577 PMCID: PMC10985787 DOI: 10.1021/acs.jpcb.3c08166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2024]
Abstract
Neural network potentials (NNPs) offer significant promise to bridge the gap between the accuracy of quantum mechanics and the efficiency of molecular mechanics in molecular simulation. Most NNPs rely on the locality assumption that ensures the model's transferability and scalability and thus lack the treatment of long-range interactions, which are essential for molecular systems in the condensed phase. Here we present an integrated hybrid model, AMOEBA+NN, which combines the AMOEBA potential for the short- and long-range noncovalent atomic interactions and an NNP to capture the remaining local covalent contributions. The AMOEBA+NN model was trained on the conformational energy of the ANI-1x data set and tested on several external data sets ranging from small molecules to tetrapeptides. The hybrid model demonstrated substantial improvements over the baseline models in term of accuracy as the molecule size increased, suggesting its potential as a next-generation approach for chemically accurate molecular simulations.
Collapse
Affiliation(s)
- Yanxing Wang
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Théo Jaffrelot Inizan
- Sorbonne Université, Laboratoire de Chimie Théorique, UMR 7616 CNRS, Paris 75005, France
| | - Chengwen Liu
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Jean-Philip Piquemal
- Sorbonne Université, Laboratoire de Chimie Théorique, UMR 7616 CNRS, Paris 75005, France
| | - Pengyu Ren
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
| |
Collapse
|
8
|
Ding Y, Huang J. Implementation and Validation of an OpenMM Plugin for the Deep Potential Representation of Potential Energy. Int J Mol Sci 2024; 25:1448. [PMID: 38338727 PMCID: PMC10855459 DOI: 10.3390/ijms25031448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 01/08/2024] [Accepted: 01/11/2024] [Indexed: 02/12/2024] Open
Abstract
Machine learning potentials, particularly the deep potential (DP) model, have revolutionized molecular dynamics (MD) simulations, striking a balance between accuracy and computational efficiency. To facilitate the DP model's integration with the popular MD engine OpenMM, we have developed a versatile OpenMM plugin. This plugin supports a range of applications, from conventional MD simulations to alchemical free energy calculations and hybrid DP/MM simulations. Our extensive validation tests encompassed energy conservation in microcanonical ensemble simulations, fidelity in canonical ensemble generation, and the evaluation of the structural, transport, and thermodynamic properties of bulk water. The introduction of this plugin is expected to significantly expand the application scope of DP models within the MD simulation community, representing a major advancement in the field.
Collapse
Affiliation(s)
- Ye Ding
- College of Life Sciences, Zhejiang University, Hangzhou 310027, China;
- School of Life Sciences, Westlake University, Hangzhou 310024, China
- Westlake AI Therapeutics Lab, Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou 310024, China
| | - Jing Huang
- School of Life Sciences, Westlake University, Hangzhou 310024, China
- Westlake AI Therapeutics Lab, Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou 310024, China
| |
Collapse
|
9
|
Ding Y, Huang J. DP/MM: A Hybrid Model for Zinc-Protein Interactions in Molecular Dynamics. J Phys Chem Lett 2024; 15:616-627. [PMID: 38198685 DOI: 10.1021/acs.jpclett.3c03158] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2024]
Abstract
Zinc-containing proteins are vital for many biological processes, yet accurately modeling them using classical force fields is hindered by complicated polarization and charge transfer effects. This study introduces DP/MM, a hybrid force field scheme that utilizes a deep potential model to correct the atomic forces of zinc ions and their coordinated atoms, elevating them from MM to QM levels of accuracy. Trained on the difference between MM and QM atomic forces across diverse zinc coordination groups, the DP/MM model faithfully reproduces structural characteristics of zinc coordination during simulations, such as the tetrahedral coordination of Cys4 and Cys3His1 groups. Furthermore, DP/MM allows water exchange in the zinc coordination environment. With its unique blend of accuracy, efficiency, flexibility, and transferability, DP/MM serves as a valuable tool for studying structures and dynamics of zinc-containing proteins and also represents a pioneering approach in the evolving landscape of machine learning potentials for molecular modeling.
Collapse
Affiliation(s)
- Ye Ding
- College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang 310027, China
- School of Life Sciences, Westlake University, Hangzhou, Zhejiang 310024, China
- Westlake AI Therapeutics Lab, Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang 310024, China
| | - Jing Huang
- School of Life Sciences, Westlake University, Hangzhou, Zhejiang 310024, China
- Westlake AI Therapeutics Lab, Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang 310024, China
| |
Collapse
|
10
|
Galvelis R, Varela-Rial A, Doerr S, Fino R, Eastman P, Markland TE, Chodera JD, De Fabritiis G. NNP/MM: Accelerating Molecular Dynamics Simulations with Machine Learning Potentials and Molecular Mechanics. J Chem Inf Model 2023; 63:5701-5708. [PMID: 37694852 PMCID: PMC10577237 DOI: 10.1021/acs.jcim.3c00773] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Machine learning potentials have emerged as a means to enhance the accuracy of biomolecular simulations. However, their application is constrained by the significant computational cost arising from the vast number of parameters compared with traditional molecular mechanics. To tackle this issue, we introduce an optimized implementation of the hybrid method (NNP/MM), which combines a neural network potential (NNP) and molecular mechanics (MM). This approach models a portion of the system, such as a small molecule, using NNP while employing MM for the remaining system to boost efficiency. By conducting molecular dynamics (MD) simulations on various protein-ligand complexes and metadynamics (MTD) simulations on a ligand, we showcase the capabilities of our implementation of NNP/MM. It has enabled us to increase the simulation speed by ∼5 times and achieve a combined sampling of 1 μs for each complex, marking the longest simulations ever reported for this class of simulations.
Collapse
Affiliation(s)
- Raimondas Galvelis
- Acellera Labs, C/Doctor Trueta 183, Barcelona 08005, Spain
- Computational Science Laboratory, Universitat Pompeu Fabra, PRBB, C/Doctor Aiguader 88, Barcelona 08003, Spain
| | - Alejandro Varela-Rial
- Acellera Ltd, Devonshire House 582 Honeypot Lane, Stanmore Middlesex, HA7 1JS, United Kingdom
| | - Stefan Doerr
- Acellera Ltd, Devonshire House 582 Honeypot Lane, Stanmore Middlesex, HA7 1JS, United Kingdom
| | - Roberto Fino
- Acellera Labs, C/Doctor Trueta 183, Barcelona 08005, Spain
| | - Peter Eastman
- Department of Chemistry, Stanford University, 337 Campus Drive, Stanford, California 94305, United States
| | - Thomas E Markland
- Department of Chemistry, Stanford University, 337 Campus Drive, Stanford, California 94305, United States
| | - John D Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, New York 10065, United States
| | - Gianni De Fabritiis
- Computational Science Laboratory, Universitat Pompeu Fabra, PRBB, C/Doctor Aiguader 88, Barcelona 08003, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluis Companys 23, Barcelona 08010, Spain
- Acellera Ltd, Devonshire House 582 Honeypot Lane, Stanmore Middlesex, HA7 1JS, United Kingdom
| |
Collapse
|
11
|
Raghavan B, Paulikat M, Ahmad K, Callea L, Rizzi A, Ippoliti E, Mandelli D, Bonati L, De Vivo M, Carloni P. Drug Design in the Exascale Era: A Perspective from Massively Parallel QM/MM Simulations. J Chem Inf Model 2023; 63:3647-3658. [PMID: 37319347 PMCID: PMC10302481 DOI: 10.1021/acs.jcim.3c00557] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Indexed: 06/17/2023]
Abstract
The initial phases of drug discovery - in silico drug design - could benefit from first principle Quantum Mechanics/Molecular Mechanics (QM/MM) molecular dynamics (MD) simulations in explicit solvent, yet many applications are currently limited by the short time scales that this approach can cover. Developing scalable first principle QM/MM MD interfaces fully exploiting current exascale machines - so far an unmet and crucial goal - will help overcome this problem, opening the way to the study of the thermodynamics and kinetics of ligand binding to protein with first principle accuracy. Here, taking two relevant case studies involving the interactions of ligands with rather large enzymes, we showcase the use of our recently developed massively scalable Multiscale Modeling in Computational Chemistry (MiMiC) QM/MM framework (currently using DFT to describe the QM region) to investigate reactions and ligand binding in enzymes of pharmacological relevance. We also demonstrate for the first time strong scaling of MiMiC-QM/MM MD simulations with parallel efficiency of ∼70% up to >80,000 cores. Thus, among many others, the MiMiC interface represents a promising candidate toward exascale applications by combining machine learning with statistical mechanics based algorithms tailored for exascale supercomputers.
Collapse
Affiliation(s)
- Bharath Raghavan
- Computational
Biomedicine, Institute of Advanced Simulations IAS-5/Institute for
Neuroscience and Medicine INM-9, Forschungszentrum
Jülich GmbH, Jülich 52428, Germany
- Department
of Physics, RWTH Aachen University, Aachen 52074, Germany
| | - Mirko Paulikat
- Computational
Biomedicine, Institute of Advanced Simulations IAS-5/Institute for
Neuroscience and Medicine INM-9, Forschungszentrum
Jülich GmbH, Jülich 52428, Germany
| | - Katya Ahmad
- Computational
Biomedicine, Institute of Advanced Simulations IAS-5/Institute for
Neuroscience and Medicine INM-9, Forschungszentrum
Jülich GmbH, Jülich 52428, Germany
| | - Lara Callea
- Department
of Earth and Environmental Sciences, University
of Milano-Bicocca, Piazza della Scienza 1, 20126 Milan, Italy
| | - Andrea Rizzi
- Computational
Biomedicine, Institute of Advanced Simulations IAS-5/Institute for
Neuroscience and Medicine INM-9, Forschungszentrum
Jülich GmbH, Jülich 52428, Germany
- Atomistic
Simulations, Italian Institute of Technology, Genova 16163, Italy
| | - Emiliano Ippoliti
- Computational
Biomedicine, Institute of Advanced Simulations IAS-5/Institute for
Neuroscience and Medicine INM-9, Forschungszentrum
Jülich GmbH, Jülich 52428, Germany
| | - Davide Mandelli
- Computational
Biomedicine, Institute of Advanced Simulations IAS-5/Institute for
Neuroscience and Medicine INM-9, Forschungszentrum
Jülich GmbH, Jülich 52428, Germany
| | - Laura Bonati
- Department
of Earth and Environmental Sciences, University
of Milano-Bicocca, Piazza della Scienza 1, 20126 Milan, Italy
| | - Marco De Vivo
- Molecular
Modelling and Drug Discovery, Italian Institute
of Technology, Genova 16163, Italy
| | - Paolo Carloni
- Computational
Biomedicine, Institute of Advanced Simulations IAS-5/Institute for
Neuroscience and Medicine INM-9, Forschungszentrum
Jülich GmbH, Jülich 52428, Germany
- Department
of Physics and Universitätsklinikum, RWTH Aachen University, Aachen 52074, Germany
| |
Collapse
|
12
|
Jaffrelot Inizan T, Plé T, Adjoua O, Ren P, Gökcan H, Isayev O, Lagardère L, Piquemal JP. Scalable hybrid deep neural networks/polarizable potentials biomolecular simulations including long-range effects. Chem Sci 2023; 14:5438-5452. [PMID: 37234902 PMCID: PMC10208042 DOI: 10.1039/d2sc04815a] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 04/03/2023] [Indexed: 07/28/2023] Open
Abstract
Deep-HP is a scalable extension of the Tinker-HP multi-GPU molecular dynamics (MD) package enabling the use of Pytorch/TensorFlow Deep Neural Network (DNN) models. Deep-HP increases DNNs' MD capabilities by orders of magnitude offering access to ns simulations for 100k-atom biosystems while offering the possibility of coupling DNNs to any classical (FFs) and many-body polarizable (PFFs) force fields. It allows therefore the introduction of the ANI-2X/AMOEBA hybrid polarizable potential designed for ligand binding studies where solvent-solvent and solvent-solute interactions are computed with the AMOEBA PFF while solute-solute ones are computed by the ANI-2X DNN. ANI-2X/AMOEBA explicitly includes AMOEBA's physical long-range interactions via an efficient Particle Mesh Ewald implementation while preserving ANI-2X's solute short-range quantum mechanical accuracy. The DNN/PFF partition can be user-defined allowing for hybrid simulations to include key ingredients of biosimulation such as polarizable solvents, polarizable counter ions, etc.… ANI-2X/AMOEBA is accelerated using a multiple-timestep strategy focusing on the model's contributions to low-frequency modes of nuclear forces. It primarily evaluates AMOEBA forces while including ANI-2X ones only via correction-steps resulting in an order of magnitude acceleration over standard Velocity Verlet integration. Simulating more than 10 μs, we compute charged/uncharged ligand solvation free energies in 4 solvents, and absolute binding free energies of host-guest complexes from SAMPL challenges. ANI-2X/AMOEBA average errors are discussed in terms of statistical uncertainty and appear in the range of chemical accuracy compared to experiment. The availability of the Deep-HP computational platform opens the path towards large-scale hybrid DNN simulations, at force-field cost, in biophysics and drug discovery.
Collapse
Affiliation(s)
- Théo Jaffrelot Inizan
- Sorbonne Université, Laboratoire de Chimie Théorique UMR 7616 CNRS Paris 75005 France
| | - Thomas Plé
- Sorbonne Université, Laboratoire de Chimie Théorique UMR 7616 CNRS Paris 75005 France
| | - Olivier Adjoua
- Sorbonne Université, Laboratoire de Chimie Théorique UMR 7616 CNRS Paris 75005 France
| | - Pengyu Ren
- Department of Biomedical Engineering, University of Texas at Austin Austin Texas USA
| | - Hatice Gökcan
- Department of Chemistry, Carnegie Mellon University Pittsburgh Pennsylvania USA
| | - Olexandr Isayev
- Department of Chemistry, Carnegie Mellon University Pittsburgh Pennsylvania USA
| | - Louis Lagardère
- Sorbonne Université, Laboratoire de Chimie Théorique UMR 7616 CNRS Paris 75005 France
- Sorbonne Université, Institut Parisien de Chimie Physique et Théorique FR 2622 CNRS Paris France
| | - Jean-Philip Piquemal
- Sorbonne Université, Laboratoire de Chimie Théorique UMR 7616 CNRS Paris 75005 France
- Department of Biomedical Engineering, University of Texas at Austin Austin Texas USA
| |
Collapse
|
13
|
Boswell Z, Verga JU, Mackle J, Guerrero-Vazquez K, Thomas OP, Cray J, Wolf BJ, Choo YM, Croot P, Hamann MT, Hardiman G. In-Silico Approaches for the Screening and Discovery of Broad-Spectrum Marine Natural Product Antiviral Agents Against Coronaviruses. Infect Drug Resist 2023; 16:2321-2338. [PMID: 37155475 PMCID: PMC10122865 DOI: 10.2147/idr.s395203] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2022] [Accepted: 03/16/2023] [Indexed: 05/10/2023] Open
Abstract
The urgent need for SARS-CoV-2 controls has led to a reassessment of approaches to identify and develop natural product inhibitors of zoonotic, highly virulent, and rapidly emerging viruses. There are yet no clinically approved broad-spectrum antivirals available for beta-coronaviruses. Discovery pipelines for pan-virus medications against a broad range of betacoronaviruses are therefore a priority. A variety of marine natural product (MNP) small molecules have shown inhibitory activity against viral species. Access to large data caches of small molecule structural information is vital to finding new pharmaceuticals. Increasingly, molecular docking simulations are being used to narrow the space of possibilities and generate drug leads. Combining in-silico methods, augmented by metaheuristic optimization and machine learning (ML) allows the generation of hits from within a virtual MNP library to narrow screens for novel targets against coronaviruses. In this review article, we explore current insights and techniques that can be leveraged to generate broad-spectrum antivirals against betacoronaviruses using in-silico optimization and ML. ML approaches are capable of simultaneously evaluating different features for predicting inhibitory activity. Many also provide a semi-quantitative measure of feature relevance and can guide in selecting a subset of features relevant for inhibition of SARS-CoV-2.
Collapse
Affiliation(s)
- Zachary Boswell
- School of Biological Sciences and Institute for Global Security, Queen's University, Belfast, Northern Ireland, UK
| | - Jacopo Umberto Verga
- School of Biological Sciences and Institute for Global Security, Queen's University, Belfast, Northern Ireland, UK
- Genomic Data Science, University of Galway, Galway, Ireland
| | - James Mackle
- School of Biological Sciences and Institute for Global Security, Queen's University, Belfast, Northern Ireland, UK
| | | | - Olivier P Thomas
- School of Biological and Chemical Sciences, Ryan Institute, University of Galway, Galway, H91TK33Ireland
| | - James Cray
- Department of Biomedical Education and Anatomy, College of Medicine and Division of Biosciences, College of Dentistry, Ohio State University, Columbus, OH, USA
| | - Bethany J Wolf
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC, USA
| | - Yeun-Mun Choo
- Department of Chemistry, University of Malaya, Kuala Lumpur, Malaysia
| | - Peter Croot
- Irish Centre for Research in Applied Geoscience, Earth and Ocean Sciences and Ryan Institute, School of Natural Sciences, University of Galway, Galway, Ireland
| | - Mark T Hamann
- Departments of Drug Discovery and Biomedical Sciences and Public Health, Colleges of Pharmacy and Medicine, Medical University of South Carolina, Charleston, SC, USA
| | - Gary Hardiman
- School of Biological Sciences and Institute for Global Security, Queen's University, Belfast, Northern Ireland, UK
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC, USA
- Department of Medicine, Medical University of South Carolina, Charleston, SC, USA
| |
Collapse
|
14
|
Morado J, Mortenson PN, Nissink JWM, Essex JW, Skylaris CK. Does a Machine-Learned Potential Perform Better Than an Optimally Tuned Traditional Force Field? A Case Study on Fluorohydrins. J Chem Inf Model 2023; 63:2810-2827. [PMID: 37071825 PMCID: PMC10170518 DOI: 10.1021/acs.jcim.2c01510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/20/2023]
Abstract
We present a comparative study that evaluates the performance of a machine learning potential (ANI-2x), a conventional force field (GAFF), and an optimally tuned GAFF-like force field in the modeling of a set of 10 γ-fluorohydrins that exhibit a complex interplay between intra- and intermolecular interactions in determining conformer stability. To benchmark the performance of each molecular model, we evaluated their energetic, geometric, and sampling accuracies relative to quantum-mechanical data. This benchmark involved conformational analysis both in the gas phase and chloroform solution. We also assessed the performance of the aforementioned molecular models in estimating nuclear spin-spin coupling constants by comparing their predictions to experimental data available in chloroform. The results and discussion presented in this study demonstrate that ANI-2x tends to predict stronger-than-expected hydrogen bonding and overstabilize global minima and shows problems related to inadequate description of dispersion interactions. Furthermore, while ANI-2x is a viable model for modeling in the gas phase, conventional force fields still play an important role, especially for condensed-phase simulations. Overall, this study highlights the strengths and weaknesses of each model, providing guidelines for the use and future development of force fields and machine learning potentials.
Collapse
Affiliation(s)
- João Morado
- School of Chemistry, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom
| | - Paul N Mortenson
- Astex Pharmaceuticals, 436 Cambridge Science Park, Milton Road, Cambridge CB4 0QA, United Kingdom
| | - J Willem M Nissink
- Computational Chemistry, Oncology R&D, AstraZeneca, Cambridge CB4 0WG, United Kingdom
| | - Jonathan W Essex
- School of Chemistry, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom
| | - Chris-Kriton Skylaris
- School of Chemistry, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom
| |
Collapse
|
15
|
Eastman P, Behara PK, Dotson DL, Galvelis R, Herr JE, Horton JT, Mao Y, Chodera JD, Pritchard BP, Wang Y, De Fabritiis G, Markland TE. SPICE, A Dataset of Drug-like Molecules and Peptides for Training Machine Learning Potentials. Sci Data 2023; 10:11. [PMID: 36599873 PMCID: PMC9813265 DOI: 10.1038/s41597-022-01882-6] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 12/01/2022] [Indexed: 01/05/2023] Open
Abstract
Machine learning potentials are an important tool for molecular simulation, but their development is held back by a shortage of high quality datasets to train them on. We describe the SPICE dataset, a new quantum chemistry dataset for training potentials relevant to simulating drug-like small molecules interacting with proteins. It contains over 1.1 million conformations for a diverse set of small molecules, dimers, dipeptides, and solvated amino acids. It includes 15 elements, charged and uncharged molecules, and a wide range of covalent and non-covalent interactions. It provides both forces and energies calculated at the ωB97M-D3(BJ)/def2-TZVPPD level of theory, along with other useful quantities such as multipole moments and bond orders. We train a set of machine learning potentials on it and demonstrate that they can achieve chemical accuracy across a broad region of chemical space. It can serve as a valuable resource for the creation of transferable, ready to use potential functions for use in molecular simulations.
Collapse
Affiliation(s)
- Peter Eastman
- Department of Chemistry, Stanford University, Stanford, CA, 94305, USA.
| | - Pavan Kumar Behara
- Department of Pharmaceutical Sciences, University of California, Irvine, CA, 92697, USA
| | - David L Dotson
- The Open Force Field Initiative, Open Molecular Software Foundation, Davis, CA, 95616, USA
| | | | - John E Herr
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Josh T Horton
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne, NE1 7RU, United Kingdom
| | - Yuezhi Mao
- Department of Chemistry, Stanford University, Stanford, CA, 94305, USA
| | - John D Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| | - Benjamin P Pritchard
- Molecular Sciences Software Institute, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24060, USA
| | - Yuanqing Wang
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
- Graduate Program in Physiology, Biophysics, and Systems Biology, Weill Cornell Graduate School of Medical Sciences, New York, NY, 10065, USA
| | - Gianni De Fabritiis
- Acellera Labs, Doctor Trueta 183, 08005, Barcelona, Spain
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), Carrer Dr. Aiguader 88, 08003, Barcelona, Spain and ICREA, Passeig Lluis Companys 23, 08010, Barcelona, Spain
| | - Thomas E Markland
- Department of Chemistry, Stanford University, Stanford, CA, 94305, USA
| |
Collapse
|
16
|
Kong L, Bryce RA. Modeling pyranose ring pucker in carbohydrates using machine learning and semi-empirical quantum chemical methods. J Comput Chem 2022; 43:2009-2022. [PMID: 36165294 PMCID: PMC9828179 DOI: 10.1002/jcc.27000] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Revised: 08/17/2022] [Accepted: 08/22/2022] [Indexed: 01/12/2023]
Abstract
Pyranose ring pucker is a key coordinate governing the structure, interactions and reactivity of carbohydrates. We assess the ability of the machine learning potentials, ANI-1ccx and ANI-2x, and the GFN2-xTB semiempirical quantum chemical method, to model ring pucker conformers of five monosaccharides and oxane in the gas phase. Relative to coupled-cluster quantum mechanical calculations, we find that ANI-1ccx most accurately reproduces the ring pucker energy landscape for these molecules, with a correlation coefficient r2 of 0.83. This correlation in relative energies lowers to values of 0.70 for ANI-2x and 0.60 for GFN2-xTB. The ANI-1ccx also provides the most accurate estimate of the energetics of the 4 C1 -to-1 C4 minimum energy pathway for the six molecules. All three models reproduce chair more accurately than non-chair geometries. Analysis of small model molecules suggests that the ANI-1ccx model favors puckers with equatorial hydrogen bonding substituents; that ANI-2x and GFN2-xTB models overstabilize conformers with axially oriented groups; and that the endo-anomeric effect is overestimated by the machine learning models and underestimated via the GFN2-xTB method. While the pucker conformers considered in this study correspond to a gas phase environment, the accuracy and computational efficiency of the ANI-1ccx approach in modeling ring pucker in vacuo provides a promising basis for future evaluation and application to condensed phase environments.
Collapse
Affiliation(s)
- Linghan Kong
- Division of Pharmacy and Optometry, School of Health Sciences, Manchester Academic Health Sciences CentreUniversity of ManchesterManchesterUK
| | - Richard A. Bryce
- Division of Pharmacy and Optometry, School of Health Sciences, Manchester Academic Health Sciences CentreUniversity of ManchesterManchesterUK
| |
Collapse
|
17
|
Zhang Y, Luo M, Wu P, Wu S, Lee TY, Bai C. Application of Computational Biology and Artificial Intelligence in Drug Design. Int J Mol Sci 2022; 23:13568. [PMID: 36362355 PMCID: PMC9658956 DOI: 10.3390/ijms232113568] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Revised: 10/29/2022] [Accepted: 11/03/2022] [Indexed: 08/24/2023] Open
Abstract
Traditional drug design requires a great amount of research time and developmental expense. Booming computational approaches, including computational biology, computer-aided drug design, and artificial intelligence, have the potential to expedite the efficiency of drug discovery by minimizing the time and financial cost. In recent years, computational approaches are being widely used to improve the efficacy and effectiveness of drug discovery and pipeline, leading to the approval of plenty of new drugs for marketing. The present review emphasizes on the applications of these indispensable computational approaches in aiding target identification, lead discovery, and lead optimization. Some challenges of using these approaches for drug design are also discussed. Moreover, we propose a methodology for integrating various computational techniques into new drug discovery and design.
Collapse
Affiliation(s)
- Yue Zhang
- School of Life and Health Sciences, School of Medicine, The Chinese University of Hong Kong, Shenzhen 518172, China
- School of Chemistry and Materials Science, University of Science and Technology of China, Hefei 230026, China
- Warshel Institute for Computational Biology, Shenzhen 518172, China
| | - Mengqi Luo
- School of Life and Health Sciences, School of Medicine, The Chinese University of Hong Kong, Shenzhen 518172, China
- South China Hospital, Health Science Center, Shenzhen University, Shenzhen 518116, China
| | - Peng Wu
- School of Biomedical Engineering, Health Science Center, Shenzhen University, Shenzhen 518055, China
| | - Song Wu
- South China Hospital, Health Science Center, Shenzhen University, Shenzhen 518116, China
| | - Tzong-Yi Lee
- School of Life and Health Sciences, School of Medicine, The Chinese University of Hong Kong, Shenzhen 518172, China
- Warshel Institute for Computational Biology, Shenzhen 518172, China
| | - Chen Bai
- School of Life and Health Sciences, School of Medicine, The Chinese University of Hong Kong, Shenzhen 518172, China
- Warshel Institute for Computational Biology, Shenzhen 518172, China
| |
Collapse
|
18
|
Bieniek MK, Cree B, Pirie R, Horton JT, Tatum NJ, Cole DJ. An open-source molecular builder and free energy preparation workflow. Commun Chem 2022; 5:136. [PMID: 36320862 PMCID: PMC9607723 DOI: 10.1038/s42004-022-00754-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Accepted: 10/11/2022] [Indexed: 01/27/2023] Open
Abstract
Automated free energy calculations for the prediction of binding free energies of congeneric series of ligands to a protein target are growing in popularity, but building reliable initial binding poses for the ligands is challenging. Here, we introduce the open-source FEgrow workflow for building user-defined congeneric series of ligands in protein binding pockets for input to free energy calculations. For a given ligand core and receptor structure, FEgrow enumerates and optimises the bioactive conformations of the grown functional group(s), making use of hybrid machine learning/molecular mechanics potential energy functions where possible. Low energy structures are optionally scored using the gnina convolutional neural network scoring function, and output for more rigorous protein-ligand binding free energy predictions. We illustrate use of the workflow by building and scoring binding poses for ten congeneric series of ligands bound to targets from a standard, high quality dataset of protein-ligand complexes. Furthermore, we build a set of 13 inhibitors of the SARS-CoV-2 main protease from the literature, and use free energy calculations to retrospectively compute their relative binding free energies. FEgrow is freely available at https://github.com/cole-group/FEgrow, along with a tutorial.
Collapse
Affiliation(s)
- Mateusz K. Bieniek
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne, NE1 7RU UK
| | - Ben Cree
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne, NE1 7RU UK
| | - Rachael Pirie
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne, NE1 7RU UK
| | - Joshua T. Horton
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne, NE1 7RU UK
| | - Natalie J. Tatum
- Newcastle University Centre for Cancer, Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne, NE2 4HH UK
| | - Daniel J. Cole
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne, NE1 7RU UK
| |
Collapse
|
19
|
Gallarati S, Fabregat R, Juraskova V, Inizan TJ, Corminboeuf C. How Robust Is the Reversible Steric Shielding Strategy for Photoswitchable Organocatalysts? J Org Chem 2022; 87:8849-8857. [PMID: 35762705 PMCID: PMC9295146 DOI: 10.1021/acs.joc.1c02991] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A highly appealing strategy to modulate a catalyst's activity and/or selectivity in a dynamic and noninvasive way is to incorporate a photoresponsive unit into a catalytically competent molecule. However, the description of the photoinduced conformational or structural changes that alter the catalyst's intrinsic reactivity is often reduced to a handful of intuitive static representations, which can struggle to capture the complexity of flexible organocatalysts. Here, we show how a comprehensive exploration of the free energy landscape of N-alkylated azobenzene-tethered piperidine catalysts is essential to unravel the conformational characteristics of each configurational state and explain the experimentally observed reactivity trends. Mapping the catalysts' conformational space highlights the existence of false ON or OFF states that lower their switching ability. Our findings expose the challenges associated with the realization of a reversible steric shielding for the photocontrol of Brønsted basicity of piperidine photoswitchable organocatalysts.
Collapse
Affiliation(s)
- Simone Gallarati
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland
| | - Raimon Fabregat
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland
| | - Veronika Juraskova
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland
| | - Theo Jaffrelot Inizan
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland.,National Center for Competence in Research─Catalysis (NCCR-Catalysis), Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland.,National Center for Computational Design and Discovery of Novel Materials (MARVEL), Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland
| |
Collapse
|
20
|
Integration of machine learning with computational structural biology of plants. Biochem J 2022; 479:921-928. [PMID: 35484946 DOI: 10.1042/bcj20200942] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 04/01/2022] [Accepted: 04/06/2022] [Indexed: 11/17/2022]
Abstract
Computational structural biology of proteins has developed rapidly in recent decades with the development of new computational tools and the advancement of computing hardware. However, while these techniques have widely been used to make advancements in human medicine, these methods have seen less utilization in the plant sciences. In the last several years, machine learning methods have gained popularity in computational structural biology. These methods have enabled the development of new tools which are able to address the major challenges that have hampered the wide adoption of the computational structural biology of plants. This perspective examines the remaining challenges in computational structural biology and how the development of machine learning techniques enables more in-depth computational structural biology of plants.
Collapse
|
21
|
Hsueh SCC, Nijland M, Peng X, Hilton B, Plotkin SS. First Principles Calculation of Protein-Protein Dimer Affinities of ALS-Associated SOD1 Mutants. Front Mol Biosci 2022; 9:845013. [PMID: 35402516 PMCID: PMC8988244 DOI: 10.3389/fmolb.2022.845013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Accepted: 02/08/2022] [Indexed: 01/03/2023] Open
Abstract
Cu,Zn superoxide dismutase (SOD1) is a 32 kDa homodimer that converts toxic oxygen radicals in neurons to less harmful species. The dimerization of SOD1 is essential to the stability of the protein. Monomerization increases the likelihood of SOD1 misfolding into conformations associated with aggregation, cellular toxicity, and neuronal death in familial amyotrophic lateral sclerosis (fALS). The ubiquity of disease-associated mutations throughout the primary sequence of SOD1 suggests an important role of physicochemical processes, including monomerization of SOD1, in the pathology of the disease. Herein, we use a first-principles statistical mechanics method to systematically calculate the free energy of dimer binding for SOD1 using molecular dynamics, which involves sequentially computing conformational, orientational, and separation distance contributions to the binding free energy. We consider the effects of two ALS-associated mutations in SOD1 protein on dimer stability, A4V and D101N, as well as the role of metal binding and disulfide bond formation. We find that the penalty for dimer formation arising from the conformational entropy of disordered loops in SOD1 is significantly larger than that for other protein-protein interactions previously considered. In the case of the disulfide-reduced protein, this leads to a bound complex whose formation is energetically disfavored. Somewhat surprisingly, the loop free energy penalty upon dimerization is still significant for the holoprotein, despite the increased structural order induced by the bound metal cations. This resulted in a surprisingly modest increase in dimer binding free energy of only about 1.5 kcal/mol upon metalation of the protein, suggesting that the most significant stabilizing effects of metalation are on folding stability rather than dimer binding stability. The mutant A4V has an unstable dimer due to weakened monomer-monomer interactions, which are manifested in the calculation by a separation free energy surface with a lower barrier. The mutant D101N has a stable dimer partially due to an unusually rigid β-barrel in the free monomer. D101N also exhibits anticooperativity in loop folding upon dimerization. These computational calculations are, to our knowledge, the most quantitatively accurate calculations of dimer binding stability in SOD1 to date.
Collapse
Affiliation(s)
- Shawn C. C. Hsueh
- Department of Physics and Astronomy, University of British Columbia, Vancouver, BC, Canada
| | - Mark Nijland
- Department of Physics and Astronomy, University of British Columbia, Vancouver, BC, Canada
- Laboratory of Organic Chemistry, Wageningen University and Research, Wageningen, Netherlands
- Laboratory of Physical Chemistry and Soft Matter, Wageningen University and Research, Wageningen, Netherlands
| | - Xubiao Peng
- Department of Physics and Astronomy, University of British Columbia, Vancouver, BC, Canada
- Center for Quantum Technology Research, School of Physics, Beijing Institute of Technology, Beijing, China
| | - Benjamin Hilton
- Department of Physics and Astronomy, University of British Columbia, Vancouver, BC, Canada
- Imperial College London, London, United Kingdom
| | - Steven S. Plotkin
- Department of Physics and Astronomy, University of British Columbia, Vancouver, BC, Canada
- Genome Science and Technology Program, University of British Columbia, Vancouver, BC, Canada
| |
Collapse
|
22
|
|
23
|
Gokcan H, Isayev O. Learning molecular potentials with neural networks. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1564] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Hatice Gokcan
- Department of Chemistry, Mellon College of Science Carnegie Mellon University Pittsburgh Pennsylvania USA
| | - Olexandr Isayev
- Department of Chemistry, Mellon College of Science Carnegie Mellon University Pittsburgh Pennsylvania USA
| |
Collapse
|
24
|
Zlotnikov ID, Kudryashova EV. Computer simulation of the Receptor-Ligand Interactions of Mannose Receptor CD206 in Comparison with the Lectin Concanavalin A Model. BIOCHEMISTRY. BIOKHIMIIA 2022; 87:54-69. [PMID: 35491020 PMCID: PMC8769089 DOI: 10.1134/s0006297922010059] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Computer modeling of complexation of mono- and oligosaccharide ligands with the main (fourth) carbohydrate-binding domain of the mannose receptor CD206 (CRD4), as well as with the model receptor concanavalin A (ConA), was carried out for the first time, using methods of molecular dynamics and neural network analysis. ConA was shown to be a relevant model of CD206 (CRD4) due to similarity of the structural organization of the binding sites and high correlation of the values of free energies of complexation between the literature data and computer modeling (r > 0.9). Role of the main factors affecting affinity of the ligand–receptor interactions is discussed: the number and nature of carbohydrate residues, presence of Me-group in the O1 position, type of the glycoside bond in dimannose. Complexation of ConA and CD206 with ligands is shown to be energetically caused by electrostatic interactions (E) of the charged residues (Asn, Asp, Arg) with oxygen and hydrogen atoms in carbohydrates; contributions of hydrophobic and van der Waals components is lower. Possibility of the additional stabilization of complexes due to the CH–π stacking interactions of Tyr with the Man plane is discussed. The role of calcium and manganese ions in binding ligands has been studied. The values of free energies of complexation calculated in the course of molecular dynamics simulation correlate with experimental data (published for the model ConA): correlation coefficient r = 0.68. The Pafnucy neural network was trained based on the set of PDBbind2020 ligand–receptor complexes, which significantly increased accuracy of the energy predictions to r = 0.8 and 0.82 for CD206 and ConA receptors, respectively. A model of normalization of the complexation energy values for calculating the relevant values of ΔGbind, Kd is proposed. Based on the developed technique, values of the dissociation constants of a series of CD206 complexes with nine carbohydrate ligands of different structures were determined, which were not previously known. The obtained data open up possibilities for using computer modeling for the development of optimal drug carriers capable of active macrophage targeting, and also determine the limits of applicability of using ConA as a relevant model for studying parameters of the CD206 binding to various carbohydrate ligands.
Collapse
Affiliation(s)
- Igor D Zlotnikov
- Faculty of Chemistry, Lomonosov Moscow State University, Moscow, 119991, Russia
| | - Elena V Kudryashova
- Faculty of Chemistry, Lomonosov Moscow State University, Moscow, 119991, Russia.
| |
Collapse
|
25
|
Yilmaz DE, Woodward WH, van Duin ACT. Machine Learning-Assisted Hybrid ReaxFF Simulations. J Chem Theory Comput 2021; 17:6705-6712. [PMID: 34644081 DOI: 10.1021/acs.jctc.1c00523] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We have developed a machine learning (ML)-assisted Hybrid ReaxFF simulation method ("Hybrid/Reax"), which alternates reactive and non-reactive molecular dynamics simulations with the assistance of ML models to simulate phenomena that require longer time scales and/or larger systems than are typically accessible to ReaxFF. Hybrid/Reax uses a specialized tracking tool during the reactive simulations to further accelerate chemical reactions. Non-reactive simulations are used to equilibrate the system after the reactive simulation stage. ML models are used between reactive and non-reactive stages to predict non-reactive force field parameters of the system based on the updated bond topology. Hybrid/Reax simulation cycles can be continued until the desired chemical reactions are observed. As a case study, this method was used to study the cross-linking of a polyethylene (PE) matrix analogue (decane) with the cross-linking agent dicumyl peroxide (DCP). We were able to run relatively long simulations [>20 million molecular dynamics (MD) steps] on a small test system (4660 atoms) to simulate cross-linking reactions of PE in the presence of DCP. Starting with 80 PE molecules, more than half of them cross-linked by the end of the Hybrid/Reax cycles on a single Xeon processor in under 48 h. This simulation would take approximately 1 month if run with pure ReaxFF MD on the same machine.
Collapse
Affiliation(s)
- Dundar E Yilmaz
- Department of Mechanical Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, United States
| | | | - Adri C T van Duin
- Department of Mechanical Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, United States
| |
Collapse
|
26
|
Bachmann J, Doltsinis NL. Adaptive partitioning molecular dynamics using an extended Hamiltonian approach. J Chem Phys 2021; 155:144104. [PMID: 34654314 DOI: 10.1063/5.0059206] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A recently proposed extended Hamiltonian approach to switching interaction potentials is generalized to enable adaptive partitioning molecular dynamics simulations. Switching is performed along a fictitious classical degree of freedom whose value determines the mixing ratio of the two potentials on a time scale determined by its associated mass. We propose to choose this associated fictitious mass adaptively so as to ensure a constant time scale for all switching processes. For different model systems, including a harmonic oscillator and a Lennard-Jones fluid, we investigate the window of switching time scales that guarantees the conservation of the extended Hamiltonian for a large number of switching events. The methodology is first applied in the microcanonical ensemble and then generalized to the canonical ensemble using a Nosé-Hoover chain thermostat. It is shown that the method is stable for thousands of consecutive switching events during a single simulation, with constant temperature and a conserved extended Hamiltonian. A slight modification of the original Hamiltonian is introduced to avoid accumulation of small numerical errors incurred after each switching process.
Collapse
Affiliation(s)
- Jim Bachmann
- Institut für Festkörpertheorie, Westfälische Wilhelms-Universität Münster and Center for Multiscale Theory and Computation, Wilhelm-Klemm-Str. 10, 48149 Münster, Germany
| | - Nikos L Doltsinis
- Institut für Festkörpertheorie, Westfälische Wilhelms-Universität Münster and Center for Multiscale Theory and Computation, Wilhelm-Klemm-Str. 10, 48149 Münster, Germany
| |
Collapse
|
27
|
Zlobin A, Diankin I, Pushkarev S, Golovin A. Probing the Suitability of Different Ca 2+ Parameters for Long Simulations of Diisopropyl Fluorophosphatase. Molecules 2021; 26:5839. [PMID: 34641383 PMCID: PMC8510429 DOI: 10.3390/molecules26195839] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 09/23/2021] [Accepted: 09/24/2021] [Indexed: 11/16/2022] Open
Abstract
Organophosphate hydrolases are promising as potential biotherapeutic agents to treat poisoning with pesticides or nerve gases. However, these enzymes often need to be further engineered in order to become useful in practice. One example of such enhancement is the alteration of enantioselectivity of diisopropyl fluorophosphatase (DFPase). Molecular modeling techniques offer a unique opportunity to address this task rationally by providing a physical description of the substrate-binding process. However, DFPase is a metalloenzyme, and correct modeling of metal cations is a challenging task generally coming with a tradeoff between simulation speed and accuracy. Here, we probe several molecular mechanical parameter combinations for their ability to empower long simulations needed to achieve a quantitative description of substrate binding. We demonstrate that a combination of the Amber19sb force field with the recently developed 12-6 Ca2+ models allows us to both correctly model DFPase and obtain new insights into the DFP binding process.
Collapse
Affiliation(s)
- Alexander Zlobin
- Faculty of Bioengineering, Lomonosov Moscow State University, 119234 Moscow, Russia; (I.D.); (S.P.)
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 117997 Moscow, Russia
- Sirius University of Science and Technology, 354340 Sochi, Russia
| | - Igor Diankin
- Faculty of Bioengineering, Lomonosov Moscow State University, 119234 Moscow, Russia; (I.D.); (S.P.)
- Sirius University of Science and Technology, 354340 Sochi, Russia
| | - Sergey Pushkarev
- Faculty of Bioengineering, Lomonosov Moscow State University, 119234 Moscow, Russia; (I.D.); (S.P.)
| | - Andrey Golovin
- Faculty of Bioengineering, Lomonosov Moscow State University, 119234 Moscow, Russia; (I.D.); (S.P.)
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 117997 Moscow, Russia
- Sirius University of Science and Technology, 354340 Sochi, Russia
| |
Collapse
|
28
|
Wieder M, Fass J, Chodera JD. Fitting quantum machine learning potentials to experimental free energy data: predicting tautomer ratios in solution. Chem Sci 2021; 12:11364-11381. [PMID: 34567495 PMCID: PMC8409483 DOI: 10.1039/d1sc01185e] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Accepted: 07/05/2021] [Indexed: 11/21/2022] Open
Abstract
The computation of tautomer ratios of druglike molecules is enormously important in computer-aided drug discovery, as over a quarter of all approved drugs can populate multiple tautomeric species in solution. Unfortunately, accurate calculations of aqueous tautomer ratios—the degree to which these species must be penalized in order to correctly account for tautomers in modeling binding for computer-aided drug discovery—is surprisingly difficult. While quantum chemical approaches to computing aqueous tautomer ratios using continuum solvent models and rigid-rotor harmonic-oscillator thermochemistry are currently state of the art, these methods are still surprisingly inaccurate despite their enormous computational expense. Here, we show that a major source of this inaccuracy lies in the breakdown of the standard approach to accounting for quantum chemical thermochemistry using rigid rotor harmonic oscillator (RRHO) approximations, which are frustrated by the complex conformational landscape introduced by the migration of double bonds, creation of stereocenters, and introduction of multiple conformations separated by low energetic barriers induced by migration of a single proton. Using quantum machine learning (QML) methods that allow us to compute potential energies with quantum chemical accuracy at a fraction of the cost, we show how rigorous relative alchemical free energy calculations can be used to compute tautomer ratios in vacuum free from the limitations introduced by RRHO approximations. Furthermore, since the parameters of QML methods are tunable, we show how we can train these models to correct limitations in the underlying learned quantum chemical potential energy surface using free energies, enabling these methods to learn to generalize tautomer free energies across a broader range of predictions. We show how alchemical free energies can be calculated with QML potentials to identify deficiencies in RRHO approximations for computing tautomeric free energies, and how these potentials can be learned from experiment to improve prediction accuracy.![]()
Collapse
Affiliation(s)
- Marcus Wieder
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center New York NY 10065 USA
| | - Josh Fass
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center New York NY 10065 USA .,Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Graduate School of Medical Sciences New York NY 10065 USA
| | - John D Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center New York NY 10065 USA
| |
Collapse
|
29
|
Unke O, Chmiela S, Sauceda HE, Gastegger M, Poltavsky I, Schütt KT, Tkatchenko A, Müller KR. Machine Learning Force Fields. Chem Rev 2021; 121:10142-10186. [PMID: 33705118 PMCID: PMC8391964 DOI: 10.1021/acs.chemrev.0c01111] [Citation(s) in RCA: 404] [Impact Index Per Article: 134.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Indexed: 12/27/2022]
Abstract
In recent years, the use of machine learning (ML) in computational chemistry has enabled numerous advances previously out of reach due to the computational complexity of traditional electronic-structure methods. One of the most promising applications is the construction of ML-based force fields (FFs), with the aim to narrow the gap between the accuracy of ab initio methods and the efficiency of classical FFs. The key idea is to learn the statistical relation between chemical structure and potential energy without relying on a preconceived notion of fixed chemical bonds or knowledge about the relevant interactions. Such universal ML approximations are in principle only limited by the quality and quantity of the reference data used to train them. This review gives an overview of applications of ML-FFs and the chemical insights that can be obtained from them. The core concepts underlying ML-FFs are described in detail, and a step-by-step guide for constructing and testing them from scratch is given. The text concludes with a discussion of the challenges that remain to be overcome by the next generation of ML-FFs.
Collapse
Affiliation(s)
- Oliver
T. Unke
- Machine
Learning Group, Technische Universität
Berlin, 10587 Berlin, Germany
- DFG
Cluster of Excellence “Unifying Systems in Catalysis”
(UniSysCat), Technische Universität Berlin, 10623 Berlin, Germany
| | - Stefan Chmiela
- Machine
Learning Group, Technische Universität
Berlin, 10587 Berlin, Germany
| | - Huziel E. Sauceda
- Machine
Learning Group, Technische Universität
Berlin, 10587 Berlin, Germany
- BASLEARN,
BASF-TU Joint Lab, Technische Universität
Berlin, 10587 Berlin, Germany
| | - Michael Gastegger
- Machine
Learning Group, Technische Universität
Berlin, 10587 Berlin, Germany
- DFG
Cluster of Excellence “Unifying Systems in Catalysis”
(UniSysCat), Technische Universität Berlin, 10623 Berlin, Germany
- BASLEARN,
BASF-TU Joint Lab, Technische Universität
Berlin, 10587 Berlin, Germany
| | - Igor Poltavsky
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Kristof T. Schütt
- Machine
Learning Group, Technische Universität
Berlin, 10587 Berlin, Germany
| | - Alexandre Tkatchenko
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Klaus-Robert Müller
- Machine
Learning Group, Technische Universität
Berlin, 10587 Berlin, Germany
- BIFOLD−Berlin
Institute for the Foundations of Learning and Data, Berlin, Germany
- Department
of Artificial Intelligence, Korea University, Anam-dong, Seongbuk-gu, Seoul 02841, Korea
- Max Planck
Institute for Informatics, Stuhlsatzenhausweg, 66123 Saarbrücken, Germany
- Google
Research, Brain Team, Berlin, Germany
| |
Collapse
|
30
|
Muratov EN, Amaro R, Andrade CH, Brown N, Ekins S, Fourches D, Isayev O, Kozakov D, Medina-Franco JL, Merz KM, Oprea TI, Poroikov V, Schneider G, Todd MH, Varnek A, Winkler DA, Zakharov AV, Cherkasov A, Tropsha A. A critical overview of computational approaches employed for COVID-19 drug discovery. Chem Soc Rev 2021; 50:9121-9151. [PMID: 34212944 PMCID: PMC8371861 DOI: 10.1039/d0cs01065k] [Citation(s) in RCA: 102] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2021] [Indexed: 01/18/2023]
Abstract
COVID-19 has resulted in huge numbers of infections and deaths worldwide and brought the most severe disruptions to societies and economies since the Great Depression. Massive experimental and computational research effort to understand and characterize the disease and rapidly develop diagnostics, vaccines, and drugs has emerged in response to this devastating pandemic and more than 130 000 COVID-19-related research papers have been published in peer-reviewed journals or deposited in preprint servers. Much of the research effort has focused on the discovery of novel drug candidates or repurposing of existing drugs against COVID-19, and many such projects have been either exclusively computational or computer-aided experimental studies. Herein, we provide an expert overview of the key computational methods and their applications for the discovery of COVID-19 small-molecule therapeutics that have been reported in the research literature. We further outline that, after the first year the COVID-19 pandemic, it appears that drug repurposing has not produced rapid and global solutions. However, several known drugs have been used in the clinic to cure COVID-19 patients, and a few repurposed drugs continue to be considered in clinical trials, along with several novel clinical candidates. We posit that truly impactful computational tools must deliver actionable, experimentally testable hypotheses enabling the discovery of novel drugs and drug combinations, and that open science and rapid sharing of research results are critical to accelerate the development of novel, much needed therapeutics for COVID-19.
Collapse
Affiliation(s)
- Eugene N. Muratov
- UNC Eshelman School of Pharmacy, University of North CarolinaChapel HillNCUSA
| | - Rommie Amaro
- University of California in San DiegoSan DiegoCAUSA
| | | | | | - Sean Ekins
- Collaborations PharmaceuticalsRaleighNCUSA
| | - Denis Fourches
- Department of Chemistry, North Carolina State UniversityRaleighNCUSA
| | - Olexandr Isayev
- Department of Chemistry, Carnegie Melon UniversityPittsburghPAUSA
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook UniversityStony BrookNYUSA
| | | | - Kenneth M. Merz
- Department of Chemistry, Michigan State UniversityEast LansingMIUSA
| | - Tudor I. Oprea
- Department of Internal Medicine and UNM Comprehensive Cancer Center, University of New Mexico, AlbuquerqueNMUSA
- Department of Rheumatology and Inflammation Research, Gothenburg UniversitySweden
- Novo Nordisk Foundation Center for Protein Research, University of CopenhagenDenmark
| | | | - Gisbert Schneider
- Institute of Pharmaceutical Sciences, Swiss Federal Institute of TechnologyZurichSwitzerland
| | | | - Alexandre Varnek
- Department of Chemistry, University of StrasbourgStrasbourgFrance
- Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido UniversitySapporoJapan
| | - David A. Winkler
- Monash Institute of Pharmaceutical Sciences, Monash UniversityMelbourneVICAustralia
- School of Biochemistry and Genetics, La Trobe Institute for Molecular Science, La Trobe UniversityBundooraAustralia
- School of Pharmacy, University of NottinghamNottinghamUK
| | | | - Artem Cherkasov
- Vancouver Prostate Centre, University of British ColumbiaVancouverBCCanada
| | - Alexander Tropsha
- UNC Eshelman School of Pharmacy, University of North CarolinaChapel HillNCUSA
| |
Collapse
|
31
|
Meli R, Anighoro A, Bodkin MJ, Morris GM, Biggin PC. Learning protein-ligand binding affinity with atomic environment vectors. J Cheminform 2021; 13:59. [PMID: 34391475 PMCID: PMC8364054 DOI: 10.1186/s13321-021-00536-w] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 07/21/2021] [Indexed: 12/03/2022] Open
Abstract
Scoring functions for the prediction of protein-ligand binding affinity have seen renewed interest in recent years when novel machine learning and deep learning methods started to consistently outperform classical scoring functions. Here we explore the use of atomic environment vectors (AEVs) and feed-forward neural networks, the building blocks of several neural network potentials, for the prediction of protein-ligand binding affinity. The AEV-based scoring function, which we term AEScore, is shown to perform as well or better than other state-of-the-art scoring functions on binding affinity prediction, with an RMSE of 1.22 pK units and a Pearson’s correlation coefficient of 0.83 for the CASF-2016 benchmark. However, AEScore does not perform as well in docking and virtual screening tasks, for which it has not been explicitly trained. Therefore, we show that the model can be combined with the classical scoring function AutoDock Vina in the context of \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Delta$$\end{document}Δ-learning, where corrections to the AutoDock Vina scoring function are learned instead of the protein-ligand binding affinity itself. Combined with AutoDock Vina, \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\Delta$$\end{document}Δ-AEScore has an RMSE of 1.32 pK units and a Pearson’s correlation coefficient of 0.80 on the CASF-2016 benchmark, while retaining the docking and screening power of the underlying classical scoring function.
Collapse
Affiliation(s)
- Rocco Meli
- Department of Biochemistry, University of Oxford, Oxford, UK
| | | | | | | | - Philip C Biggin
- Department of Biochemistry, University of Oxford, Oxford, UK.
| |
Collapse
|
32
|
Nigam A, Pollice R, Hurley MFD, Hickman RJ, Aldeghi M, Yoshikawa N, Chithrananda S, Voelz VA, Aspuru-Guzik A. Assigning confidence to molecular property prediction. Expert Opin Drug Discov 2021; 16:1009-1023. [DOI: 10.1080/17460441.2021.1925247] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Affiliation(s)
- AkshatKumar Nigam
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Canada
- Department of Computer Science, University of Toronto, Toronto, Canada
| | - Robert Pollice
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Canada
- Department of Computer Science, University of Toronto, Toronto, Canada
| | | | - Riley J. Hickman
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Canada
- Department of Computer Science, University of Toronto, Toronto, Canada
| | - Matteo Aldeghi
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Canada
- Department of Computer Science, University of Toronto, Toronto, Canada
- Vector Institute for Artificial Intelligence, University Ave Suite 710, Toronto, Canada
| | - Naruki Yoshikawa
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Canada
- Department of Computer Science, University of Toronto, Toronto, Canada
| | | | | | - Alán Aspuru-Guzik
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, Canada
- Department of Computer Science, University of Toronto, Toronto, Canada
- Vector Institute for Artificial Intelligence, University Ave Suite 710, Toronto, Canada
- Canadian Institute for Advanced Research (CIFAR), University Ave, Toronto, Canada
| |
Collapse
|
33
|
Zeni C, Rossi K, Glielmo A, de Gironcoli S. Compact atomic descriptors enable accurate predictions via linear models. J Chem Phys 2021; 154:224112. [PMID: 34241204 DOI: 10.1063/5.0052961] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
We probe the accuracy of linear ridge regression employing a three-body local density representation derived from the atomic cluster expansion. We benchmark the accuracy of this framework in the prediction of formation energies and atomic forces in molecules and solids. We find that such a simple regression framework performs on par with state-of-the-art machine learning methods which are, in most cases, more complex and more computationally demanding. Subsequently, we look for ways to sparsify the descriptor and further improve the computational efficiency of the method. To this aim, we use both principal component analysis and least absolute shrinkage operator regression for energy fitting on six single-element datasets. Both methods highlight the possibility of constructing a descriptor that is four times smaller than the original with a similar or even improved accuracy. Furthermore, we find that the reduced descriptors share a sizable fraction of their features across the six independent datasets, hinting at the possibility of designing material-agnostic, optimally compressed, and accurate descriptors.
Collapse
Affiliation(s)
- Claudio Zeni
- Physics Area, International School for Advanced Studies, Trieste, Italy
| | - Kevin Rossi
- Laboratory of Nanochemistry, Institute of Chemistry and Chemical Engineering, Ecole Polytechnique Fédérale de Lausanne, Lausanne, CH, Switzerland
| | - Aldo Glielmo
- Physics Area, International School for Advanced Studies, Trieste, Italy
| | | |
Collapse
|
34
|
Machine learning, artificial intelligence, and data science breaking into drug design and neglected diseases. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2021. [DOI: 10.1002/wcms.1513] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
35
|
Pant S, Smith Z, Wang Y, Tajkhorshid E, Tiwary P. Confronting pitfalls of AI-augmented molecular dynamics using statistical physics. J Chem Phys 2020; 153:234118. [PMID: 33353347 PMCID: PMC7863682 DOI: 10.1063/5.0030931] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 11/29/2020] [Indexed: 12/31/2022] Open
Abstract
Artificial intelligence (AI)-based approaches have had indubitable impact across the sciences through the ability to extract relevant information from raw data. Recently, AI has also found use in enhancing the efficiency of molecular simulations, wherein AI derived slow modes are used to accelerate the simulation in targeted ways. However, while typical fields where AI is used are characterized by a plethora of data, molecular simulations, per construction, suffer from limited sampling and thus limited data. As such, the use of AI in molecular simulations can suffer from a dangerous situation where the AI-optimization could get stuck in spurious regimes, leading to incorrect characterization of the reaction coordinate (RC) for the problem at hand. When such an incorrect RC is then used to perform additional simulations, one could start to deviate progressively from the ground truth. To deal with this problem of spurious AI-solutions, here, we report a novel and automated algorithm using ideas from statistical mechanics. It is based on the notion that a more reliable AI-solution will be one that maximizes the timescale separation between slow and fast processes. To learn this timescale separation even from limited data, we use a maximum caliber-based framework. We show the applicability of this automatic protocol for three classic benchmark problems, namely, the conformational dynamics of a model peptide, ligand-unbinding from a protein, and folding/unfolding energy landscape of the C-terminal domain of protein G. We believe that our work will lead to increased and robust use of trustworthy AI in molecular simulations of complex systems.
Collapse
Affiliation(s)
- Shashank Pant
- NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, Department of Biochemistry, Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | | | | | - Emad Tajkhorshid
- NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, Department of Biochemistry, Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | | |
Collapse
|
36
|
Gupta A, Zhou HX. Profiling SARS-CoV-2 Main Protease (M PRO) Binding to Repurposed Drugs Using Molecular Dynamics Simulations in Classical and Neural Network-Trained Force Fields. ACS COMBINATORIAL SCIENCE 2020; 22:826-832. [PMID: 33119257 PMCID: PMC7605330 DOI: 10.1021/acscombsci.0c00140] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 10/17/2020] [Indexed: 01/18/2023]
Abstract
The current COVID-19 pandemic caused by a novel coronavirus SARS-CoV-2 urgently calls for a working therapeutic. Here, we report a computation-based workflow for efficiently selecting a subset of FDA-approved drugs that can potentially bind to the SARS-CoV-2 main protease MPRO. The workflow started with docking (using Autodock Vina) each of 1615 FDA-approved drugs to the MPRO active site. This step selected 62 candidates with docking energies lower than -8.5 kcal/mol. Then, the 62 docked protein-drug complexes were subjected to 100 ns of molecular dynamics (MD) simulations in a molecular mechanics (MM) force field (CHARMM36). This step reduced the candidate pool to 26, based on the root-mean-square-deviations (RMSDs) of the drug molecules in the trajectories. Finally, we modeled the 26 drug molecules by a pseudoquantum mechanical (ANI) force field and ran 5 ns hybrid ANI/MM MD simulations of the 26 protein-drug complexes. ANI was trained by neural network models on quantum mechanical density functional theory (wB97X/6-31G(d)) data points. An RMSD cutoff winnowed down the pool to 12, and free energy analysis (MM/PBSA) produced the final selection of 9 drugs: dihydroergotamine, midostaurin, ziprasidone, etoposide, apixaban, fluorescein, tadalafil, rolapitant, and palbociclib. Of these, three are found to be active in literature reports of experimental studies. To provide physical insight into their mechanism of action, the interactions of the drug molecules with the protein are presented as 2D-interaction maps. These findings and mappings of drug-protein interactions may be potentially used to guide rational drug discovery against COVID-19.
Collapse
Affiliation(s)
- Aayush Gupta
- Department of Chemistry University of Illinois at Chicago, Chicago, IL 60607, USA
| | - Huan-Xiang Zhou
- Department of Chemistry University of Illinois at Chicago, Chicago, IL 60607, USA
- Department of Physics, University of Illinois at Chicago, Chicago, IL 60607, USA
| |
Collapse
|
37
|
Gao W, Mahajan SP, Sulam J, Gray JJ. Deep Learning in Protein Structural Modeling and Design. PATTERNS (NEW YORK, N.Y.) 2020; 1:100142. [PMID: 33336200 PMCID: PMC7733882 DOI: 10.1016/j.patter.2020.100142] [Citation(s) in RCA: 82] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Deep learning is catalyzing a scientific revolution fueled by big data, accessible toolkits, and powerful computational resources, impacting many fields, including protein structural modeling. Protein structural modeling, such as predicting structure from amino acid sequence and evolutionary information, designing proteins toward desirable functionality, or predicting properties or behavior of a protein, is critical to understand and engineer biological systems at the molecular level. In this review, we summarize the recent advances in applying deep learning techniques to tackle problems in protein structural modeling and design. We dissect the emerging approaches using deep learning techniques for protein structural modeling and discuss advances and challenges that must be addressed. We argue for the central importance of structure, following the "sequence → structure → function" paradigm. This review is directed to help both computational biologists to gain familiarity with the deep learning methods applied in protein modeling, and computer scientists to gain perspective on the biologically meaningful problems that may benefit from deep learning techniques.
Collapse
Affiliation(s)
- Wenhao Gao
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Sai Pooja Mahajan
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jeremias Sulam
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jeffrey J. Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| |
Collapse
|
38
|
Lahey SLJ, Thien Phuc TN, Rowley CN. Benchmarking Force Field and the ANI Neural Network Potentials for the Torsional Potential Energy Surface of Biaryl Drug Fragments. J Chem Inf Model 2020; 60:6258-6268. [PMID: 33263401 DOI: 10.1021/acs.jcim.0c00904] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Many drug molecules contain biaryl fragments, resulting in a torsional barrier corresponding to rotation around the bond linking the aryls. The potential energy surfaces of these torsions vary significantly because of steric and electronic effects, ultimately affecting the relative stability of the molecular conformations in the protein-bound and solution states. Simulations of protein-ligand binding require accurate computational models to represent the intramolecular interactions to provide accurate predictions of the structure and dynamics of binding. In this article, we compare four force fields [generalized AMBER force field (GAFF), open force field (OpenFF), CHARMM general force field (CGenFF), optimized potentials for liquid simulations (OPLS)] and two neural network potentials (ANI-2x and ANI-1ccx) for their ability to predict the torsional potential energy surfaces of 88 biaryls extracted from drug fragments. The root mean square deviation (rmsd) over the full potential energy surface and the mean absolute deviation of the torsion rotational barrier height (MADB) relative to high-level ab initio reference data (CCSD(T1)*) were used as the measure of accuracy. Uncertainties in these metrics due to the composition of the data set were estimated using bootstrap analysis. In comparison to high-level ab initio data, ANI-1ccx was most accurate for predicting the barrier height (rmsd: 0.5 ± 0.0 kcal/mol, MADB: 0.8 ± 0.1 kcal/mol), followed closely by ANI-2x (rmsd: 0.5 ± 0.0 kcal/mol, MADB: 1.0 ± 0.2 kcal/mol), then CGenFF (rmsd: 0.8 ± 0.1 kcal/mol, MADB: 1.3 ± 0.1 kcal/mol) and OpenFF (rmsd: 0.7 ± 0.1 kcal/mol, MADB: 1.3 ± 0.1 kcal/mol), then GAFF (rmsd: 1.2 ± 0.2 kcal/mol, MADB: 2.6 ± 0.5 kcal/mol), and finally OPLS (rmsd: 3.6 ± 0.3 kcal/mol, MADB: 3.6 ± 0.3 kcal/mol). Significantly, the neural network potentials (NNPs) are systematically more accurate and more reliable than any of the force fields. As a practical example, the NNP/molecular mechanics method was used to simulate the isomerization of ozanimod, a drug used for multiple sclerosis. Multinanosecond molecular dynamics (MD) simulations in an explicit aqueous solvent were performed, as well as umbrella sampling and adaptive biasing force-enhanced sampling techniques. The rate constant for this isomerization calculated using transition state theory was 4.30 × 10-1 ns-1, which is consistent with direct MD simulations.
Collapse
Affiliation(s)
- Shae-Lynn J Lahey
- Department of Chemistry, Memorial University of Newfoundland, St. John's, Newfoundland and Labrador A1B 3T4, Canada
| | - Tu Nguyen Thien Phuc
- Department of Chemistry, Memorial University of Newfoundland, St. John's, Newfoundland and Labrador A1B 3T4, Canada
| | - Christopher N Rowley
- Department of Chemistry, Memorial University of Newfoundland, St. John's, Newfoundland and Labrador A1B 3T4, Canada
| |
Collapse
|
39
|
Westermayr J, Marquetand P. Deep learning for UV absorption spectra with SchNarc: First steps toward transferability in chemical compound space. J Chem Phys 2020; 153:154112. [DOI: 10.1063/5.0021915] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Affiliation(s)
- J. Westermayr
- Faculty of Chemistry, Institute of Theoretical Chemistry, University of Vienna, Währinger Str. 17, 1090 Vienna, Austria
| | - P. Marquetand
- Faculty of Chemistry, Institute of Theoretical Chemistry, University of Vienna, Währinger Str. 17, 1090 Vienna, Austria
- Vienna Research Platform on Accelerating Photoreaction Discovery, University of Vienna, Währinger Str. 17, 1090 Vienna, Austria
- Faculty of Chemistry, Data Science @ Uni Vienna, University of Vienna, Währinger Str. 29, 1090 Vienna, Austria
| |
Collapse
|
40
|
Vant JW, Lahey SLJ, Jana K, Shekhar M, Sarkar D, Munk BH, Kleinekathöfer U, Mittal S, Rowley C, Singharoy A. Flexible Fitting of Small Molecules into Electron Microscopy Maps Using Molecular Dynamics Simulations with Neural Network Potentials. J Chem Inf Model 2020; 60:2591-2604. [PMID: 32207947 PMCID: PMC7311632 DOI: 10.1021/acs.jcim.9b01167] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Despite significant advances in resolution, the potential for cryo-electron microscopy (EM) to be used in determining the structures of protein-drug complexes remains unrealized. Determination of accurate structures and coordination of bound ligands necessitates simultaneous fitting of the models into the density envelopes, exhaustive sampling of the ligand geometries, and, most importantly, concomitant rearrangements in the side chains to optimize the binding energy changes. In this article, we present a flexible-fitting pipeline where molecular dynamics flexible fitting (MDFF) is used to refine structures of protein-ligand complexes from 3 to 5 Å electron density data. Enhanced sampling is employed to explore the binding pocket rearrangements. To provide a model that can accurately describe the conformational dynamics of the chemically diverse set of small-molecule drugs inside MDFF, we use QM/MM and neural-network potential (NNP)/MM models of protein-ligand complexes, where the ligand is represented using the QM or NNP model, and the protein is represented using established molecular mechanical force fields (e.g., CHARMM). This pipeline offers structures commensurate to or better than recently submitted high-resolution cryo-EM or X-ray models, even when given medium to low-resolution data as input. The use of the NNPs makes the algorithm more robust to the choice of search models, offering a radius of convergence of 6.5 Å for ligand structure determination. The quality of the predicted structures was also judged by density functional theory calculations of ligand strain energy. This strain potential energy is found to systematically decrease with better fitting to density and improved ligand coordination, indicating correct binding interactions. A computationally inexpensive protocol for computing strain energy is reported as part of the model analysis protocol that monitors both the ligand fit as well as model quality.
Collapse
Affiliation(s)
- John W. Vant
- School of Molecular Sciences, Arizona State University, Tempe, USA
| | - Shae-Lynn J. Lahey
- Department of Chemistry, Memorial University of Newfoundland, St. John’s, NL, Canada
| | - Kalyanashis Jana
- Department of Physics and Earth Sciences, Jacobs University Bremen, 28759 Bremen, Germany
| | - Mrinal Shekhar
- School of Molecular Sciences, Arizona State University, Tempe, USA
- Center for Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, USA
| | - Daipayan Sarkar
- School of Molecular Sciences, Arizona State University, Tempe, USA
| | - Barbara H. Munk
- School of Molecular Sciences, Arizona State University, Tempe, USA
| | - Ulrich Kleinekathöfer
- Department of Physics and Earth Sciences, Jacobs University Bremen, 28759 Bremen, Germany
| | - Sumit Mittal
- School of Molecular Sciences, Arizona State University, Tempe, USA
- School of Advanced Sciences and Languages, VIT Bhopal University, Bhopal, India
| | - Christopher Rowley
- Department of Chemistry, Memorial University of Newfoundland, St. John’s, NL, Canada
| | | |
Collapse
|
41
|
Cole DJ, Mones L, Csányi G. A machine learning based intramolecular potential for a flexible organic molecule. Faraday Discuss 2020; 224:247-264. [DOI: 10.1039/d0fd00028k] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Here, we employ the kernel regression machine learning technique to construct an analytical potential that reproduces the quantum mechanical potential energy surface of a small, flexible, drug-like molecule, 3-(benzyloxy)pyridin-2-amine.
Collapse
Affiliation(s)
- Daniel J. Cole
- School of Natural and Environmental Sciences
- Newcastle University
- Newcastle upon Tyne NE1 7RU
- UK
| | - Letif Mones
- Engineering Laboratory
- University of Cambridge
- Cambridge CB2 1PZ
- UK
| | - Gábor Csányi
- Engineering Laboratory
- University of Cambridge
- Cambridge CB2 1PZ
- UK
| |
Collapse
|