1
Hwang W, Austin SL, Blondel A, Boittier ED, Boresch S, Buck M, Buckner J, Caflisch A, Chang HT, Cheng X, Choi YK, Chu JW, Crowley MF, Cui Q, Damjanovic A, Deng Y, Devereux M, Ding X, Feig MF, Gao J, Glowacki DR, Gonzales JE, Hamaneh MB, Harder ED, Hayes RL, Huang J, Huang Y, Hudson PS, Im W, Islam SM, Jiang W, Jones MR, Käser S, Kearns FL, Kern NR, Klauda JB, Lazaridis T, Lee J, Lemkul JA, Liu X, Luo Y, MacKerell AD, Major DT, Meuwly M, Nam K, Nilsson L, Ovchinnikov V, Paci E, Park S, Pastor RW, Pittman AR, Post CB, Prasad S, Pu J, Qi Y, Rathinavelan T, Roe DR, Roux B, Rowley CN, Shen J, Simmonett AC, Sodt AJ, Töpfer K, Upadhyay M, van der Vaart A, Vazquez-Salazar LI, Venable RM, Warrensford LC, Woodcock HL, Wu Y, Brooks CL, Brooks BR, Karplus M. CHARMM at 45: Enhancements in Accessibility, Functionality, and Speed. J Phys Chem B 2024; 128:9976-10042. [PMID: 39303207 PMCID: PMC11492285 DOI: 10.1021/acs.jpcb.4c04100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/20/2024] [Revised: 08/15/2024] [Accepted: 08/22/2024] [Indexed: 09/22/2024]
Abstract
Since its inception nearly a half century ago, CHARMM has been playing a central role in computational biochemistry and biophysics. Commensurate with the developments in experimental research and advances in computer hardware, the range of methods and applicability of CHARMM have also grown. This review summarizes major developments that occurred after 2009 when the last review of CHARMM was published. They include the following: new faster simulation engines, accessible user interfaces for convenient workflows, and a vast array of simulation and analysis methods that encompass quantum mechanical, atomistic, and coarse-grained levels, as well as extensive coverage of force fields. In addition to providing the current snapshot of the CHARMM development, this review may serve as a starting point for exploring relevant theories and computational methods for tackling contemporary and emerging problems in biomolecular systems. CHARMM is freely available for academic and nonprofit research at https://academiccharmm.org/program.
Affiliation(s)
- Wonmuk Hwang
- Department of Biomedical Engineering, Texas A&M University, College Station, Texas 77843, United States
- Department of Materials Science and Engineering, Texas A&M University, College Station, Texas 77843, United States
- Department of Physics and Astronomy, Texas A&M University, College Station, Texas 77843, United States
- Center for AI and Natural Sciences, Korea Institute for Advanced Study, Seoul 02455, Republic of Korea
| | - Steven L. Austin
- Department of Chemistry, University of South Florida, Tampa, Florida 33620, United States
| | - Arnaud Blondel
- Institut Pasteur, Université Paris Cité, CNRS UMR3825, Structural Bioinformatics Unit, 28 rue du Dr. Roux F-75015 Paris, France
| | - Eric D. Boittier
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| | - Stefan Boresch
- Faculty of Chemistry, Department of Computational Biological Chemistry, University of Vienna, Wahringerstrasse 17, 1090 Vienna, Austria
| | - Matthias Buck
- Department of Physiology and Biophysics, Case Western Reserve University, School of Medicine, Cleveland, Ohio 44106, United States
| | - Joshua Buckner
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Amedeo Caflisch
- Department of Biochemistry, University of Zürich, CH-8057 Zürich, Switzerland
| | - Hao-Ting Chang
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 30010, Taiwan, ROC
| | - Xi Cheng
- Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
| | - Yeol Kyo Choi
- Department of Biological Sciences, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Jhih-Wei Chu
- Institute of Bioinformatics and Systems Biology, Department of Biological Science and Technology, Institute of Molecular Medicine and Bioengineering, and Center for Intelligent Drug Systems and Smart Bio-devices (IDSB), National Yang Ming Chiao Tung University, Hsinchu 30010, Taiwan, ROC
| | - Michael F. Crowley
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado 80401, United States
| | - Qiang Cui
- Department of Chemistry, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, United States
- Department of Physics, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, United States
- Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Boston, Massachusetts 02215, United States
| | - Ana Damjanovic
- Department of Biophysics, Johns Hopkins University, Baltimore, Maryland 21218, United States
- Department of Physics and Astronomy, Johns Hopkins University, Baltimore, Maryland 21218, United States
- Laboratory of Computational Biology, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Yuqing Deng
- Shanghai R&D Center, DP Technology, Ltd., Shanghai 201210, China
| | - Mike Devereux
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| | - Xinqiang Ding
- Department of Chemistry, Tufts University, Medford, Massachusetts 02155, United States
| | - Michael F. Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Jiali Gao
- School of Chemical Biology & Biotechnology, Peking University Shenzhen Graduate School, Shenzhen, Guangdong 518055, China
- Institute of Systems and Physical Biology, Shenzhen Bay Laboratory, Shenzhen, Guangdong 518055, China
- Department of Chemistry and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - David R. Glowacki
- CiTIUS Centro Singular de Investigación en Tecnoloxías Intelixentes da USC, 15705 Santiago de Compostela, Spain
| | - James E. Gonzales
- Department of Biomedical Engineering, Texas A&M University, College Station, Texas 77843, United States
- Laboratory of Computational Biology, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Mehdi Bagerhi Hamaneh
- Department of Physiology and Biophysics, Case Western Reserve University, School of Medicine, Cleveland, Ohio 44106, United States
| | | | - Ryan L. Hayes
- Department of Chemical and Biomolecular Engineering, University of California, Irvine, Irvine, California 92697, United States
- Department of Pharmaceutical Sciences, University of California, Irvine, Irvine, California 92697, United States
| | - Jing Huang
- Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, Hangzhou, Zhejiang 310024, China
| | - Yandong Huang
- College of Computer Engineering, Jimei University, Xiamen 361021, China
| | - Phillip S. Hudson
- Department of Chemistry, University of South Florida, Tampa, Florida 33620, United States
- Medicine Design, Pfizer Inc., Cambridge, Massachusetts 02139, United States
| | - Wonpil Im
- Department of Biological Sciences, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Shahidul M. Islam
- Department of Chemistry, Delaware State University, Dover, Delaware 19901, United States
| | - Wei Jiang
- Computational Science Division, Argonne National Laboratory, Argonne, Illinois 60439, United States
| | - Michael R. Jones
- Laboratory of Computational Biology, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Silvan Käser
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| | - Fiona L. Kearns
- Department of Chemistry, University of South Florida, Tampa, Florida 33620, United States
| | - Nathan R. Kern
- Department of Biological Sciences, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Jeffery B. Klauda
- Department of Chemical and Biomolecular Engineering, Institute for Physical Science and Technology, Biophysics Program, University of Maryland, College Park, Maryland 20742, United States
| | - Themis Lazaridis
- Department of Chemistry, City College of New York, New York, New York 10031, United States
| | - Jinhyuk Lee
- Disease Target Structure Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon 34141, Republic of Korea
- Department of Bioinformatics, KRIBB School of Bioscience, University of Science and Technology, Daejeon 34141, Republic of Korea
| | - Justin A. Lemkul
- Department of Biochemistry, Virginia Polytechnic Institute and State University, Blacksburg, Virginia 24061, United States
| | - Xiaorong Liu
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Yun Luo
- Department of Biotechnology and Pharmaceutical Sciences, College of Pharmacy, Western University of Health Sciences, Pomona, California 91766, United States
| | - Alexander D. MacKerell
- Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, Baltimore, Maryland 21201, United States
| | - Dan T. Major
- Department of Chemistry and Institute for Nanotechnology & Advanced Materials, Bar-Ilan University, Ramat-Gan 52900, Israel
| | - Markus Meuwly
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
- Department of Chemistry, Brown University, Providence, Rhode Island 02912, United States
| | - Kwangho Nam
- Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington, Texas 76019, United States
| | - Lennart Nilsson
- Karolinska Institutet, Department of Biosciences and Nutrition, SE-14183 Huddinge, Sweden
| | - Victor Ovchinnikov
- Harvard University, Department of Chemistry and Chemical Biology, Cambridge, Massachusetts 02138, United States
| | - Emanuele Paci
- Dipartimento di Fisica e Astronomia, Università di Bologna, Bologna 40127, Italy
| | - Soohyung Park
- Department of Biological Sciences, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Richard W. Pastor
- Laboratory of Computational Biology, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Amanda R. Pittman
- Department of Chemistry, University of South Florida, Tampa, Florida 33620, United States
| | - Carol Beth Post
- Borch Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, Indiana 47907, United States
| | - Samarjeet Prasad
- Laboratory of Computational Biology, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Jingzhi Pu
- Department of Chemistry and Chemical Biology, Indiana University Indianapolis, Indianapolis, Indiana 46202, United States
| | - Yifei Qi
- School of Pharmacy, Fudan University, Shanghai 201203, China
| | | | - Daniel R. Roe
- Laboratory of Computational Biology, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Benoit Roux
- Department of Chemistry, University of Chicago, Chicago, Illinois 60637, United States
| | | | - Jana Shen
- Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, Baltimore, Maryland 21201, United States
| | - Andrew C. Simmonett
- Laboratory of Computational Biology, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Alexander J. Sodt
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Kai Töpfer
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| | - Meenu Upadhyay
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel, Switzerland
| | - Arjan van der Vaart
- Department of Chemistry, University of South Florida, Tampa, Florida 33620, United States
| | | | - Richard M. Venable
- Laboratory of Computational Biology, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Luke C. Warrensford
- Department of Chemistry, University of South Florida, Tampa, Florida 33620, United States
| | - H. Lee Woodcock
- Department of Chemistry, University of South Florida, Tampa, Florida 33620, United States
| | - Yujin Wu
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Charles L. Brooks
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Bernard R. Brooks
- Laboratory of Computational Biology, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Martin Karplus
- Harvard University, Department of Chemistry and Chemical Biology, Cambridge, Massachusetts 02138, United States
- Laboratoire de Chimie Biophysique, ISIS, Université de Strasbourg, 67000 Strasbourg, France
2
Kalayan J, Ramzan I, Williams CD, Bryce RA, Burton NA. A neural network potential based on pairwise resolved atomic forces and energies. J Comput Chem 2024; 45:1143-1151. [PMID: 38284556 DOI: 10.1002/jcc.27313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/31/2023] [Revised: 12/23/2023] [Accepted: 01/05/2024] [Indexed: 01/30/2024]
Abstract
Molecular simulations have become a key tool in molecular and materials design. Machine learning (ML)-based potential energy functions offer the prospect of simulating complex molecular systems efficiently at quantum chemical accuracy. In previous work, we have introduced the ML-based PairF-Net approach to neural network potentials, that adopts a pairwise interatomic scheme to predicting forces within a molecular system. Here, we further develop the PairF-Net model to intrinsically incorporate energy conservation and couple the model to a molecular mechanical (MM) environment within the OpenMM package. The updated PairF-Net model yields energy and force predictions and dynamical distributions in good agreement with the rMD17 dataset of ten small organic molecules in the gas-phase. We further show that these in vacuo ML models of small molecules can be applied to force predictions in aqueous solution via hybrid ML/MM simulations. We present a new benchmark dataset for these ten molecules in solution, obtained from QM/MM simulations, which we denote as rMD17-aq (https://zenodo.org/records/10048644); and assess the ability of PairF-Net to reproduce the molecular energy, atomic forces and dynamical distributions of these solution conformations via ML/MM simulations.
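The energy-conservation requirement mentioned in this abstract can be illustrated with a minimal sketch. It is not the PairF-Net architecture: `pair_energy` is a hypothetical toy function standing in for a learned pairwise term, and all names below are assumptions made for illustration. The point shown is only that forces computed as the exact negative gradient of a single scalar energy are conservative, and that pairwise terms give a net force of zero.

```python
import math

def pair_energy(r):
    """Toy pair term standing in for a learned model (hypothetical)."""
    return (r - 1.0) ** 2

def pair_energy_deriv(r):
    """Analytical dE/dr of the toy pair term above."""
    return 2.0 * (r - 1.0)

def total_energy(coords):
    """Sum the pair term over all unique atom pairs."""
    energy = 0.0
    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            energy += pair_energy(math.dist(coords[i], coords[j]))
    return energy

def forces(coords):
    """Forces as the exact negative gradient of total_energy."""
    f = [[0.0, 0.0, 0.0] for _ in coords]
    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            r = math.dist(coords[i], coords[j])
            dedr = pair_energy_deriv(r)
            for k in range(3):
                # Component of the unit vector pointing from atom j to atom i.
                u = (coords[i][k] - coords[j][k]) / r
                f[i][k] -= dedr * u  # F_i = -dE/dr * dr/dx_i
                f[j][k] += dedr * u  # equal and opposite on atom j
    return f
```

Comparing `forces` against a central finite difference of `total_energy` is a quick consistency check for any model intended to conserve energy.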
Affiliation(s)
- Jas Kalayan
- Division of Pharmacy and Optometry, School of Health Sciences, University of Manchester, Manchester, UK
| | - Ismaeel Ramzan
- Division of Pharmacy and Optometry, School of Health Sciences, University of Manchester, Manchester, UK
- Neural Circuits and Computations Unit, RIKEN Center for Brain Science, Wako, Japan
| | - Christopher D Williams
- Division of Pharmacy and Optometry, School of Health Sciences, University of Manchester, Manchester, UK
| | - Richard A Bryce
- Division of Pharmacy and Optometry, School of Health Sciences, University of Manchester, Manchester, UK
| | - Neil A Burton
- Department of Chemistry, University of Manchester, Manchester, UK
3
Kumar A, MacKerell AD. FFParam-v2.0: A Comprehensive Tool for CHARMM Additive and Drude Polarizable Force-Field Parameter Optimization and Validation. J Phys Chem B 2024; 128:4385-4395. [PMID: 38690986 PMCID: PMC11260432 DOI: 10.1021/acs.jpcb.4c01314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/03/2024]
Abstract
Developing production quality CHARMM force-field (FF) parameters is a very detailed process involving a variety of calculations, many of which are specific for the molecule of interest. The first version of FFParam was developed as a standalone Python package designed for the optimization of electrostatic and bonded parameters of the CHARMM additive and polarizable Drude FFs by using quantum mechanical (QM) target data. The new version of FFParam has multiple new capabilities for FF parameter optimization and validation, with an emphasis on the ability to use condensed-phase target data in optimization. FFParam-v2 allows optimization of Lennard-Jones (LJ) parameters using potential energy scans of interactions between selected atoms in a molecule and noble gases, viz., He and Ne, and through condensed-phase calculations, from which experimental observables such as heats of vaporization and free energies of solvation may be obtained. This functionality serves as a gold standard for both optimizing parameters and validating the performance of the final parameters. A new bonded parameter optimization algorithm has been introduced to account for simultaneously optimizing multiple molecules sharing parameters. FFParam-v2 also supports the comparison of normal modes and the potential energy distribution of internal coordinates towards each normal mode obtained from QM and molecular mechanics calculations. Such comparison capability is vital to validate the balance among various bonded parameters that contribute to the complex normal modes of molecules. User interaction has been extended beyond the original graphical user interface to include command-line interface capabilities that allow for integration of FFParam in workflows, thereby facilitating the automation of parameter optimization. With these new functionalities, FFParam is a more comprehensive parameter optimization tool for both beginners and advanced users.
Affiliation(s)
- Anmol Kumar
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, Baltimore, MD 21201, USA
| | - Alexander D. MacKerell
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, Baltimore, MD 21201, USA
4
Demir Gİ, Tekin A. NICE-FF: A non-empirical, intermolecular, consistent, and extensible force field for nucleic acids and beyond. J Chem Phys 2023; 159:244117. [PMID: 38153156 DOI: 10.1063/5.0176641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/15/2023] [Accepted: 12/04/2023] [Indexed: 12/29/2023]
Abstract
A new non-empirical ab initio intermolecular force field (NICE-FF in buffered 14-7 potential form) has been developed for nucleic acids and beyond based on the dimer interaction energies (IEs) calculated at the spin component scaled-MI-second order Møller-Plesset perturbation theory. A fully automatic framework has been implemented for this purpose, capable of generating well-polished computational grids, performing the necessary ab initio calculations, conducting machine learning (ML) assisted force field (FF) parametrization, and extending existing FF parameters by incorporating new atom types. For the ML-assisted parametrization of NICE-FF, interaction energies of ∼18 000 dimer geometries (with IE < 0) were used, and the best fit gave a mean square deviation of about 0.46 kcal/mol. During this parametrization, atom types apparent in four deoxyribonucleic acid (DNA) bases have been first trained using the generated DNA base datasets. Both uracil and hypoxanthine, which contain the same atom types found in DNA bases, have been considered as test molecules. Three new atom types have been added to the DNA atom types by using IE datasets of both pyrazinamide and 9-methylhypoxanthine. Finally, the last test molecule, theophylline, has been selected, which contains already-fitted atom-type parameters. The performance of NICE-FF has been investigated on the S22 dataset, and it has been found that NICE-FF outperforms the well-known FFs by generating the most consistent IEs with the high-level ab initio ones. Moreover, NICE-FF has been integrated into our in-house developed crystal structure prediction (CSP) tool [called FFCASP (Fast and Flexible CrystAl Structure Predictor)], aiming to find the experimental crystal structures of all considered molecules. 
CSPs, which were performed up to 4 formula units (Z), resulted in NICE-FF being able to locate almost all the known experimental crystal structures with sufficiently low RMSD20 values to provide good starting points for density functional theory optimizations.
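For readers unfamiliar with the buffered 14-7 form named in this abstract, the sketch below evaluates the general functional form as introduced by Halgren. The buffering constants `delta=0.07` and `gamma=0.12` are Halgren's standard values and are an assumption here; the abstract does not state which constants, combination rules, or parameters NICE-FF uses, and the function name is hypothetical.

```python
def buffered_14_7(r, r_star, eps, delta=0.07, gamma=0.12):
    """Buffered 14-7 pair energy.

    r      : interatomic distance
    r_star : distance at the energy minimum
    eps    : well depth (energy at r == r_star is -eps)
    delta, gamma : buffering constants (Halgren's defaults, assumed here)
    """
    rho = r / r_star
    # Buffered prefactor: stays finite as r -> 0, unlike a 12-6 form.
    prefactor = ((1.0 + delta) / (rho + delta)) ** 7
    # Bracket combining the repulsive and attractive contributions.
    bracket = (1.0 + gamma) / (rho ** 7 + gamma) - 2.0
    return eps * prefactor * bracket
```

With these constants the energy equals `-eps` at `r == r_star`, remains finite at `r = 0`, and decays to zero from below at large separation.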
Affiliation(s)
- Gözde İniş Demir
- Informatics Institute, Istanbul Technical University, 34469 Maslak, Istanbul, Türkiye
| | - Adem Tekin
- Informatics Institute, Istanbul Technical University, 34469 Maslak, Istanbul, Türkiye
- Research Institute for Fundamental Sciences (TÜBİTAK-TBAE), Kocaeli, Türkiye
5
Ge Y, Wang X, Zhu Q, Yang Y, Dong H, Ma J. Machine Learning-Guided Adaptive Parametrization for Coupling Terms in a Mixed United-Atom/Coarse-Grained Model for Diphenylalanine Self-Assembly in Aqueous Ionic Liquids. J Chem Theory Comput 2023; 19:6718-6732. [PMID: 37725682 DOI: 10.1021/acs.jctc.3c00809] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Indexed: 09/21/2023]
Abstract
Precise regulation of the peptide self-assembly into ordered nanostructures with intriguing properties has attracted intense attention. However, predicting peptide assembly at atomic resolution is a challenge due to both the structural flexibility of peptides and the associated huge computational costs. A machine learning-guided adaptive parametrization method was proposed for developing a mixed atomic and coarse-grained (CG) model through a multiobjective optimization strategy. Our model incorporates the united-atom (UA) model for diphenylalanine (P) and the polarizable electrostatic-variable coarse-grained (VaCG) model for aqueous ionic liquid [BMIM]+[BF4]- solution. In this mixed model, the coupling van der Waals (vdW) interaction is addressed by introducing virtual sites (VS) in the UA model to interact with solvent CG beads. The coupling parameters, including the electrostatic parameter and vdW parameters, are automatically optimized through ML-guided adaptive parametrization. The performance of this model was tested by some microstructural properties, e.g., the average number of P-P intermolecular hydrogen bonds (HBs) and radius distribution functions (RDFs) between P and different fragments of IL, in comparison with all-atom (AA) simulations. The computational cost is significantly reduced using such a parametrization scheme, which could search tens of thousands of force-field parameter sets, while needing only a small fraction of them to be assessed with molecular dynamics (MD) simulations. We used such a mixed resolution model to investigate the self-assembly in IL-water mixtures with variants of IL concentration (X). The long-range-ordered fibril structure is formed in a pure water system (X = 0). 
With an increase of IL concentrations, the formation of an ordered self-assembly nanostructure is prohibited, instead forming branched fibril at X = 2 mol % or amorphous aggregates when X > 10 mol %, resulting from the interplay between π-stacking and HB interactions between P and IL. The qualitative agreement between the simulated structures and the observed morphologies in experiments indicates the applicability of ML-guided parametrization strategy in the study of complex systems, such as polymers, lipid bilayers, and polysaccharides.
Affiliation(s)
- Yang Ge
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, China
| | - Xueping Wang
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, China
| | - Qiang Zhu
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, China
| | - Yuqin Yang
- Kuang Yaming Honors School, Nanjing University, Nanjing 210023, China
| | - Hao Dong
- Kuang Yaming Honors School, Nanjing University, Nanjing 210023, China
- State Key Laboratory of Analytical Chemistry for Life Science, Nanjing University, Nanjing 210023, China
- Institute for Brain Sciences, Nanjing University, Nanjing 210023, China
| | - Jing Ma
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, China
6
Conflitti P, Raniolo S, Limongelli V. Perspectives on Ligand/Protein Binding Kinetics Simulations: Force Fields, Machine Learning, Sampling, and User-Friendliness. J Chem Theory Comput 2023; 19:6047-6061. [PMID: 37656199 PMCID: PMC10536999 DOI: 10.1021/acs.jctc.3c00641] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 06/14/2023] [Indexed: 09/02/2023]
Abstract
Computational techniques applied to drug discovery have gained considerable popularity for their ability to filter potentially active drugs from inactive ones, reducing the time scale and costs of preclinical investigations. The main focus of these studies has historically been the search for compounds endowed with high affinity for a specific molecular target to ensure the formation of stable and long-lasting complexes. Recent evidence has also correlated the in vivo drug efficacy with its binding kinetics, thus opening new fascinating scenarios for ligand/protein binding kinetic simulations in drug discovery. The present article examines the state of the art in the field, providing a brief summary of the most popular and advanced ligand/protein binding kinetics techniques and evaluating their current limitations and the potential solutions to reach more accurate kinetic models. Particular emphasis is put on the need for a paradigm change in the present methodologies toward ligand and protein parametrization, the force field problem, characterization of the transition states, the sampling issue, and algorithms' performance, user-friendliness, and data openness.
Affiliation(s)
- Paolo Conflitti
- Faculty of Biomedical Sciences, Euler Institute, Università della Svizzera italiana (USI), 6900 Lugano, Switzerland
| | - Stefano Raniolo
- Faculty of Biomedical Sciences, Euler Institute, Università della Svizzera italiana (USI), 6900 Lugano, Switzerland
| | - Vittorio Limongelli
- Faculty of Biomedical Sciences, Euler Institute, Università della Svizzera italiana (USI), 6900 Lugano, Switzerland
- Department of Pharmacy, University of Naples “Federico II”, 80131 Naples, Italy
7
Yu Y, Venable RM, Thirman J, Chatterjee P, Kumar A, Pastor RW, Roux B, MacKerell AD, Klauda JB. Drude Polarizable Lipid Force Field with Explicit Treatment of Long-Range Dispersion: Parametrization and Validation for Saturated and Monounsaturated Zwitterionic Lipids. J Chem Theory Comput 2023; 19:2590-2605. [PMID: 37071552 PMCID: PMC10404126 DOI: 10.1021/acs.jctc.3c00203] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Indexed: 04/19/2023]
Abstract
Accurate empirical force fields of lipid molecules are a critical component of molecular dynamics simulation studies aimed at investigating properties of monolayers, bilayers, micelles, vesicles, and liposomes, as well as heterogeneous systems, such as protein-membrane complexes, bacterial cell walls, and more. While the majority of lipid force field-based simulations have been performed using pairwise-additive nonpolarizable models, advances have been made in the development of the polarizable force field based on the classical Drude oscillator model. In the present study, we undertake further optimization of the Drude lipid force field, termed Drude2023, including improved treatment of the phosphate and glycerol linker region of PC and PE headgroups, additional optimization of the alkene group in monounsaturated lipids, and inclusion of long-range Lennard-Jones interactions using the particle-mesh Ewald method. Initial optimization targeted quantum mechanical (QM) data on small model compounds representative of the linker region. Subsequent optimization targeted QM data on larger model compounds, experimental data, and dihedral potentials of mean force from the CHARMM36 additive lipid force field using a parameter reweighting protocol. The use of both experimental and QM target data during the reweighting protocol is shown to produce physically reasonable parameters that reproduce a collection of experimental observables. Target data for optimization included surface area/lipid for DPPC, DSPC, DMPC, and DLPC bilayers and nuclear magnetic resonance (NMR) order parameters for DPPC bilayers. Validation data include prediction of membrane thickness, scattering form factors, electrostatic potential profiles, compressibility moduli, surface area per lipid, water permeability, NMR T1 relaxation times, diffusion constants, and monolayer surface tensions for a variety of saturated and unsaturated lipid mono- and bilayers. 
Overall, the agreement with experimental data is quite good, though the results are less satisfactory for the NMR T1 relaxation times for carbons near the ester groups. Notable improvements compared to the additive C36 force field were obtained for membrane dipole potentials, lipid diffusion coefficients, and water permeability with the exception of monounsaturated lipid bilayers. It is anticipated that the optimized polarizable Drude2023 force field will help generate more accurate molecular simulations of pure bilayers and heterogeneous systems containing membranes, advancing our understanding of the role of electronic polarization in these systems.
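The classical Drude oscillator model referred to in this abstract attaches a charged auxiliary particle to its parent atom by a harmonic spring, so that an applied field displaces the particle and induces a dipole. The sketch below works through that relation under stated assumptions: the function names are hypothetical, the numbers in the usage note are illustrative rather than Drude2023 parameters, and the conversion constant 332.0637 kcal·Å/(mol·e²) is a commonly quoted value assumed here (simulation packages differ in the last digits).

```python
def induced_dipole(q_drude, k_drude, field):
    """Dipole (e*Angstrom) induced on a Drude oscillator by a uniform field.

    q_drude : charge on the Drude particle (e)
    k_drude : spring constant (kcal/mol/Angstrom^2), energy = 0.5 * k * d^2
    field   : field expressed as force per unit charge (kcal/mol/Angstrom/e)
    """
    # Force balance: k * d = q * field, so the particle shifts by d.
    displacement = q_drude * field / k_drude
    return q_drude * displacement

def drude_polarizability(q_drude, k_drude, coulomb_const=332.0637):
    """Isotropic polarizability in Angstrom^3: alpha = C * q^2 / k.

    coulomb_const converts e^2/(kcal/mol/Angstrom^2) to a volume
    (assumed value, see lead-in).
    """
    return coulomb_const * q_drude ** 2 / k_drude
```

For example, a Drude charge of -1.0 e on a 1000 kcal/mol/Å² spring gives a polarizability of about 0.33 Å³ in this sketch; because the polarizability depends on the square of the charge, the sign of the Drude charge does not matter.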
Affiliation(s)
- Yalun Yu
  - Biophysics Graduate Program, University of Maryland, College Park, Maryland 20742, United States
  - Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
- Richard M Venable
  - Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
- Jonathan Thirman
  - Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637, United States
- Payal Chatterjee
  - Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, Baltimore, Maryland 21201, United States
- Anmol Kumar
  - Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, Baltimore, Maryland 21201, United States
- Richard W Pastor
  - Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, United States
- Benoît Roux
  - Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637, United States
- Alexander D MacKerell
  - Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, Baltimore, Maryland 21201, United States
- Jeffery B Klauda
  - Biophysics Graduate Program, University of Maryland, College Park, Maryland 20742, United States
  - Department of Chemical and Biomolecular Engineering, University of Maryland, College Park, Maryland 20742, United States
|
8
|
Ricci E, Vergadou N. Integrating Machine Learning in the Coarse-Grained Molecular Simulation of Polymers. J Phys Chem B 2023; 127:2302-2322. [PMID: 36888553 DOI: 10.1021/acs.jpcb.2c06354] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Indexed: 03/09/2023]
Abstract
Machine learning (ML) is having an increasing impact on the physical sciences, engineering, and technology, and its integration into molecular simulation frameworks holds great potential to expand their scope of applicability to complex materials and facilitate fundamental knowledge and reliable property predictions, contributing to the development of efficient materials design routes. The application of ML in materials informatics in general, and polymer informatics in particular, has led to interesting results; however, great untapped potential lies in the integration of ML techniques into the multiscale molecular simulation methods for the study of macromolecular systems, specifically in the context of coarse-grained (CG) simulations. In this Perspective, we aim to present the pioneering recent research efforts in this direction and to discuss how these new ML-based techniques can contribute to critical aspects of the development of multiscale molecular simulation methods for bulk complex chemical systems, especially polymers. Prerequisites for the implementation of such ML-integrated methods and open challenges that need to be met toward the development of general systematic ML-based coarse-graining schemes for polymers are discussed.
Affiliation(s)
- Eleonora Ricci
  - Institute of Nanoscience and Nanotechnology, National Center for Scientific Research "Demokritos", GR-15341 Agia Paraskevi, Athens, Greece
  - Institute of Informatics and Telecommunications, National Center for Scientific Research "Demokritos", GR-15341 Agia Paraskevi, Athens, Greece
- Niki Vergadou
  - Institute of Nanoscience and Nanotechnology, National Center for Scientific Research "Demokritos", GR-15341 Agia Paraskevi, Athens, Greece
|
9
|
Ding Y, Yu K, Huang J. Data science techniques in biomolecular force field development. Curr Opin Struct Biol 2023; 78:102502. [PMID: 36462448 DOI: 10.1016/j.sbi.2022.102502] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 08/03/2022] [Revised: 10/18/2022] [Accepted: 10/25/2022] [Indexed: 12/03/2022]
Abstract
Recent advances in data science are impacting the development of classical force fields. Here we review some ideas and techniques from data science that have been used in force field development, including database construction, atom typing, and machine learning potentials. We highlight how new tools such as active learning and automatic differentiation are facilitating the generation of target data and the direct fitting with macroscopic observables. Philosophical changes on how force field models should be built and used are also discussed. It's inspiring that more accurate biomolecular force fields can be developed with the aid of data science techniques.
Affiliation(s)
- Ye Ding
  - Westlake AI Therapeutics Lab, Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang, 310024, China
  - Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, Hangzhou, Zhejiang, 310024, China
- Kuang Yu
  - Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, Guangdong, 518055, China
- Jing Huang
  - Westlake AI Therapeutics Lab, Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang, 310024, China
  - Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, Hangzhou, Zhejiang, 310024, China
|
10
|
Thürlemann M, Böselt L, Riniker S. Regularized by Physics: Graph Neural Network Parametrized Potentials for the Description of Intermolecular Interactions. J Chem Theory Comput 2023; 19:562-579. [PMID: 36633918 PMCID: PMC9878731 DOI: 10.1021/acs.jctc.2c00661] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 06/24/2022] [Indexed: 01/13/2023]
Abstract
Simulations of molecular systems using electronic structure methods are still not feasible for many systems of biological importance. As a result, empirical methods such as force fields (FF) have become an established tool for the simulation of large and complex molecular systems. The parametrization of FF is, however, time-consuming and has traditionally been based on experimental data. Recent years have therefore seen increasing efforts to automatize FF parametrization or to replace FF with machine-learning (ML) based potentials. Here, we propose an alternative strategy to parametrize FF, which makes use of ML and gradient-descent based optimization while retaining a functional form founded in physics. Using a predefined functional form is shown to enable interpretability, robustness, and efficient simulations of large systems over long time scales. To demonstrate the strength of the proposed method, a fixed-charge and a polarizable model are trained on ab initio potential-energy surfaces. Given only information about the constituting elements, the molecular topology, and reference potential energies, the models successfully learn to assign atom types and corresponding FF parameters from scratch. The resulting models and parameters are validated on a wide range of experimentally and computationally derived properties of systems including dimers, pure liquids, and molecular crystals.
Affiliation(s)
- Moritz Thürlemann
  - Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
- Lennard Böselt
  - Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
- Sereina Riniker
  - Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
|
11
|
Wang Y, Fass J, Kaminow B, Herr JE, Rufa D, Zhang I, Pulido I, Henry M, Bruce Macdonald HE, Takaba K, Chodera JD. End-to-end differentiable construction of molecular mechanics force fields. Chem Sci 2022; 13:12016-12033. [PMID: 36349096 PMCID: PMC9600499 DOI: 10.1039/d2sc02739a] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Received: 05/16/2022] [Accepted: 09/05/2022] [Indexed: 01/07/2023] Open
Abstract
Molecular mechanics (MM) potentials have long been a workhorse of computational chemistry. Leveraging accuracy and speed, these functional forms find use in a wide variety of applications in biomolecular modeling and drug discovery, from rapid virtual screening to detailed free energy calculations. Traditionally, MM potentials have relied on human-curated, inflexible, and poorly extensible discrete chemical perception rules (atom types) for applying parameters to small molecules or biopolymers, making it difficult to optimize both types and parameters to fit quantum chemical or physical property data. Here, we propose an alternative approach that uses graph neural networks to perceive chemical environments, producing continuous atom embeddings from which valence and nonbonded parameters can be predicted using invariance-preserving layers. Since all stages are built from smooth neural functions, the entire process, spanning chemical perception to parameter assignment, is modular and end-to-end differentiable with respect to model parameters, allowing new force fields to be easily constructed, extended, and applied to arbitrary molecules. We show that this approach is not only sufficiently expressive to reproduce legacy atom types, but that it can learn to accurately reproduce and extend existing molecular mechanics force fields. Trained with arbitrary loss functions, it can construct entirely new force fields self-consistently applicable to both biopolymers and small molecules directly from quantum chemical calculations, with fidelity superior to that of traditional atom or parameter typing schemes. When adapted to simultaneously fit partial charge models, espaloma delivers high-quality partial atomic charges orders of magnitude faster than current best practices with low inaccuracy.
When trained on the same quantum chemical small molecule dataset used to parameterize the Open Force Field ("Parsley") openff-1.2.0 small molecule force field augmented with a peptide dataset, the resulting espaloma model shows superior accuracy vis-à-vis experiments in computing relative alchemical free energy calculations for a popular benchmark. This approach is implemented in the free and open source package espaloma, available at https://github.com/choderalab/espaloma.
Affiliation(s)
- Yuanqing Wang
  - Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
  - Physiology, Biophysics and System Biology PhD Program, Weill Cornell Medical College, Cornell University, New York, NY 10065, USA
  - MFA Program in Creative Writing, Division of Humanities and Arts, City College of New York, City University of New York, New York, NY 10031, USA
- Josh Fass
  - Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
  - Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Medical College, Cornell University, New York, NY 10065, USA
- Benjamin Kaminow
  - Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
  - Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Medical College, Cornell University, New York, NY 10065, USA
- John E. Herr
  - Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
- Dominic Rufa
  - Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
  - Tri-Institutional PhD Program in Chemical Biology, Weill Cornell Medical College, Cornell University, New York, NY 10065, USA
- Ivy Zhang
  - Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
  - Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Medical College, Cornell University, New York, NY 10065, USA
- Iván Pulido
  - Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
- Mike Henry
  - Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
- Hannah E. Bruce Macdonald
  - Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
- Kenichiro Takaba
  - Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
  - Pharmaceutical Research Center, Advanced Drug Discovery, Asahi Kasei Pharma Corporation, Shizuoka 410-2321, Japan
- John D. Chodera
  - Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
|
12
|
Theoretical studies of metal-organic frameworks: Calculation methods and applications in catalysis, gas separation, and energy storage. Coord Chem Rev 2022. [DOI: 10.1016/j.ccr.2022.214670] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 11/24/2022]
|
13
|
Rieger M, Zacharias M. Nearest-Neighbor dsDNA Stability Analysis Using Alchemical Free-Energy Simulations. J Phys Chem B 2022; 126:3640-3647. [PMID: 35549273 DOI: 10.1021/acs.jpcb.2c01138] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/28/2022]
Abstract
The thermodynamic stability of double-stranded (ds)DNA depends on its sequence. It is influenced by the base pairing and stacking with neighboring bases along DNA molecules. Semiempirical schemes are available that allow us to predict the thermodynamic stability of DNA sequences based on empirically derived nearest-neighbor contributions of base pairs formed in the context of all possible nearest-neighbor base pairs. Current molecular dynamics (MD) simulations allow one to simulate the dynamics of DNA molecules in good agreement with experimentally obtained structures and available data on conformational flexibility. However, the suitability of current force field methods to reproduce dsDNA stability and its sequence dependence has been much less well tested. We have employed alchemical free-energy simulations of whole base pair transversions in dsDNA and in unbound single-stranded partner molecules. Such transversions change the sequence context but not the nucleotide content or base pairing in dsDNA and allow a direct comparison with the empirical nearest-neighbor dsDNA stability model. For the alchemical free-energy changes in the unbound single-stranded (ss)DNA partner molecules, we tested different setups assuming either complete unstacking or unrestrained simulations with partial stacking in the unbound ssDNA. The free-energy simulations predicted nearest-neighbor effects of a magnitude similar to that observed experimentally but showed overall limited correlation with experimental data. An inaccurate description of stacking interactions and other possible reasons such as the neglect of electronic polarization effects are discussed. The results indicate the need to improve the realistic description of stacking interactions in current molecular mechanics force fields.
Affiliation(s)
- Manuel Rieger
  - Physics Department and Center of Protein Assemblies, Technical University of Munich, 85748 Garching, Germany
- Martin Zacharias
  - Physics Department and Center of Protein Assemblies, Technical University of Munich, 85748 Garching, Germany
|
14
|
Challenges and frontiers of computational modelling of biomolecular recognition. QRB DISCOVERY 2022. [DOI: 10.1017/qrd.2022.11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/07/2022] Open
Abstract
Biomolecular recognition including binding of small molecules, peptides and proteins to their target receptors plays a key role in cellular function and has been targeted for therapeutic drug design. However, the high flexibility of biomolecules and slow binding and dissociation processes have presented challenges for computational modelling. Here, we review the challenges and computational approaches developed to characterise biomolecular binding, including molecular docking, molecular dynamics simulations (especially enhanced sampling) and machine learning. Further improvements are still needed in order to accurately and efficiently characterise binding structures, mechanisms, thermodynamics and kinetics of biomolecules in the future.
|