1
|
Han B, Yu K. Refining potential energy surface through dynamical properties via differentiable molecular simulation. Nat Commun 2025; 16:816. [PMID: 39827185 PMCID: PMC11742923 DOI: 10.1038/s41467-025-56061-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 01/08/2025] [Indexed: 01/22/2025] Open
Abstract
Recently, machine learning potential (MLP) largely enhances the reliability of molecular dynamics, but its accuracy is limited by the underlying ab initio methods. A viable approach to overcome this limitation is to refine the potential by learning from experimental data, which now can be done efficiently using modern automatic differentiation technique. However, potential refinement is mostly performed using thermodynamic properties, leaving the most accessible and informative dynamical data (like spectroscopy) unexploited. In this work, through a comprehensive application of adjoint and gradient truncation methods, we show that both memory and gradient explosion issues can be circumvented in many situations, so the dynamical property differentiation is well-behaved. Consequently, both transport coefficients and spectroscopic data can be used to improve the density functional theory based MLP towards higher accuracy. Essentially, this work contributes to the solution of the inverse problem of spectroscopy by extracting microscopic interactions from vibrational spectroscopic data.
Collapse
Affiliation(s)
- Bin Han
- Institute of Materials Research, Tsinghua Shenzhen International Graduate School (TSIGS), Shenzhen, PR China
| | - Kuang Yu
- Institute of Materials Research, Tsinghua Shenzhen International Graduate School (TSIGS), Shenzhen, PR China.
| |
Collapse
|
2
|
Mlýnský V, Kührová P, Pykal M, Krepl M, Stadlbauer P, Otyepka M, Banáš P, Šponer J. Can We Ever Develop an Ideal RNA Force Field? Lessons Learned from Simulations of the UUCG RNA Tetraloop and Other Systems. J Chem Theory Comput 2025. [PMID: 39813107 DOI: 10.1021/acs.jctc.4c01357] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2025]
Abstract
Molecular dynamics (MD) simulations are an important and well-established tool for investigating RNA structural dynamics, but their accuracy relies heavily on the quality of the employed force field (ff). In this work, we present a comprehensive evaluation of widely used pair-additive and polarizable RNA ffs using the challenging UUCG tetraloop (TL) benchmark system. Extensive standard MD simulations, initiated from the NMR structure of the 14-mer UUCG TL, revealed that most ffs did not maintain the native state, instead favoring alternative loop conformations. Notably, three very recent variants of pair-additive ffs, OL3CP-gHBfix21, DES-Amber, and OL3R2.7, successfully preserved the native structure over a 10 × 20 μs time scale. To further assess these ffs, we performed enhanced sampling folding simulations of the shorter 8-mer UUCG TL, starting from the single-stranded conformation. Estimated folding free energies (ΔG°fold) varied significantly among these three ffs, with values of 0.0 ± 0.6, 2.4 ± 0.8, and 7.4 ± 0.2 kcal/mol for OL3CP-gHBfix21, DES-Amber, and OL3R2.7, respectively. The ΔG°fold value predicted by the OL3CP-gHBfix21 ff was closest to experimental estimates, ranging from -1.6 to -0.7 kcal/mol. In contrast, the higher ΔG°fold values obtained using DES-Amber and OL3R2.7 were unexpected, suggesting that key interactions are inaccurately described in the folded, unfolded, or misfolded ensembles. These discrepancies led us to further test DES-Amber and OL3R2.7 ffs on additional RNA and DNA systems, where further performance issues were observed. Our results emphasize the complexity of accurately modeling RNA dynamics and suggest that creating an RNA ff capable of reliably performing across a wide range of RNA systems remains extremely challenging. In conclusion, our study provides valuable insights into the capabilities of current RNA ffs and highlights key areas for future ff development.
Collapse
Affiliation(s)
- Vojtěch Mlýnský
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 00 Brno, Czech Republic
| | - Petra Kührová
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 00 Brno, Czech Republic
- Regional Center of Advanced Technologies and Materials, The Czech Advanced Technology and Research Institute (CATRIN), Palacký University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | - Martin Pykal
- Regional Center of Advanced Technologies and Materials, The Czech Advanced Technology and Research Institute (CATRIN), Palacký University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | - Miroslav Krepl
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 00 Brno, Czech Republic
| | - Petr Stadlbauer
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 00 Brno, Czech Republic
| | - Michal Otyepka
- Regional Center of Advanced Technologies and Materials, The Czech Advanced Technology and Research Institute (CATRIN), Palacký University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
- IT4Innovations, VSB-Technical University of Ostrava, 17. listopadu 2172/15, 708 00 Ostrava-Poruba, Czech Republic
| | - Pavel Banáš
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 00 Brno, Czech Republic
- Regional Center of Advanced Technologies and Materials, The Czech Advanced Technology and Research Institute (CATRIN), Palacký University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
- IT4Innovations, VSB-Technical University of Ostrava, 17. listopadu 2172/15, 708 00 Ostrava-Poruba, Czech Republic
| | - Jiří Šponer
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 00 Brno, Czech Republic
| |
Collapse
|
3
|
Röcken S, Burnet AF, Zavadlav J. Predicting solvation free energies with an implicit solvent machine learning potential. J Chem Phys 2024; 161:234101. [PMID: 39679504 DOI: 10.1063/5.0235189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Accepted: 11/29/2024] [Indexed: 12/17/2024] Open
Abstract
Machine learning (ML) potentials are a powerful tool in molecular modeling, enabling ab initio accuracy for comparably small computational costs. Nevertheless, all-atom simulations employing best-performing graph neural network architectures are still too expensive for applications requiring extensive sampling, such as free energy computations. Implicit solvent models could provide the necessary speed-up due to reduced degrees of freedom and faster dynamics. Here, we introduce a Solvation Free Energy Path Reweighting (ReSolv) framework to parameterize an implicit solvent ML potential for small organic molecules that accurately predicts the hydration free energy, an essential parameter in drug design and pollutant modeling. Learning on a combination of experimental hydration free energy data and ab initio data of molecules in vacuum, ReSolv bypasses the need for intractable ab initio data of molecules in an explicit bulk solvent and does not have to resort to less accurate data-generating models. On the FreeSolv dataset, ReSolv achieves a mean absolute error close to average experimental uncertainty, significantly outperforming standard explicit solvent force fields. Compared to the explicit solvent ML potential, ReSolv offers a computational speedup of four orders of magnitude and attains closer agreement with experiments. The presented framework paves the way for deep molecular models that are more accurate yet computationally more cost-effective than classical atomistic models.
Collapse
Affiliation(s)
- Sebastien Röcken
- Multiscale Modeling of Fluid Materials, Department of Engineering Physics and Computation, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany
| | - Anton F Burnet
- Multiscale Modeling of Fluid Materials, Department of Engineering Physics and Computation, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany
| | - Julija Zavadlav
- Multiscale Modeling of Fluid Materials, Department of Engineering Physics and Computation, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany
| |
Collapse
|
4
|
Piomponi V, Krepl M, Sponer J, Bussi G. Molecular Simulations to Investigate the Impact of N6-Methylation in RNA Recognition: Improving Accuracy and Precision of Binding Free Energy Prediction. J Phys Chem B 2024; 128:8896-8907. [PMID: 39240243 DOI: 10.1021/acs.jpcb.4c03397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/07/2024]
Abstract
N6-Methyladenosine (m6A) is a prevalent RNA post-transcriptional modification that plays crucial roles in RNA stability, structural dynamics, and interactions with proteins. The YT521-B (YTH) family of proteins, which are notable m6A readers, functions through its highly conserved YTH domain. Recent structural investigations and molecular dynamics (MD) simulations have shed light on the mechanism of recognition of m6A by the YTHDC1 protein. Despite advancements, using MD to predict the stabilization induced by m6A on the free energy of binding between RNA and YTH proteins remains challenging due to inaccuracy of the employed force field and limited sampling. For instance, simulations often fail to sufficiently capture the hydration dynamics of the binding pocket. This study addresses these challenges through an innovative methodology that integrates metadynamics, alchemical simulations, and force-field refinement. Importantly, our research identifies hydration of the binding pocket as giving only a minor contribution to the binding free energy and emphasizes the critical importance of precisely tuning force-field parameters to experimental data. By employing a fitting strategy built on alchemical calculations, we refine the m6A partial charge parameters, thereby enabling the simultaneous reproduction of N6 methylation on both the protein binding free energy and the thermodynamic stability of nine RNA duplexes. Our findings underscore the sensitivity of binding free energies to partial charges, highlighting the necessity for thorough parametrization and validation against experimental observations across a range of structural contexts.
Collapse
Affiliation(s)
- Valerio Piomponi
- Scuola Internazionale Superiore di Studi Avanzati, SISSA, Via Bonomea 265, Trieste 34136, Italy
- Area Science Park, località Padriciano, 99, Trieste 34149, Italy
| | - Miroslav Krepl
- Institute of Biophysics of the Czech Academy of Sciences, Kralovopolská 135, Brno 612 00, Czech Republic
| | - Jiri Sponer
- Institute of Biophysics of the Czech Academy of Sciences, Kralovopolská 135, Brno 612 00, Czech Republic
| | - Giovanni Bussi
- Scuola Internazionale Superiore di Studi Avanzati, SISSA, Via Bonomea 265, Trieste 34136, Italy
| |
Collapse
|
5
|
Tkaczyk S, Karwounopoulos J, Schöller A, Woodcock HL, Langer T, Boresch S, Wieder M. Reweighting from Molecular Mechanics Force Fields to the ANI-2x Neural Network Potential. J Chem Theory Comput 2024; 20:2719-2728. [PMID: 38527958 DOI: 10.1021/acs.jctc.3c01274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/27/2024]
Abstract
To achieve chemical accuracy in free energy calculations, it is necessary to accurately describe the system's potential energy surface and efficiently sample configurations from its Boltzmann distribution. While neural network potentials (NNPs) have shown significantly higher accuracy than classical molecular mechanics (MM) force fields, they have a limited range of applicability and are considerably slower than MM potentials, often by orders of magnitude. To address this challenge, Rufa et al. [Rufa et al. bioRxiv 2020, 10.1101/2020.07.29.227959.] suggested a two-stage approach that uses a fast and established MM alchemical energy protocol, followed by reweighting the results using NNPs, known as endstate correction or indirect free energy calculation. This study systematically investigates the accuracy and robustness of reweighting from an MM reference to a neural network target potential (ANI-2x) for an established data set in vacuum, using single-step free-energy perturbation (FEP) and nonequilibrium (NEQ) switching simulation. We assess the influence of longer switching lengths and the impact of slow degrees of freedom on outliers in the work distribution and compare the results to those of multistate equilibrium free energy simulations. Our results demonstrate that free energy calculations between NNPs and MM potentials should be preferably performed using NEQ switching simulations to obtain accurate free energy estimates. NEQ switching simulations between the MM potentials and NNPs are efficient, robust, and trivial to implement.
Collapse
Affiliation(s)
- Sara Tkaczyk
- Department of Pharmaceutical Sciences, Pharmaceutical Chemistry Division, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
- Vienna Doctoral School of Pharmaceutical, Nutritional and Sport Sciences (PhaNuSpo), University of Vienna, 1090 Vienna, Austria
| | - Johannes Karwounopoulos
- Faculty of Chemistry, Institute of Computational Biological Chemistry, University of Vienna, Währingerstrasse 17, 1090 Vienna, Austria
- Vienna Doctoral School of Chemistry (DoSChem), University of Vienna, Währingerstrasse 42, 1090 Vienna, Austria
| | - Andreas Schöller
- Faculty of Chemistry, Institute of Computational Biological Chemistry, University of Vienna, Währingerstrasse 17, 1090 Vienna, Austria
- Vienna Doctoral School of Chemistry (DoSChem), University of Vienna, Währingerstrasse 42, 1090 Vienna, Austria
| | - H Lee Woodcock
- Department of Chemistry, University of South Florida, 4202 E. Fowler Ave., CHE205, Tampa, Florida 33620-5250, United States
| | - Thierry Langer
- Department of Pharmaceutical Sciences, Pharmaceutical Chemistry Division, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
| | - Stefan Boresch
- Faculty of Chemistry, Institute of Computational Biological Chemistry, University of Vienna, Währingerstrasse 17, 1090 Vienna, Austria
| | - Marcus Wieder
- Faculty of Chemistry, Institute of Computational Biological Chemistry, University of Vienna, Währingerstrasse 17, 1090 Vienna, Austria
| |
Collapse
|
6
|
Greener JG. Differentiable simulation to develop molecular dynamics force fields for disordered proteins. Chem Sci 2024; 15:4897-4909. [PMID: 38550690 PMCID: PMC10966991 DOI: 10.1039/d3sc05230c] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Accepted: 02/08/2024] [Indexed: 11/11/2024] Open
Abstract
Implicit solvent force fields are computationally efficient but can be unsuitable for running molecular dynamics on disordered proteins. Here I improve the a99SB-disp force field and the GBNeck2 implicit solvent model to better describe disordered proteins. Differentiable molecular simulations with 5 ns trajectories are used to jointly optimise 108 parameters to better match explicit solvent trajectories. Simulations with the improved force field better reproduce the radius of gyration and secondary structure content seen in experiments, whilst showing slightly degraded performance on folded proteins and protein complexes. The force field, called GB99dms, reproduces the results of a small molecule binding study and improves agreement with experiment for the aggregation of amyloid peptides. GB99dms, which can be used in OpenMM, is available at https://github.com/greener-group/GB99dms. This work is the first to show that gradients can be obtained directly from nanosecond-length differentiable simulations of biomolecules and highlights the effectiveness of this approach to training whole force fields to match desired properties.
Collapse
Affiliation(s)
- Joe G Greener
- Medical Research Council Laboratory of Molecular Biology Cambridge CB2 0QH UK
| |
Collapse
|
7
|
Rinaldi S, Moroni E, Rozza R, Magistrato A. Frontiers and Challenges of Computing ncRNAs Biogenesis, Function and Modulation. J Chem Theory Comput 2024; 20:993-1018. [PMID: 38287883 DOI: 10.1021/acs.jctc.3c01239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2024]
Abstract
Non-coding RNAs (ncRNAs), generated from nonprotein coding DNA sequences, constitute 98-99% of the human genome. Non-coding RNAs encompass diverse functional classes, including microRNAs, small interfering RNAs, PIWI-interacting RNAs, small nuclear RNAs, small nucleolar RNAs, and long non-coding RNAs. With critical involvement in gene expression and regulation across various biological and physiopathological contexts, such as neuronal disorders, immune responses, cardiovascular diseases, and cancer, non-coding RNAs are emerging as disease biomarkers and therapeutic targets. In this review, after providing an overview of non-coding RNAs' role in cell homeostasis, we illustrate the potential and the challenges of state-of-the-art computational methods exploited to study non-coding RNAs biogenesis, function, and modulation. This can be done by directly targeting them with small molecules or by altering their expression by targeting the cellular engines underlying their biosynthesis. Drawing from applications, also taken from our work, we showcase the significance and role of computer simulations in uncovering fundamental facets of ncRNA mechanisms and modulation. This information may set the basis to advance gene modulation tools and therapeutic strategies to address unmet medical needs.
Collapse
Affiliation(s)
- Silvia Rinaldi
- National Research Council of Italy (CNR) - Institute of Chemistry of OrganoMetallic Compounds (ICCOM), c/o Area di Ricerca CNR di Firenze Via Madonna del Piano 10, 50019 Sesto Fiorentino, Florence, Italy
| | - Elisabetta Moroni
- National Research Council of Italy (CNR) - Institute of Chemical Sciences and Technologies (SCITEC), via Mario Bianco 9, 20131 Milano, Italy
| | - Riccardo Rozza
- National Research Council of Italy (CNR) - Institute of Material Foundry (IOM) c/o International School for Advanced Studies (SISSA), Via Bonomea, 265, 34136 Trieste, Italy
| | - Alessandra Magistrato
- National Research Council of Italy (CNR) - Institute of Material Foundry (IOM) c/o International School for Advanced Studies (SISSA), Via Bonomea, 265, 34136 Trieste, Italy
| |
Collapse
|
8
|
Gilardoni I, Fröhlking T, Bussi G. Boosting Ensemble Refinement with Transferable Force-Field Corrections: Synergistic Optimization for Molecular Simulations. J Phys Chem Lett 2024; 15:1204-1210. [PMID: 38272001 DOI: 10.1021/acs.jpclett.3c03423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2024]
Abstract
A novel method combining the force-field fitting approach and ensemble refinement by the maximum entropy principle is presented. Its formulation allows us to continuously interpolate between these two methods, which can thus be interpreted as two limiting cases. A cross-validation procedure enables us to correctly assess the relative weight of both of them, distinguishing scenarios in which the combined approach is meaningful from those in which either ensemble refinement or force-field fitting separately prevails. The efficacy of their combination is examined for a realistic case study of RNA oligomers. Within the new scheme, molecular dynamics simulations are integrated with experimental data provided by nuclear magnetic resonance measures. We show that force-field corrections are in general superior when applied to the appropriate force-field terms but are automatically discarded by the method when applied to inappropriate force-field terms.
Collapse
Affiliation(s)
- Ivan Gilardoni
- Scuola Internazionale Superiore di Studi Avanzati, via Bonomea 265, 34136 Trieste, Italy
| | - Thorben Fröhlking
- Scuola Internazionale Superiore di Studi Avanzati, via Bonomea 265, 34136 Trieste, Italy
| | - Giovanni Bussi
- Scuola Internazionale Superiore di Studi Avanzati, via Bonomea 265, 34136 Trieste, Italy
| |
Collapse
|
9
|
Matin S, Allen AEA, Smith J, Lubbers N, Jadrich RB, Messerly R, Nebgen B, Li YW, Tretiak S, Barros K. Machine Learning Potentials with the Iterative Boltzmann Inversion: Training to Experiment. J Chem Theory Comput 2024. [PMID: 38307009 DOI: 10.1021/acs.jctc.3c01051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2024]
Abstract
Methodologies for training machine learning potentials (MLPs) with quantum-mechanical simulation data have recently seen tremendous progress. Experimental data have a very different character than simulated data, and most MLP training procedures cannot be easily adapted to incorporate both types of data into the training process. We investigate a training procedure based on iterative Boltzmann inversion that produces a pair potential correction to an existing MLP using equilibrium radial distribution function data. By applying these corrections to an MLP for pure aluminum based on density functional theory, we observe that the resulting model largely addresses previous overstructuring in the melt phase. Interestingly, the corrected MLP also exhibits improved performance in predicting experimental diffusion constants, which are not included in the training procedure. The presented method does not require autodifferentiating through a molecular dynamics solver and does not make assumptions about the MLP architecture. Our results suggest a practical framework for incorporating experimental data into machine learning models to improve the accuracy of molecular dynamics simulations.
Collapse
Affiliation(s)
- Sakib Matin
- Department of Physics, Boston University, Boston, Massachusetts 02215, United States
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
| | - Alice E A Allen
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
| | - Justin Smith
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
- NVIDIA Corp., Santa Clara, California 95051, United States
| | - Nicholas Lubbers
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Ryan B Jadrich
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
| | - Richard Messerly
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
| | - Benjamin Nebgen
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
| | - Ying Wai Li
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Sergei Tretiak
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
- Center for Integrated Nanotechnologies, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
| | - Kipton Barros
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87546, United States
| |
Collapse
|
10
|
Wang K, Xu L, Shao W, Jin H, Wang Q, Ma M. A Multiple-Fidelity Method for Accurate Simulation of MoS 2 Properties Using JAX-ReaxFF and Neural Network Potentials. J Phys Chem Lett 2024; 15:371-379. [PMID: 38175525 DOI: 10.1021/acs.jpclett.3c03080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Reactive force field (ReaxFF) is a commonly used force field for modeling chemical reactions at the atomic level. Recently, JAX-ReaxFF, combined with automatic differentiation, has been used to efficiently parametrize ReaxFF. However, its analytical formula may lead to inaccurate predictions. While neural network-based potentials (NNPs) trained on density functional theory-labeled data offer a more accurate method, it requires a large amount of training data to be trained from scratch. To overcome these issues, we present a multiple-fidelity method that combines JAX-ReaxFF and NNP and apply the method on MoS2, a promising two-dimensional semiconductor for flexible electronics. By incorporating implicit prior physical information, ReaxFF can serve as a cost-effective way to generate pretraining data, facilitating more accurate simulations of MoS2. Moreover, in the Mo-S-H system, the pretraining strategy can reduce root-mean-square errors of energy by 20%. This approach can be extended to a wide variety of material systems, accelerating their computational research.
Collapse
Affiliation(s)
- Kehan Wang
- State Key Laboratory of Tribology in Advanced Equipment (SKLT), Department of Mechanical Engineering, Tsinghua University, Beijing 100084, China
- Center for Nano and Micro Mechanics, Tsinghua University, Beijing 100084, China
| | - Longkun Xu
- Samsung Research China - Beijing (SRC-B), Beijing 100102, China
| | - Wei Shao
- Samsung Research China - Beijing (SRC-B), Beijing 100102, China
| | - Haishun Jin
- Samsung Research China - Beijing (SRC-B), Beijing 100102, China
| | - Qiang Wang
- Samsung Research China - Beijing (SRC-B), Beijing 100102, China
| | - Ming Ma
- State Key Laboratory of Tribology in Advanced Equipment (SKLT), Department of Mechanical Engineering, Tsinghua University, Beijing 100084, China
- Center for Nano and Micro Mechanics, Tsinghua University, Beijing 100084, China
- Institute of Superlubricity Technology, Research Institute of Tsinghua University in Shenzhen, Shenzhen 518063, Guangdong, China
| |
Collapse
|
11
|
Menchon G, Maveyraud L, Czaplicki G. Molecular Dynamics as a Tool for Virtual Ligand Screening. Methods Mol Biol 2024; 2714:33-83. [PMID: 37676592 DOI: 10.1007/978-1-0716-3441-7_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
Rational drug design is essential for new drugs to emerge, especially when the structure of a target protein or nucleic acid is known. To that purpose, high-throughput virtual ligand screening campaigns aim at discovering computationally new binding molecules or fragments to modulate particular biomolecular interactions or biological activities, related to a disease process. The structure-based virtual ligand screening process primarily relies on docking methods which allow predicting the binding of a molecule to a biological target structure with a correct conformation and the best possible affinity. The docking method itself is not sufficient as it suffers from several and crucial limitations (lack of full protein flexibility information, no solvation and ion effects, poor scoring functions, and unreliable molecular affinity estimation).At the interface of computer techniques and drug discovery, molecular dynamics (MD) allows introducing protein flexibility before or after a docking protocol, refining the structure of protein-drug complexes in the presence of water, ions, and even in membrane-like environments, describing more precisely the temporal evolution of the biological complex and ranking these complexes with more accurate binding energy calculations. In this chapter, we describe the up-to-date MD, which plays the role of supporting tools in the virtual ligand screening (VS) process.Without a doubt, using docking in combination with MD is an attractive approach in structure-based drug discovery protocols nowadays. It has proved its efficiency through many examples in the literature and is a powerful method to significantly reduce the amount of required wet experimentations (Tarcsay et al, J Chem Inf Model 53:2990-2999, 2013; Barakat et al, PLoS One 7:e51329, 2012; De Vivo et al, J Med Chem 59:4035-4061, 2016; Durrant, McCammon, BMC Biol 9:71-79, 2011; Galeazzi, Curr Comput Aided Drug Des 5:225-240, 2009; Hospital et al, Adv Appl Bioinforma Chem 8:37-47, 2015; Jiang et al, Molecules 20:12769-12786, 2015; Kundu et al, J Mol Graph Model 61:160-174, 2015; Mirza et al, J Mol Graph Model 66:99-107, 2016; Moroy et al, Future Med Chem 7:2317-2331, 2015; Naresh et al, J Mol Graph Model 61:272-280, 2015; Nichols et al, J Chem Inf Model 51:1439-1446, 2011; Nichols et al, Methods Mol Biol 819:93-103, 2012; Okimoto et al, PLoS Comput Biol 5:e1000528, 2009; Rodriguez-Bussey et al, Biopolymers 105:35-42, 2016; Sliwoski et al, Pharmacol Rev 66:334-395, 2014).
Collapse
Affiliation(s)
- Grégory Menchon
- Inserm U1242, Oncogenesis, Stress and Signaling (OSS), Université de Rennes 1, Rennes, France
| | - Laurent Maveyraud
- Institut de Pharmacologie et de Biologie Structurale (IPBS), Université de Toulouse, CNRS, Université Toulouse III - Paul Sabatier (UT3), Toulouse, France
| | - Georges Czaplicki
- Institut de Pharmacologie et de Biologie Structurale (IPBS), Université de Toulouse, CNRS, Université Toulouse III - Paul Sabatier (UT3), Toulouse, France.
| |
Collapse
|
12
|
Lemcke S, Appeldorn JH, Wand M, Speck T. Toward a structural identification of metastable molecular conformations. J Chem Phys 2023; 159:114105. [PMID: 37712784 DOI: 10.1063/5.0164145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 08/21/2023] [Indexed: 09/16/2023] Open
Abstract
Interpreting high-dimensional data from molecular dynamics simulations is a persistent challenge. In this paper, we show that for a small peptide, deca-alanine, metastable states can be identified through a neural net based on structural information alone. While processing molecular dynamics data, dimensionality reduction is a necessary step that projects high-dimensional data onto a low-dimensional representation that, ideally, captures the conformational changes in the underlying data. Conventional methods make use of the temporal information contained in trajectories generated through integrating the equations of motion, which forgoes more efficient sampling schemes. We demonstrate that EncoderMap, an autoencoder architecture with an additional distance metric, can find a suitable low-dimensional representation to identify long-lived molecular conformations using exclusively structural information. For deca-alanine, which exhibits several helix-forming pathways, we show that this approach allows us to combine simulations with different biasing forces and yields representations comparable in quality to other established methods. Our results contribute to computational strategies for the rapid automatic exploration of the configuration space of peptides and proteins.
Collapse
Affiliation(s)
- Simon Lemcke
- Institut für Physik, Johannes Gutenberg-Universität Mainz, Staudingerweg 7-9, 55128 Mainz, Germany
| | - Jörn H Appeldorn
- Institut für Physik, Johannes Gutenberg-Universität Mainz, Staudingerweg 7-9, 55128 Mainz, Germany
| | - Michael Wand
- Institut für Informatik, Johannes Gutenberg-Universität Mainz, Staudingerweg 9, 55128 Mainz, Germany
| | - Thomas Speck
- Institut für Theoretische Physik IV, Universität Stuttgart, Heisenbergstr. 3, 70569 Stuttgart, Germany
| |
Collapse
|
13
|
Wang X, Li J, Yang L, Chen F, Wang Y, Chang J, Chen J, Feng W, Zhang L, Yu K. DMFF: An Open-Source Automatic Differentiable Platform for Molecular Force Field Development and Molecular Dynamics Simulation. J Chem Theory Comput 2023; 19:5897-5909. [PMID: 37589304 DOI: 10.1021/acs.jctc.2c01297] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
In the simulation of molecular systems, the underlying force field (FF) model plays an extremely important role in determining the reliability of the simulation. However, the quality of the state-of-the-art molecular force fields is still unsatisfactory in many cases, and the FF parameterization process largely relies on human experience, which is not scalable. To address this issue, we introduce DMFF, an open-source molecular FF development platform based on an automatic differentiation technique. DMFF serves as a powerful tool for both top-down and bottom-up FF development. Using DMFF, both energies/forces and thermodynamic quantities such as ensemble averages and free energies can be evaluated in a differentiable way, realizing an automatic, yet highly flexible FF optimization workflow. DMFF also eases the evaluation of forces and virial tensors for complicated advanced FFs, helping the fast validation of new models in molecular dynamics simulation. DMFF has been released as an open-source package under the LGPL-3.0 license and is available at https://github.com/deepmodeling/DMFF.
Collapse
Affiliation(s)
| | - Jichen Li
- DP Technology, Beijing 100080, P. R. China
| | - Lan Yang
- Tsinghua-Berkley Shenzhen Institute, Shenzhen, Guangdong 518055, P. R. China
| | | | | | | | - Junmin Chen
- Tsinghua-Berkley Shenzhen Institute, Shenzhen, Guangdong 518055, P. R. China
| | - Wei Feng
- DP Technology, Beijing 100080, P. R. China
| | - Linfeng Zhang
- AI for Science Institute, Beijing 100080, P. R. China
| | - Kuang Yu
- Tsinghua-Berkley Shenzhen Institute, Shenzhen, Guangdong 518055, P. R. China
- Tsinghua Shenzhen International Graduate School, Shenzhen, Guangdong 518055, P. R. China
| |
Collapse
|
14
|
Bolhuis PG, Brotzakis ZF, Keller BG. Optimizing molecular potential models by imposing kinetic constraints with path reweighting. J Chem Phys 2023; 159:074102. [PMID: 37581416 DOI: 10.1063/5.0151166] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2023] [Accepted: 06/19/2023] [Indexed: 08/16/2023] Open
Abstract
Empirical force fields employed in molecular dynamics simulations of complex systems are often optimized to reproduce experimentally determined structural and thermodynamic properties. In contrast, experimental knowledge about the interconversion rates between metastable states in such systems is hardly ever incorporated in a force field due to a lack of an efficient approach. Here, we introduce such a framework based on the relationship between dynamical observables, such as rate constants, and the underlying molecular model parameters using the statistical mechanics of trajectories. Given a prior ensemble of molecular dynamics trajectories produced with imperfect force field parameters, the approach allows for the optimal adaption of these parameters such that the imposed constraint of equally predicted and experimental rate constant is obeyed. To do so, the method combines the continuum path ensemble maximum caliber approach with path reweighting methods for stochastic dynamics. When multiple solutions are found, the method selects automatically the combination that corresponds to the smallest perturbation of the entire path ensemble, as required by the maximum entropy principle. To show the validity of the approach, we illustrate the method on simple test systems undergoing rare event dynamics. Next to simple 2D potentials, we explore particle models representing molecular isomerization reactions and protein-ligand unbinding. Besides optimal interaction parameters, the methodology gives physical insights into what parts of the model are most sensitive to the kinetics. We discuss the generality and broad implications of the methodology.
Collapse
Affiliation(s)
- Peter G Bolhuis
- van 't Hoff Institute for Molecular Sciences, University of Amsterdam, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands
| | - Z Faidon Brotzakis
- Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom
| | - Bettina G Keller
- Department of Biology, Chemistry, Pharmacy, Freie Universität Berlin, Arnimallee 22, D-14195 Berlin, Germany
| |
Collapse
|
15
|
Liebl K, Zacharias M. The development of nucleic acids force fields: From an unchallenged past to a competitive future. Biophys J 2023; 122:2841-2851. [PMID: 36540025 PMCID: PMC10398263 DOI: 10.1016/j.bpj.2022.12.022] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 11/08/2022] [Accepted: 12/15/2022] [Indexed: 12/24/2022] Open
Abstract
Molecular dynamics simulations have strongly matured as a method to study biomolecular processes. Their validity, however, is determined by the accuracy of the underlying force fields that describe the forces between all atoms. In this article, we review the development of nucleic acids force fields. We describe the early attempts in the 1990s and emphasize their strong influence on recent force fields. State-of-the-art force fields still use the same Lennard-Jones parameters derived 25 years ago in spite of the fact that these parameters were in general not fitted for nucleic acids. In addition, electrostatic parameters also are deprecated, which may explain some of the current force field deficiencies. We compare different force fields for various systems and discuss new tests of the recently developed Tumuc1 force field. The OL-force fields and Tumuc1 are arguably the best force fields to describe the DNA double helix. However, no force field is flawless. In particular, the description of sugar-puckering remains a problem for nucleic acids force fields. Future refinements are required, so we review methods for force field refinement and give an outlook to the future of force fields.
Collapse
Affiliation(s)
- Korbinian Liebl
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois.
| | - Martin Zacharias
- Physics Department and Center of Protein Assemblies, Technical University of Munich, Munich, Germany
| |
Collapse
|
16
|
Zhang P, Yang W. Toward a general neural network force field for protein simulations: Refining the intramolecular interaction in protein. J Chem Phys 2023; 159:024118. [PMID: 37431910 PMCID: PMC10481389 DOI: 10.1063/5.0142280] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 06/22/2023] [Indexed: 07/12/2023] Open
Abstract
Molecular dynamics (MD) is an extremely powerful, highly effective, and widely used approach to understanding the nature of chemical processes in atomic details for proteins. The accuracy of results from MD simulations is highly dependent on force fields. Currently, molecular mechanical (MM) force fields are mainly utilized in MD simulations because of their low computational cost. Quantum mechanical (QM) calculation has high accuracy, but it is exceedingly time consuming for protein simulations. Machine learning (ML) provides the capability for generating accurate potential at the QM level without increasing much computational effort for specific systems that can be studied at the QM level. However, the construction of general machine learned force fields, needed for broad applications and large and complex systems, is still challenging. Here, general and transferable neural network (NN) force fields based on CHARMM force fields, named CHARMM-NN, are constructed for proteins by training NN models on 27 fragments partitioned from the residue-based systematic molecular fragmentation (rSMF) method. The NN for each fragment is based on atom types and uses new input features that are similar to MM inputs, including bonds, angles, dihedrals, and non-bonded terms, which enhance the compatibility of CHARMM-NN to MM MD and enable the implementation of CHARMM-NN force fields in different MD programs. While the main part of the energy of the protein is based on rSMF and NN, the nonbonded interactions between the fragments and with water are taken from the CHARMM force field through mechanical embedding. The validations of the method for dipeptides on geometric data, relative potential energies, and structural reorganization energies demonstrate that the CHARMM-NN local minima on the potential energy surface are very accurate approximations to QM, showing the success of CHARMM-NN for bonded interactions. However, the MD simulations on peptides and proteins indicate that more accurate methods to represent protein-water interactions in fragments and non-bonded interactions between fragments should be considered in the future improvement of CHARMM-NN, which can increase the accuracy of approximation beyond the current mechanical embedding QM/MM level.
Collapse
Affiliation(s)
- Pan Zhang
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| | - Weitao Yang
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| |
Collapse
|
17
|
Kümmerer F, Orioli S, Lindorff-Larsen K. Fitting Force Field Parameters to NMR Relaxation Data. J Chem Theory Comput 2023. [PMID: 37276045 DOI: 10.1021/acs.jctc.3c00174] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
We present an approach to optimize force field parameters using time-dependent data from NMR relaxation experiments. To do so, we scan parameters in the dihedral angle potential energy terms describing the rotation of the methyl groups in proteins and compare NMR relaxation rates calculated from molecular dynamics simulations with the modified force fields to deuterium relaxation measurements of T4 lysozyme. We find that a small modification of Cγ methyl groups improves the agreement with experiments both for the protein used to optimize the force field and when validating using simulations of CI2 and ubiquitin. We also show that these improvements enable a more effective a posteriori reweighting of the MD trajectories. The resulting force field thus enables more direct comparison between simulations and side-chain NMR relaxation data and makes it possible to construct ensembles that better represent the dynamics of proteins in solution.
Collapse
Affiliation(s)
- Felix Kümmerer
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
| | - Simone Orioli
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biophysics, Niels Bohr Institute, Faculty of Science, University of Copenhagen, DK-2100 Copenhagen Ø, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
| |
Collapse
|
18
|
Fotopoulos V, Mora-Fonz D, Kleinbichler M, Bodlos R, Kozeschnik E, Romaner L, Shluger AL. Structure and Migration Mechanisms of Small Vacancy Clusters in Cu: A Combined EAM and DFT Study. NANOMATERIALS (BASEL, SWITZERLAND) 2023; 13:nano13091464. [PMID: 37177009 PMCID: PMC10180345 DOI: 10.3390/nano13091464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 04/21/2023] [Accepted: 04/22/2023] [Indexed: 05/15/2023]
Abstract
Voids in face-centered cubic (fcc) metals are commonly assumed to form via the aggregation of vacancies; however, the mechanisms of vacancy clustering and diffusion are not fully understood. In this study, we use computational modeling to provide a detailed insight into the structures and formation energies of primary vacancy clusters, mechanisms and barriers for their migration in bulk copper, and how these properties are affected at simple grain boundaries. The calculations were carried out using embedded atom method (EAM) potentials and density functional theory (DFT) and employed the site-occupation disorder code (SOD), the activation relaxation technique nouveau (ARTn) and the knowledge led master code (KLMC). We investigate stable structures and migration paths and barriers for clusters of up to six vacancies. The migration of vacancy clusters occurs via hops of individual constituent vacancies with di-vacancies having a significantly smaller migration barrier than mono-vacancies and other clusters. This barrier is further reduced when di-vacancies interact with grain boundaries. This interaction leads to the formation of self-interstitial atoms and introduces significant changes into the boundary structure. Tetra-, penta-, and hexa-vacancy clusters exhibit increasingly complex migration paths and higher barriers than smaller clusters. Finally, a direct comparison with the DFT results shows that EAM can accurately describe the vacancy-induced relaxation effects in the Cu bulk and in grain boundaries. Significant discrepancies between the two methods were found in structures with a higher number of low-coordinated atoms, such as penta-vacancies and di-vacancy absortion by grain boundary. These results will be useful for modeling the mechanisms of diffusion of complex defect structures and provide further insights into the structural evolution of metal films under thermal and mechanical stress.
Collapse
Affiliation(s)
- Vasileios Fotopoulos
- Department of Physics and Astronomy, University College London, Gower Street, London WC1E 6BT, UK
| | - David Mora-Fonz
- Department of Physics and Astronomy, University College London, Gower Street, London WC1E 6BT, UK
| | - Manuel Kleinbichler
- KAI-Kompetenzzentrum Automobil- und Industrieelektronik GmbH, Europastrasse 8, 9524 Villach, Austria
| | - Rishi Bodlos
- Materials Center Leoben Forschung GmbH (MCL), Roseggerstraße 12, 8700 Leoben, Austria
| | - Ernst Kozeschnik
- Institute of Materials Science and Technology, TU Wien, Getreidemarkt 9, 1060 Vienna, Austria
| | - Lorenz Romaner
- Materials Center Leoben Forschung GmbH (MCL), Roseggerstraße 12, 8700 Leoben, Austria
- Department of Materials Science, Montanuniversität Leoben, Franz-Josef Straße 18, 8700 Leoben, Austria
| | - Alexander L Shluger
- Department of Physics and Astronomy, University College London, Gower Street, London WC1E 6BT, UK
- WPI-Advanced Institute for Materials Research (WPI-AIMR), Tohoku University, 2-1-1 Katahira, Aoba-ku, Sendai 980-8577, Japan
| |
Collapse
|
19
|
Hofmann DWM, Kuleshova LN. A general force field by machine learning on experimental crystal structures. Calculations of intermolecular Gibbs energy with FlexCryst. Acta Crystallogr A Found Adv 2023; 79:132-144. [PMID: 36862039 DOI: 10.1107/s2053273323000268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 01/10/2023] [Indexed: 02/11/2023] Open
Abstract
Machine learning was employed on the experimental crystal structures of the Cambridge Structural Database (CSD) to derive an intermolecular force field for all available types of atoms (general force field). The obtained pairwise interatomic potentials of the general force field allow for the fast and accurate calculation of intermolecular Gibbs energy. The approach is based on three postulates regarding Gibbs energy: the lattice energy must be below zero, the crystal structure must be a local minimum, and, if available, the experimental and the calculated lattice energy must coincide. The parametrized general force field was then validated regarding these three conditions. First, the experimental lattice energy was compared with the calculated energies. The observed errors were found to be in the order of experimental errors. Second, Gibbs lattice energy was calculated for all structures available in the CSD. Their energy values were found to be below zero in 99.86% of the cases. Finally, 500 random structures were minimized, and the change in density and energy was examined. The mean error in the case of density was below 4.06%, and for energy it was below 5.7%. The obtained general force field calculated Gibbs lattice energies of 259 041 known crystal structures within a few hours. Since Gibbs energy defines the reaction energy, the calculated energy can be used to predict chemical-physical properties of crystals, for instance, the formation of co-crystals, polymorph stability and solubility.
Collapse
|
20
|
Csizi K, Reiher M. Universal
QM
/
MM
approaches for general nanoscale applications. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2023. [DOI: 10.1002/wcms.1656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Affiliation(s)
| | - Markus Reiher
- Laboratorium für Physikalische Chemie ETH Zürich Zürich Switzerland
| |
Collapse
|
21
|
Houston PL, Qu C, Yu Q, Conte R, Nandi A, Li JK, Bowman JM. PESPIP: Software to fit complex molecular and many-body potential energy surfaces with permutationally invariant polynomials. J Chem Phys 2023; 158:044109. [PMID: 36725524 DOI: 10.1063/5.0134442] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
We wish to describe a potential energy surface by using a basis of permutationally invariant polynomials whose coefficients will be determined by numerical regression so as to smoothly fit a dataset of electronic energies as well as, perhaps, gradients. The polynomials will be powers of transformed internuclear distances, usually either Morse variables, exp(-ri,j/λ), where λ is a constant range hyperparameter, or reciprocals of the distances, 1/ri,j. The question we address is how to create the most efficient basis, including (a) which polynomials to keep or discard, (b) how many polynomials will be needed, (c) how to make sure the polynomials correctly reproduce the zero interaction at a large distance, (d) how to ensure special symmetries, and (e) how to calculate gradients efficiently. This article discusses how these questions can be answered by using a set of programs to choose and manipulate the polynomials as well as to write efficient Fortran programs for the calculation of energies and gradients. A user-friendly interface for access to monomial symmetrization approach results is also described. The software for these programs is now publicly available.
Collapse
Affiliation(s)
- Paul L Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, USA and Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Chen Qu
- Independent Researcher, Toronto, Ontario M9B0E3, Canada
| | - Qi Yu
- Department of Chemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Riccardo Conte
- Dipartimento di Chimica, Università Degli Studi di Milano, Via Golgi 19, 20133 Milano, Italy
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA
| | - Jeffrey K Li
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA
| | - Joel M Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA
| |
Collapse
|
22
|
Ferreira TH, Maximiano P, Ureta M, Gomez-Zavaglia A, Simões PN. Molecular Simulation: a remarkable tool to study mechanisms of cell membrane preservation in probiotic bacteria. Curr Opin Food Sci 2023. [DOI: 10.1016/j.cofs.2022.100985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
23
|
Schmid F. Understanding and Modeling Polymers: The Challenge of Multiple Scales. ACS POLYMERS AU 2022. [DOI: 10.1021/acspolymersau.2c00049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Friederike Schmid
- Institut für Physik, Johannes Gutenberg-Universität Mainz, Staudingerweg 9, 55128Mainz, Germany
| |
Collapse
|
24
|
Borišek J, Aupič J, Magistrato A. Establishing the catalytic and regulatory mechanism of
RNA
‐based machineries. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Jure Borišek
- Theory Department National Institute of Chemistry Ljubljana Slovenia
| | | | | |
Collapse
|
25
|
Oliveira MP, Gonçalves YMH, Ol Gheta SK, Rieder SR, Horta BAC, Hünenberger PH. Comparison of the United- and All-Atom Representations of (Halo)alkanes Based on Two Condensed-Phase Force Fields Optimized against the Same Experimental Data Set. J Chem Theory Comput 2022; 18:6757-6778. [PMID: 36190354 DOI: 10.1021/acs.jctc.2c00524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The level of accuracy that can be achieved by a force field is influenced by choices made in the interaction-function representation and in the relevant simulation parameters. These choices, referred to here as functional-form variants (FFVs), include for example the model resolution, the charge-derivation procedure, the van der Waals combination rules, the cutoff distance, and the treatment of the long-range interactions. Ideally, assessing the effect of a given FFV on the intrinsic accuracy of the force-field representation requires that only the specific FFV is changed and that this change is performed at an optimal level of parametrization, a requirement that may prove extremely challenging to achieve in practice. Here, we present a first attempt at such a comparison for one specific FFV, namely the choice of a united-atom (UA) versus an all-atom (AA) resolution in a force field for saturated acyclic (halo)alkanes. Two force-field versions (UA vs AA) are optimized in an automated way using the CombiFF approach against 961 experimental values for the pure-liquid densities ρliq and vaporization enthalpies ΔHvap of 591 compounds. For the AA force field, the torsional and third-neighbor Lennard-Jones parameters are also refined based on quantum-mechanical rotational-energy profiles. The comparison between the UA and AA resolutions is also extended to properties that have not been included as parameterization targets, namely the surface-tension coefficient γ, the isothermal compressibility κT, the isobaric thermal-expansion coefficient αP, the isobaric heat capacity cP, the static relative dielectric permittivity ϵ, the self-diffusion coefficient D, the shear viscosity η, the hydration free energy ΔGwat, and the free energy of solvation ΔGche in cyclohexane. For the target properties ρliq and ΔHvap, the UA and AA resolutions reach very similar levels of accuracy after optimization. For the nine other properties, the AA representation leads to more accurate results in terms of η; comparably accurate results in terms of γ, κT, αP, ϵ, D, and ΔGche; and less accurate results in terms of cP and ΔGwat. This work also represents a first step toward the calibration of a GROMOS-compatible force field at the AA resolution.
Collapse
Affiliation(s)
- Marina P Oliveira
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Yan M H Gonçalves
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - S Kashef Ol Gheta
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Salomé R Rieder
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Bruno A C Horta
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Philippe H Hünenberger
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| |
Collapse
|
26
|
Piomponi V, Fröhlking T, Bernetti M, Bussi G. Molecular Simulations Matching Denaturation Experiments for N 6-Methyladenosine. ACS CENTRAL SCIENCE 2022; 8:1218-1228. [PMID: 36032773 PMCID: PMC9413829 DOI: 10.1021/acscentsci.2c00565] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Indexed: 06/15/2023]
Abstract
Post-transcriptional modifications are crucial for RNA function and can affect its structure and dynamics. Force-field-based classical molecular dynamics simulations are a fundamental tool to characterize biomolecular dynamics, and their application to RNA is flourishing. Here, we show that the set of force-field parameters for N6-methyladenosine (m6A) developed for the commonly used AMBER force field does not reproduce duplex denaturation experiments and, specifically, cannot be used to describe both paired and unpaired states. Then, we use reweighting techniques to derive new parameters matching available experimental data. The resulting force field can be used to properly describe paired and unpaired m6A in both syn and anti conformation, which thus opens the way to the use of molecular simulations to investigate the effects of N6 methylations on RNA structural dynamics.
Collapse
|
27
|
Ringrose C, Horton JT, Wang LP, Cole DJ. Exploration and validation of force field design protocols through QM-to-MM mapping. Phys Chem Chem Phys 2022; 24:17014-17027. [PMID: 35792069 DOI: 10.1039/d2cp02864f] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The scale of the parameter optimisation problem in traditional molecular mechanics force field construction means that design of a new force field is a long process, and sub-optimal choices made in the early stages can persist for many generations. We hypothesise that careful use of quantum mechanics to inform molecular mechanics parameter derivation (QM-to-MM mapping) should be used to significantly reduce the number of parameters that require fitting to experiment and increase the pace of force field development. Here, we design and train a collection of 15 new protocols for small, organic molecule force field derivation, and test their accuracy against experimental liquid properties. Our best performing model has only seven fitting parameters, yet achieves mean unsigned errors of just 0.031 g cm-3 and 0.69 kcal mol-1 in liquid densities and heats of vaporisation, compared to experiment. The software required to derive the designed force fields is freely available at https://github.com/qubekit/QUBEKit.
Collapse
Affiliation(s)
- Chris Ringrose
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne NE1 7RU, UK.
| | - Joshua T Horton
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne NE1 7RU, UK.
| | - Lee-Ping Wang
- Department of Chemistry, The University of California at Davis, Davis, California 95616, USA
| | - Daniel J Cole
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne NE1 7RU, UK.
| |
Collapse
|
28
|
Bowman JM, Qu C, Conte R, Nandi A, Houston PL, Yu Q. The MD17 datasets from the perspective of datasets for gas-phase “small” molecule potentials. J Chem Phys 2022; 156:240901. [DOI: 10.1063/5.0089200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
There has been great progress in developing methods for machine-learned potential energy surfaces. There have also been important assessments of these methods by comparing so-called learning curves on datasets of electronic energies and forces, notably the MD17 database. The dataset for each molecule in this database generally consists of tens of thousands of energies and forces obtained from DFT direct dynamics at 500 K. We contrast the datasets from this database for three “small” molecules, ethanol, malonaldehyde, and glycine, with datasets we have generated with specific targets for the potential energy surfaces (PESs) in mind: a rigorous calculation of the zero-point energy and wavefunction, the tunneling splitting in malonaldehyde, and, in the case of glycine, a description of all eight low-lying conformers. We found that the MD17 datasets are too limited for these targets. We also examine recent datasets for several PESs that describe small-molecule but complex chemical reactions. Finally, we introduce a new database, “QM-22,” which contains datasets of molecules ranging from 4 to 15 atoms that extend to high energies and a large span of configurations.
Collapse
Affiliation(s)
- Joel M. Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA
| | - Chen Qu
- Independent Researcher, Toronto, Canada
| | - Riccardo Conte
- Dipartimento di Chimica, Università Degli Studi di Milano, via Golgi 19, 20133 Milano, Italy
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA
| | - Paul L. Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, USA
- Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Qi Yu
- Department of Chemistry, Yale University, New Haven, Connecticut 06520, USA
| |
Collapse
|
29
|
Fröhlking T, Mlýnský V, Janeček M, Kührová P, Krepl M, Banáš P, Šponer J, Bussi G. Automatic Learning of Hydrogen-Bond Fixes in the AMBER RNA Force Field. J Chem Theory Comput 2022; 18:4490-4502. [PMID: 35699952 PMCID: PMC9281393 DOI: 10.1021/acs.jctc.2c00200] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
![]()
The
capability of
current force fields to reproduce RNA structural
dynamics is limited. Several methods have been developed to take advantage
of experimental data in order to enforce agreement with experiments.
Here, we extend an existing framework which allows arbitrarily chosen
force-field correction terms to be fitted by quantification of the
discrepancy between observables back-calculated from simulation and
corresponding experiments. We apply a robust regularization protocol
to avoid overfitting and additionally introduce and compare a number
of different regularization strategies, namely, L1, L2, Kish size,
relative Kish size, and relative entropy penalties. The training set
includes a GACC tetramer as well as more challenging systems, namely,
gcGAGAgc and gcUUCGgc RNA tetraloops. Specific intramolecular hydrogen
bonds in the AMBER RNA force field are corrected with automatically
determined parameters that we call gHBfixopt. A validation
involving a separate simulation of a system present in the training
set (gcUUCGgc) and new systems not seen during training (CAAU and
UUUU tetramers) displays improvements regarding the native population
of the tetraloop as well as good agreement with NMR experiments for
tetramers when using the new parameters. Then, we simulate folded
RNAs (a kink–turn and L1 stalk rRNA) including hydrogen bond
types not sufficiently present in the training set. This allows a
final modification of the parameter set which is named gHBfix21 and
is suggested to be applicable to a wider range of RNA systems.
Collapse
Affiliation(s)
- Thorben Fröhlking
- Scuola Internazionale Superiore di Studi Avanzati, via Bonomea 265, Trieste 34136, Italy
| | - Vojtěch Mlýnský
- Institute of Biophysics of the Czech Academy of Sciences, Kralovopolska 135, Brno 612 65, Czech Republic
| | - Michal Janeček
- Department of Physical Chemistry, Faculty of Science, Palacky University, tr. 17 listopadu 12, Olomouc 771 46, Czech Republic
| | - Petra Kührová
- Regional Centre of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacky University Olomouc, Slechtitelu 27, 779 00 Olomouc, Czech Republic
| | - Miroslav Krepl
- Institute of Biophysics of the Czech Academy of Sciences, Kralovopolska 135, Brno 612 65, Czech Republic.,Regional Centre of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacky University Olomouc, Slechtitelu 27, 779 00 Olomouc, Czech Republic
| | - Pavel Banáš
- Regional Centre of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacky University Olomouc, Slechtitelu 27, 779 00 Olomouc, Czech Republic
| | - Jiří Šponer
- Institute of Biophysics of the Czech Academy of Sciences, Kralovopolska 135, Brno 612 65, Czech Republic.,Regional Centre of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacky University Olomouc, Slechtitelu 27, 779 00 Olomouc, Czech Republic
| | - Giovanni Bussi
- Scuola Internazionale Superiore di Studi Avanzati, via Bonomea 265, Trieste 34136, Italy
| |
Collapse
|
30
|
Darmawan KK, Karagiannis TC, Hughes JG, Small DM, Hung A. Molecular modeling of lactoferrin for food and nutraceutical applications: insights from in silico techniques. Crit Rev Food Sci Nutr 2022; 63:9074-9097. [PMID: 35503258 DOI: 10.1080/10408398.2022.2067824] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Lactoferrin is a protein, primarily found in milk that has attracted the interest of the food industries due to its health properties. Nevertheless, the instability of lactoferrin has limited its commercial application. Recent studies have focused on encapsulation to enhance the stability of lactoferrin. However, the molecular insights underlying the changes of structural properties of lactoferrin and the interaction with protectants remain poorly understood. Computational approaches have proven useful in understanding the structural properties of molecules and the key binding with other constituents. In this review, comprehensive information on the structure and function of lactoferrin and the binding with various molecules for food purposes are reviewed, with a special emphasis on the use of molecular dynamics simulations. The results demonstrate the application of modeling and simulations to determine key residues of lactoferrin responsible for its stability and interactions with other biomolecular components under various conditions, which are also associated with its functional benefits. These have also been extended into the potential creation of enhanced lactoferrin for commercial purposes. This review provides valuable strategies in designing novel nutraceuticals for food science practitioners and those who have interests in acquiring familiarity with the application of computational modeling for food and health purposes.
Collapse
Affiliation(s)
- Kevion K Darmawan
- School of Science, STEM College, RMIT University, Melbourne, Australia
| | - Tom C Karagiannis
- Epigenomic Medicine, Department of Diabetes, Central Clinical School, Monash University, Melbourne, Australia
- Department of Clinical Pathology, The University of Melbourne, Melbourne, Australia
| | - Jeff G Hughes
- School of Science, STEM College, RMIT University, Melbourne, Australia
| | - Darryl M Small
- School of Science, STEM College, RMIT University, Melbourne, Australia
| | - Andrew Hung
- School of Science, STEM College, RMIT University, Melbourne, Australia
| |
Collapse
|
31
|
Madin OC, Boothroyd S, Messerly RA, Fass J, Chodera JD, Shirts MR. Bayesian-Inference-Driven Model Parametrization and Model Selection for 2CLJQ Fluid Models. J Chem Inf Model 2022; 62:874-889. [PMID: 35129974 PMCID: PMC9217127 DOI: 10.1021/acs.jcim.1c00829] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A high level of physical detail in a molecular model improves its ability to perform high accuracy simulations but can also significantly affect its complexity and computational cost. In some situations, it is worthwhile to add complexity to a model to capture properties of interest; in others, additional complexity is unnecessary and can make simulations computationally infeasible. In this work, we demonstrate the use of Bayesian inference for molecular model selection, using Monte Carlo sampling techniques accelerated with surrogate modeling to evaluate the Bayes factor evidence for different levels of complexity in the two-centered Lennard-Jones + quadrupole (2CLJQ) fluid model. Examining three nested levels of model complexity, we demonstrate that the use of variable quadrupole and bond length parameters in this model framework is justified only for some chemistries. Through this process, we also get detailed information about the distributions and correlation of parameter values, enabling improved parametrization and parameter analysis. We also show how the choice of parameter priors, which encode previous model knowledge, can have substantial effects on the selection of models, penalizing careless introduction of additional complexity. We detail the computational techniques used in this analysis, providing a roadmap for future applications of molecular model selection via Bayesian inference and surrogate modeling.
Collapse
Affiliation(s)
- Owen C. Madin
- Department of Chemical & Biological Engineering, University of Colorado Boulder, Boulder, CO 80309
| | - Simon Boothroyd
- Boothroyd Scientific Consulting Ltd., 71-75 Shelton Street, London, Greater London, United Kingdom, WC2H 9JQ
| | | | - Josh Fass
- Computational & Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065
| | - John D. Chodera
- Department of Chemical & Biological Engineering, University of Colorado Boulder, Boulder, CO 80309
| | - Michael R. Shirts
- Department of Chemical & Biological Engineering, University of Colorado Boulder, Boulder, CO 80309
| |
Collapse
|
32
|
Houston PL, Qu C, Nandi A, Conte R, Yu Q, Bowman JM. Permutationally invariant polynomial regression for energies and gradients, using reverse differentiation, achieves orders of magnitude speed-up with high precision compared to other machine learning methods. J Chem Phys 2022; 156:044120. [DOI: 10.1063/5.0080506] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Affiliation(s)
- Paul L. Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, USA and Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Chen Qu
- Department of Chemistry and Biochemistry, University of Maryland, College Park, Maryland 20742, USA
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA
| | - Riccardo Conte
- Dipartimento di Chimica, Università Degli Studi di Milano, via Golgi 19, 20133 Milano, Italy
| | - Qi Yu
- Department of Chemistry, Yale University, New Haven, Connecticut 06511, USA
| | - Joel M. Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA
| |
Collapse
|
33
|
Thaler S, Zavadlav J. Learning neural network potentials from experimental data via Differentiable Trajectory Reweighting. Nat Commun 2021; 12:6884. [PMID: 34824254 PMCID: PMC8617111 DOI: 10.1038/s41467-021-27241-4] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Accepted: 11/09/2021] [Indexed: 11/09/2022] Open
Abstract
In molecular dynamics (MD), neural network (NN) potentials trained bottom-up on quantum mechanical data have seen tremendous success recently. Top-down approaches that learn NN potentials directly from experimental data have received less attention, typically facing numerical and computational challenges when backpropagating through MD simulations. We present the Differentiable Trajectory Reweighting (DiffTRe) method, which bypasses differentiation through the MD simulation for time-independent observables. Leveraging thermodynamic perturbation theory, we avoid exploding gradients and achieve around 2 orders of magnitude speed-up in gradient computation for top-down learning. We show effectiveness of DiffTRe in learning NN potentials for an atomistic model of diamond and a coarse-grained model of water based on diverse experimental observables including thermodynamic, structural and mechanical properties. Importantly, DiffTRe also generalizes bottom-up structural coarse-graining methods such as iterative Boltzmann inversion to arbitrary potentials. The presented method constitutes an important milestone towards enriching NN potentials with experimental data, particularly when accurate bottom-up data is unavailable.
Collapse
Affiliation(s)
- Stephan Thaler
- Professorship of Multiscale Modeling of Fluid Materials, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany.
| | - Julija Zavadlav
- Professorship of Multiscale Modeling of Fluid Materials, TUM School of Engineering and Design, Technical University of Munich, Munich, Germany.
- Munich Data Science Institute, Technical University of Munich, Munich, Germany.
| |
Collapse
|
34
|
Patmanidis I, Alessandri R, de Vries AH, Marrink SJ. Comparing Dimerization Free Energies and Binding Modes of Small Aromatic Molecules with Different Force Fields. Molecules 2021; 26:molecules26196069. [PMID: 34641613 PMCID: PMC8512883 DOI: 10.3390/molecules26196069] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Revised: 09/25/2021] [Accepted: 09/26/2021] [Indexed: 11/16/2022] Open
Abstract
Dimerization free energies are fundamental quantities that describe the strength of interaction of different molecules. Obtaining accurate experimental values for small molecules and disentangling the conformations that contribute most to the binding can be extremely difficult, due to the size of the systems and the small energy differences. In many cases, one has to resort to computational methods to calculate such properties. In this work, we used molecular dynamics simulations in conjunction with metadynamics to calculate the free energy of dimerization of small aromatic rings, and compared three models from popular online servers for atomistic force fields, namely G54a7, CHARMM36 and OPLS. We show that, regardless of the force field, the profiles for the dimerization free energy of these compounds are very similar. However, significant care needs to be taken when studying larger molecules, since the deviations from the trends increase with the size of the molecules, resulting in force field dependent preferred stacking modes; for example, in the cases of pyrene and tetracene. Our results provide a useful background study for using topology builders to model systems which rely on stacking of aromatic moieties, and are relevant in areas ranging from drug design to supramolecular assembly.
Collapse
Affiliation(s)
- Ilias Patmanidis
- Groningen Biomolecular Sciences and Biotechnology Institute and Zernike Institute for Advanced Materials, University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands; (I.P.); (R.A.); (A.H.d.V.)
| | - Riccardo Alessandri
- Groningen Biomolecular Sciences and Biotechnology Institute and Zernike Institute for Advanced Materials, University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands; (I.P.); (R.A.); (A.H.d.V.)
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, IL 60637, USA
| | - Alex H. de Vries
- Groningen Biomolecular Sciences and Biotechnology Institute and Zernike Institute for Advanced Materials, University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands; (I.P.); (R.A.); (A.H.d.V.)
| | - Siewert J. Marrink
- Groningen Biomolecular Sciences and Biotechnology Institute and Zernike Institute for Advanced Materials, University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands; (I.P.); (R.A.); (A.H.d.V.)
- Correspondence:
| |
Collapse
|
35
|
Vuorte M, Kuitunen S, Sammalkorpi M. Physisorption of bio oil nitrogen compounds onto montmorillonite. Phys Chem Chem Phys 2021; 23:21840-21851. [PMID: 34554171 DOI: 10.1039/d1cp01880a] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
We assess computationally the adsorption of a series of nitrogen containing heterocycles and fatty acid amides from bio-oil on a model clay surface, Na-montmorillonite. The adsorption energies and conformations predicted by atomistic detail molecular dynamics (MD) simulations are compared against density functional theory (DFT) based molecular electrostatic potentials (MEP) and Hirshfeld, AIM, Merz-Singh-Kollman, and ChelpG charges. MD predicts systematically adsorption via cation bridging with adsorption strength of the heterocycles following purine > pyridine > imidazole > pyrrole > indole > quinoline. The fatty acid amides adsorption strength follows the steric availability and bulkiness of the head group. A comparison against the DFT calculations shows that MEP predicts adsorption geometries and the MD simulations reproduce the conformations for single adsorption site species. However, the DFT derived charge distibutions show that MD force-fields with non-polarizable fixed partial charge representations parametrized for aqueous environments cannot be used in apolar solvent environments without careful accuracy considerations. The overall trends in adsorption energies are reproduced by the Charmm GenFF employed in the MD simulations but the adsorption energies are systematically overestimated in this apolar solvent environment. The work has significance both for revealing nitrogen compound adsorption trends in technologically relevant bio oil environments but also as a methodological assessment revealing the limits of state of the art biomolecular force-fields and simulation protocols in apolar bioenvironments.
Collapse
Affiliation(s)
- Maisa Vuorte
- Department of Chemistry and Materials Science, School of Chemical Engineering, Aalto University, P.O. Box 16100, FI-00076 Aalto, Finland.
| | - Susanna Kuitunen
- Neste Engineering Solutions Oy, P.O. Box 310, FI-06101 Porvoo, Finland
| | - Maria Sammalkorpi
- Department of Chemistry and Materials Science, School of Chemical Engineering, Aalto University, P.O. Box 16100, FI-00076 Aalto, Finland. .,Department of Bioproducts and Biosystems, School of Chemical Engineering, Aalto University, P.O. Box 16100, FI-00076 Aalto, Finland
| |
Collapse
|
36
|
Role and Perspective of Molecular Simulation-Based Investigation of RNA-Ligand Interaction: From Small Molecules and Peptides to Photoswitchable RNA Binding. Molecules 2021; 26:molecules26113384. [PMID: 34205049 PMCID: PMC8199858 DOI: 10.3390/molecules26113384] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 05/28/2021] [Accepted: 06/01/2021] [Indexed: 12/15/2022] Open
Abstract
Aberrant RNA–protein complexes are formed in a variety of diseases. Identifying the ligands that interfere with their formation is a valuable therapeutic strategy. Molecular simulation, validated against experimental data, has recently emerged as a powerful tool to predict both the pose and energetics of such ligands. Thus, the use of molecular simulation may provide insight into aberrant molecular interactions in diseases and, from a drug design perspective, may allow for the employment of less wet lab resources than traditional in vitro compound screening approaches. With regard to basic research questions, molecular simulation can support the understanding of the exact molecular interaction and binding mode. Here, we focus on examples targeting RNA–protein complexes in neurodegenerative diseases and viral infections. These examples illustrate that the strategy is rather general and could be applied to different pharmacologically relevant approaches. We close this study by outlining one of these approaches, namely the light-controllable association of small molecules with RNA, as an emerging approach in RNA-targeting therapy.
Collapse
|
37
|
Ulian G, Moro D, Valdrè G. Hydroxylapatite and Related Minerals in Bone and Dental Tissues: Structural, Spectroscopic and Mechanical Properties from a Computational Perspective. Biomolecules 2021; 11:728. [PMID: 34068073 PMCID: PMC8152500 DOI: 10.3390/biom11050728] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Revised: 05/05/2021] [Accepted: 05/10/2021] [Indexed: 01/07/2023] Open
Abstract
Hard tissues (e.g., bone, enamel, dentin) in vertebrates perform various and different functions, from sustaining the body to haematopoiesis. Such complex and hierarchal tissue is actually a material composite whose static and dynamic properties are controlled by the subtle physical and chemical interplay between its components, collagen (main organic part) and hydroxylapatite-like mineral. The knowledge needed to fully understand the properties of bony and dental tissues and to develop specific applicative biomaterials (e.g., fillers, prosthetics, scaffolds, implants, etc.) resides mostly at the atomic scale. Among the different methods to obtains such detailed information, atomistic computer simulations (in silico) have proven to be both corroborative and predictive tools in this subject. The authors have intensively worked on quantum mechanical simulations of bioapatite and the present work reports a detailed review addressed to the crystal-chemical, physical, spectroscopic, mechanical, and surface properties of the mineral phase of bone and dental tissues. The reviewed studies were conducted at different length and time scales, trying to understand the features of hydroxylapatite and biological apatite models alone and/or in interaction with simplified collagen-like models. The reported review shows the capability of the computational approach in dealing with complex biological physicochemical systems, providing accurate results that increase the overall knowledge of hard tissue science.
Collapse
Affiliation(s)
- Gianfranco Ulian
- Centro di Ricerca Interdisciplinare di Biomineralogia, Cristallografia e Biomateriali, Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna Alma Mater Studiorum, P. Porta San Donato 1, 40126 Bologna, Italy;
| | | | - Giovanni Valdrè
- Centro di Ricerca Interdisciplinare di Biomineralogia, Cristallografia e Biomateriali, Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna Alma Mater Studiorum, P. Porta San Donato 1, 40126 Bologna, Italy;
| |
Collapse
|
38
|
Rosenberger D, Smith JS, Garcia AE. Modeling of Peptides with Classical and Novel Machine Learning Force Fields: A Comparison. J Phys Chem B 2021; 125:3598-3612. [DOI: 10.1021/acs.jpcb.0c10401] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Affiliation(s)
- David Rosenberger
- Los Alamos National Laboratory, Theoretical Division, Chemistry and Physics of Materials Group, Los Alamos, 87545 New Mexico, United States
- Los Alamos National Laboratory, Theoretical Division, Center for Nonlinear Studies, Los Alamos, 87545 New Mexico, United States
| | - Justin S. Smith
- Los Alamos National Laboratory, Theoretical Division, Chemistry and Physics of Materials Group, Los Alamos, 87545 New Mexico, United States
| | - Angel E. Garcia
- Los Alamos National Laboratory, Theoretical Division, Center for Nonlinear Studies, Los Alamos, 87545 New Mexico, United States
| |
Collapse
|
39
|
Kania A, Sarapata K, Gucwa M, Wójcik-Augustyn A. Optimal Solution to the Torsional Coefficient Fitting Problem in Force Field Parametrization. J Phys Chem A 2021; 125:2673-2681. [PMID: 33759532 PMCID: PMC8041298 DOI: 10.1021/acs.jpca.0c10845] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
![]()
Molecular modeling
is an excellent tool for studying biological
systems on the atomic scale. Depending on objects, which may be proteins,
nucleic acids, or lipids, different force fields are recommended.
The phospholipid bilayers constitute an example, in which behavior
is extensively studied using molecular dynamics simulations due to
limitations of experimental methods. The reliability of the results
is strongly dependent on an appropriate description of these compounds.
There are some deficiencies in the parametrization of intra- and intermolecular
interactions that result in incorrect reproduction of phospholipid
bilayer properties known from experimental studies, such as temperatures
of phase transitions. Refinement of the force field parameters of
nonbonded interactions present in the studied system is required to
close these discrepancies. Such parameters as partial charges and
torsional potential coefficients are crucial in this issue and not
obtainable from experimental studies. This work presents a new fitting
procedure for torsional coefficients that employs linear algebra theory
and compares it with the Monte Carlo method. The proposed algebraic
approach can be applied to any considered molecular system. In the
manuscript, it is presented on the example of dimethyl phosphoric
acid molecule. The advantages of our method encompass finding an optimal
solution, the lack of additional parameters required by the algorithm,
and significantly shorter computational time. Additionally, we indicate
the importance of proper assignment of the partial charges.
Collapse
Affiliation(s)
- Adrian Kania
- Department of Computational Biophysics and Bioinformatics, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, ul. Gronostajowa 7, 30-387 Cracow, Poland
| | - Krzysztof Sarapata
- Department of Computational Biophysics and Bioinformatics, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, ul. Gronostajowa 7, 30-387 Cracow, Poland
| | - Michał Gucwa
- Department of Computational Biophysics and Bioinformatics, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, ul. Gronostajowa 7, 30-387 Cracow, Poland
| | - Anna Wójcik-Augustyn
- Department of Computational Biophysics and Bioinformatics, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, ul. Gronostajowa 7, 30-387 Cracow, Poland
| |
Collapse
|
40
|
Casalini T. Not only in silico drug discovery: Molecular modeling towards in silico drug delivery formulations. J Control Release 2021; 332:390-417. [PMID: 33675875 DOI: 10.1016/j.jconrel.2021.03.005] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Revised: 02/28/2021] [Accepted: 03/02/2021] [Indexed: 12/18/2022]
Abstract
The use of methods at molecular scale for the discovery of new potential active ligands, as well as previously unknown binding sites for target proteins, is now an established reality. Literature offers many successful stories of active compounds developed starting from insights obtained in silico and approved by Food and Drug Administration (FDA). One of the most famous examples is raltegravir, a HIV integrase inhibitor, which was developed after the discovery of a previously unknown transient binding area thanks to molecular dynamics simulations. Molecular simulations have the potential to also improve the design and engineering of drug delivery devices, which are still largely based on fundamental conservation equations. Although they can highlight the dominant release mechanism and quantitatively link the release rate to design parameters (size, drug loading, et cetera), their spatial resolution does not allow to fully capture how phenomena at molecular scale influence system behavior. In this scenario, the "computational microscope" offered by simulations at atomic scale can shed light on the impact of molecular interactions on crucial parameters such as release rate and the response of the drug delivery device to external stimuli, providing insights that are difficult or impossible to obtain experimentally. Moreover, the new paradigm brought by nanomedicine further underlined the importance of such computational microscope to study the interactions between nanoparticles and biological components with an unprecedented level of detail. Such knowledge is a fundamental pillar to perform device engineering and to achieve efficient and safe formulations. After a brief theoretical background, this review aims at discussing the potential of molecular simulations for the rational design of drug delivery systems.
Collapse
Affiliation(s)
- Tommaso Casalini
- Department of Chemistry and Applied Bioscience, Institute for Chemical and Bioengineering, ETH Zurich, Vladimir-Prelog-Weg 1-5/10, Zürich 8093, Switzerland; Polymer Engineering Laboratory, Institute for Mechanical Engineering and Materials Technology, University of Applied Sciences and Arts of Southern Switzerland (SUPSI), Via la Santa 1, Lugano 6962, Switzerland.
| |
Collapse
|
41
|
Khanna V, Doherty MF, Peters B. Absolute chemical potentials for complex molecules in fluid phases: A centroid reference for predicting phase equilibria. J Chem Phys 2020; 153:214504. [PMID: 33291889 DOI: 10.1063/5.0025844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Solid-fluid phase equilibria are difficult to predict in simulations because bound degrees of freedom in the crystal phase must be converted to free translations and rotations in the fluid phase. Here, we avoid the solid-to-fluid transformation step by starting with chemical potentials for two reference systems, one for the fluid phase and one for the solid phase. For the solid, we start from the Einstein crystal and transform to the fully interacting molecular crystal. For the fluid phase, we introduce a new reference system, the "centroid," and then transform to gas phase molecules. We illustrate the new calculations by predicting the sublimation vapor pressure of succinic acid in the temperature range of 300 K-350 K.
Collapse
Affiliation(s)
- Vikram Khanna
- Department of Chemical Engineering, University of California Santa Barbara, Santa Barbara, California 93106, USA
| | - Michael F Doherty
- Department of Chemical Engineering, University of California Santa Barbara, Santa Barbara, California 93106, USA
| | - Baron Peters
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, USA
| |
Collapse
|