1
|
Chen Q, Zhou J, Sun R. Carbon Nanotube Loading Strategies for Peptide Drugs: Insights from Molecular Dynamics Simulations. LANGMUIR : THE ACS JOURNAL OF SURFACES AND COLLOIDS 2024; 40:13515-13526. [PMID: 38887887 DOI: 10.1021/acs.langmuir.4c00973] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/20/2024]
Abstract
Carbon nanotubes (CNTs) can be regarded as a potential platform for transmembrane drug delivery as many experimental works have demonstrated their capability to effectively transport bioactive molecules into living cells. Within this framework, the loading of a peptide drug onto either the interior or exterior of CNTs has gained considerable interest. This study aims to conduct a comprehensive comparison of these two loading methods. To this end, we performed molecular dynamics simulations and the umbrella sampling technique to investigate the interaction energy, conformational changes, and free energy changes of a model peptide drug containing α-helical structure interacting with the inner or outer walls of a 14.7-nm-long (20,20) CNT. Our finding reveals that, for a tube of such dimensions, it is thermodynamically more favorable for the peptide to be loaded onto the inner tube wall than the outer tube wall, primarily due to a larger free energy change for the former strategy. Conversely, unloading the drug from the tube interior poses greater challenges. Moreover, the tube's curvature plays an essential role in influencing the conformation of the adsorbed peptide. Despite the relatively weaker van der Waals interaction between the CNT exterior and the peptide, loading the peptide onto the exterior may induce significant conformational changes, particularly affecting the peptide's α-helix structure. In contrast, loading of the peptide on the CNT interior could maintain most of the α-helical content. CNTs do not typically attract specific peptide residues, with adsorbed groups primarily determined by the peptide's configurations and orientations. Finally, we offer a guideline for selecting an optimal loading strategy for CNT-based drug delivery.
Collapse
Affiliation(s)
- Qu Chen
- School of Biological and Chemical Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, People's Republic of China
| | - Jianping Zhou
- School of Biological and Chemical Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, People's Republic of China
| | - Rong Sun
- School of Biological and Chemical Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, People's Republic of China
| |
Collapse
|
2
|
Csizi KS, Steiner M, Reiher M. Nanoscale chemical reaction exploration with a quantum magnifying glass. Nat Commun 2024; 15:5320. [PMID: 38909029 PMCID: PMC11193806 DOI: 10.1038/s41467-024-49594-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 06/04/2024] [Indexed: 06/24/2024] Open
Abstract
Nanoscopic systems exhibit diverse molecular substructures by which they facilitate specific functions. Theoretical models of them, which aim at describing, understanding, and predicting these capabilities, are difficult to build. Viable quantum-classical hybrid models come with specific challenges regarding atomistic structure construction and quantum region selection. Moreover, if their dynamics are mapped onto a state-to-state mechanism such as a chemical reaction network, its exhaustive exploration will be impossible due to the combinatorial explosion of the reaction space. Here, we introduce a "quantum magnifying glass" that allows one to interactively manipulate nanoscale structures at the quantum level. The quantum magnifying glass seamlessly combines autonomous model parametrization, ultra-fast quantum mechanical calculations, and automated reaction exploration. It represents an approach to investigate complex reaction sequences in a physically consistent manner with unprecedented effortlessness in real time. We demonstrate these features for reactions in bio-macromolecules and metal-organic frameworks, diverse systems that highlight general applicability.
Collapse
Affiliation(s)
- Katja-Sophia Csizi
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093, Zurich, Switzerland
| | - Miguel Steiner
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093, Zurich, Switzerland
- ETH Zurich, NCCR Catalysis, Vladimir-Prelog-Weg 2, 8093, Zurich, Switzerland
| | - Markus Reiher
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093, Zurich, Switzerland.
- ETH Zurich, NCCR Catalysis, Vladimir-Prelog-Weg 2, 8093, Zurich, Switzerland.
| |
Collapse
|
3
|
Sun T, Korolev N, Minhas V, Mirzoev A, Lyubartsev AP, Nordenskiöld L. Multiscale modeling reveals the ion-mediated phase separation of nucleosome core particles. Biophys J 2024; 123:1414-1434. [PMID: 37915169 PMCID: PMC11163297 DOI: 10.1016/j.bpj.2023.10.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 10/05/2023] [Accepted: 10/27/2023] [Indexed: 11/03/2023] Open
Abstract
Due to the vast length scale inside the cell nucleus, multiscale models are required to understand chromatin folding, structure, and dynamics and how they regulate genomic activities such as DNA transcription, replication, and repair. We study the interactions and structure of condensed phases formed by the universal building block of chromatin, the nucleosome core particle (NCP), using bottom-up multiscale coarse-grained (CG) simulations with a model extracted from all-atom MD simulations. In the presence of the multivalent cations Mg(H2O)62+ or CoHex3+, we analyze the internal structures of the NCP aggregates and the contributions of histone tails and ions to the aggregation patterns. We then derive a "super" coarse-grained (SCG) NCP model to study the macroscopic scale phase separation of NCPs. The SCG simulations show the formation of NCP aggregates with Mg(H2O)62+ concentration-dependent densities and sizes. Variation of the CoHex3+ concentrations results in highly ordered lamellocolumnar and hexagonal columnar phases in agreement with experimental data. The results give detailed insights into nucleosome interactions and for understanding chromatin folding in the cell nucleus.
Collapse
Affiliation(s)
- Tiedong Sun
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Nikolay Korolev
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Vishal Minhas
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Alexander Mirzoev
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Alexander P Lyubartsev
- Department of Materials and Environmental Chemistry, Stockholm University, Stockholm, Sweden.
| | - Lars Nordenskiöld
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore.
| |
Collapse
|
4
|
Macaya L, González D, Vöhringer-Martinez E. Nonbonded Force Field Parameters from MBIS Partitioning of the Molecular Electron Density Improve Binding Affinity Predictions of the T4-Lysozyme Double Mutant. J Chem Inf Model 2024; 64:3269-3277. [PMID: 38546407 DOI: 10.1021/acs.jcim.3c01912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]
Abstract
The use of computer simulation for binding affinity prediction is growing in drug discovery. However, its wider use is constrained by the accuracy of the free energy calculations. The key sources of error are the force fields used to depict molecular interactions and insufficient sampling of the configurational space. To improve the quality of the force field, we developed a Python-based computational workflow. The workflow described here uses the minimal basis iterative stockholder (MBIS) method to determine atomic charges and Lennard-Jones parameters from the polarized molecular density. This is done by performing electronic structure calculations on various configurations of the ligand when it is both bound and unbound. In addition, we validated a simulation procedure that accounts for the protein and ligand degrees of freedom to precisely calculate binding free energies. This was achieved by comparing the self-adjusted mixture sampling and nonequilibrium thermodynamic integration methods using various protein and ligand conformations. The accuracy of predicting binding affinity is improved by using MBIS-derived force field parameters and a validated simulation procedure. This improvement surpasses the chemical precision for the eight aromatic ligands, reaching a root-mean-square error of 0.7 kcal/mol.
Collapse
Affiliation(s)
- Luis Macaya
- Departamento de Físico-Química, Facultad de Ciencias Químicas, Universidad de Concepción, 4070386 Concepción, Chile
| | - Duván González
- Departamento de Físico-Química, Facultad de Ciencias Químicas, Universidad de Concepción, 4070386 Concepción, Chile
| | - Esteban Vöhringer-Martinez
- Departamento de Físico-Química, Facultad de Ciencias Químicas, Universidad de Concepción, 4070386 Concepción, Chile
| |
Collapse
|
5
|
Frischknecht AL, Stevens MJ. Force Fields for High Concentration Aqueous KOH Solutions and Zincate Ions. J Phys Chem B 2024; 128:3475-3484. [PMID: 38547112 DOI: 10.1021/acs.jpcb.3c08302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2024]
Abstract
Motivated by increasing interest in electrochemical devices that include highly alkaline electrolytes, we investigated two force fields for potassium hydroxide (KOH) at high concentrations in water. The "FNB" model uses the SPC/E water model, while the "FHM" model uses the TIP4P/2005 water model. We also developed parameters to describe zincate ions in these solutions. The density and viscosity of KOH using the FHM model are in better agreement with experiment than the values from the FNB model. Comparing the properties of the zincate solutions to the available experimental data, we find that both force fields agree reasonably well, although the FHM parameters give a better prediction of the viscosity. The developed force field parameters can be used in future simulations of zincate/KOH solutions in combination with other species of interest.
Collapse
Affiliation(s)
- Amalie L Frischknecht
- Center for Integrated Nanotechnologies, Sandia National Laboratories, Albuquerque, New Mexico 87185, United States
| | - Mark J Stevens
- Center for Integrated Nanotechnologies, Sandia National Laboratories, Albuquerque, New Mexico 87185, United States
| |
Collapse
|
6
|
Champion C, Lehner M, Smith AA, Ferrage F, Bolik-Coulon N, Riniker S. Unraveling motion in proteins by combining NMR relaxometry and molecular dynamics simulations: A case study on ubiquitin. J Chem Phys 2024; 160:104105. [PMID: 38465679 DOI: 10.1063/5.0188416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Accepted: 02/20/2024] [Indexed: 03/12/2024] Open
Abstract
Nuclear magnetic resonance (NMR) relaxation experiments shine light onto the dynamics of molecular systems in the picosecond to millisecond timescales. As these methods cannot provide an atomically resolved view of the motion of atoms, functional groups, or domains giving rise to such signals, relaxation techniques have been combined with molecular dynamics (MD) simulations to obtain mechanistic descriptions and gain insights into the functional role of side chain or domain motion. In this work, we present a comparison of five computational methods that permit the joint analysis of MD simulations and NMR relaxation experiments. We discuss their relative strengths and areas of applicability and demonstrate how they may be utilized to interpret the dynamics in MD simulations with the small protein ubiquitin as a test system. We focus on the aliphatic side chains given the rigidity of the backbone of this protein. We find encouraging agreement between experiment, Markov state models built in the χ1/χ2 rotamer space of isoleucine residues, explicit rotamer jump models, and a decomposition of the motion using ROMANCE. These methods allow us to ascribe the dynamics to specific rotamer jumps. Simulations with eight different combinations of force field and water model highlight how the different metrics may be employed to pinpoint force field deficiencies. Furthermore, the presented comparison offers a perspective on the utility of NMR relaxation to serve as validation data for the prediction of kinetics by state-of-the-art biomolecular force fields.
Collapse
Affiliation(s)
- Candide Champion
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Marc Lehner
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Albert A Smith
- Institute for Medical Physics and Biophysics, Leipzig University, Härtelstrasse 16-18, 04107 Leipzig, Germany
| | - Fabien Ferrage
- Laboratoire des Biomolécules, LBM, Département de Chimie, École normale supérieure, PSL University, Sorbonne Université, CNRS, 75005 Paris, France
| | - Nicolas Bolik-Coulon
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Sereina Riniker
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| |
Collapse
|
7
|
Bittner JP, Smirnova I, Jakobtorweihen S. Investigating Biomolecules in Deep Eutectic Solvents with Molecular Dynamics Simulations: Current State, Challenges and Future Perspectives. Molecules 2024; 29:703. [PMID: 38338447 PMCID: PMC10856712 DOI: 10.3390/molecules29030703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 01/23/2024] [Accepted: 01/24/2024] [Indexed: 02/12/2024] Open
Abstract
Deep eutectic solvents (DESs) have recently gained increased attention for their potential in biotechnological applications. DESs are binary mixtures often consisting of a hydrogen bond acceptor and a hydrogen bond donor, which allows for tailoring their properties for particular applications. If produced from sustainable resources, they can provide a greener alternative to many traditional organic solvents for usage in various applications (e.g., as reaction environment, crystallization agent, or storage medium). To navigate this large design space, it is crucial to comprehend the behavior of biomolecules (e.g., enzymes, proteins, cofactors, and DNA) in DESs and the impact of their individual components. Molecular dynamics (MD) simulations offer a powerful tool for understanding thermodynamic and transport processes at the atomic level and offer insights into their fundamental phenomena, which may not be accessible through experiments. While the experimental investigation of DESs for various biotechnological applications is well progressed, a thorough investigation of biomolecules in DESs via MD simulations has only gained popularity in recent years. Within this work, we aim to provide an overview of the current state of modeling biomolecules with MD simulations in DESs and discuss future directions with a focus for optimizing the molecular simulations and increasing our fundamental knowledge.
Collapse
Affiliation(s)
- Jan Philipp Bittner
- Institute of Thermal Separation Processes, Hamburg University of Technology, Eißendorfer Straße 38, 21073 Hamburg, Germany
| | - Irina Smirnova
- Institute of Thermal Separation Processes, Hamburg University of Technology, Eißendorfer Straße 38, 21073 Hamburg, Germany
| | - Sven Jakobtorweihen
- Institute of Thermal Separation Processes, Hamburg University of Technology, Eißendorfer Straße 38, 21073 Hamburg, Germany
- Institute of Chemical Reaction Engineering, Hamburg University of Technology, Eißendorfer Straße 38, 21073 Hamburg, Germany
| |
Collapse
|
8
|
Jorgensen WL, Ghahremanpour MM, Saar A, Tirado-Rives J. OPLS/2020 Force Field for Unsaturated Hydrocarbons, Alcohols, and Ethers. J Phys Chem B 2024; 128:250-262. [PMID: 38127719 DOI: 10.1021/acs.jpcb.3c06602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
The OPLS all-atom force field was updated and applied to modeling unsaturated hydrocarbons, alcohols, and ethers. Testing has included gas-phase conformational energetics, properties of pure liquids, and free energies of hydration. Monte Carlo statistical mechanics (MC) calculations were used to model 60 liquids. In addition, a robust, automated procedure was devised to compute the free energies of hydration with high precision via free-energy perturbation (FEP) calculations using double annihilation. Testing has included larger molecules than in the past, and parameters are reported for the first time for some less common groups including alkynes, allenes, dienes, and acetals. The average errors in comparison with experimental data for the computed properties of the pure liquids were improved with the modified force field (OPLS/2020). For liquid densities and heats of vaporization, the average unsigned errors are 0.01 g/cm3 and 0.2 kcal/mol. The average error and signed error for free energies of hydration are both 1.2 kcal/mol. As noted before, this reflects a systematic overestimate of the hydrophobicity of organic molecules when the parametrization is done to minimize the errors for properties of pure liquids. Implications for the modeling of biomolecular systems with standard force fields are considered.
Collapse
Affiliation(s)
- William L Jorgensen
- Department of Chemistry, Yale University, New Haven, Connecticut 06520-8107, United States
| | | | - Anastasia Saar
- Department of Chemistry, Yale University, New Haven, Connecticut 06520-8107, United States
| | - Julian Tirado-Rives
- Department of Chemistry, Yale University, New Haven, Connecticut 06520-8107, United States
| |
Collapse
|
9
|
Gelžinytė E, Öeren M, Segall MD, Csányi G. Transferable Machine Learning Interatomic Potential for Bond Dissociation Energy Prediction of Drug-like Molecules. J Chem Theory Comput 2024; 20:164-177. [PMID: 38108269 PMCID: PMC10782450 DOI: 10.1021/acs.jctc.3c00710] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 11/30/2023] [Accepted: 11/30/2023] [Indexed: 12/19/2023]
Abstract
We present a transferable MACE interatomic potential that is applicable to open- and closed-shell drug-like molecules containing hydrogen, carbon, and oxygen atoms. Including an accurate description of radical species extends the scope of possible applications to bond dissociation energy (BDE) prediction, for example, in the context of cytochrome P450 (CYP) metabolism. The transferability of the MACE potential was validated on the COMP6 data set, containing only closed-shell molecules, where it reaches better accuracy than the readily available general ANI-2x potential. MACE achieves similar accuracy on two CYP metabolism-specific data sets, which include open- and closed-shell structures. This model enables us to calculate the aliphatic C-H BDE, which allows us to compare reaction energies of hydrogen abstraction, which is the rate-limiting step of the aliphatic hydroxylation reaction catalyzed by CYPs. On the "CYP 3A4" data set, MACE achieves a BDE RMSE of 1.37 kcal/mol and better prediction of BDE ranks than alternatives: the semiempirical AM1 and GFN2-xTB methods and the ALFABET model that directly predicts bond dissociation enthalpies. Finally, we highlight the smoothness of the MACE potential over paths of sp3C-H bond elongation and show that a minimal extension is enough for the MACE model to start finding reasonable minimum energy paths of methoxy radical-mediated hydrogen abstraction. Altogether, this work lays the ground for further extensions of scope in terms of chemical elements, (CYP-mediated) reaction classes and modeling the full reaction paths, not only BDEs.
Collapse
Affiliation(s)
- Elena Gelžinytė
- Engineering
Laboratory, University of Cambridge, Trumpington Street, Cambridge CB2 1PZ, U.K.
| | - Mario Öeren
- Optibrium
Limited, Cambridge Innovation Park, Denny End Road, Cambridge CB25 9GL, U.K.
| | - Matthew D. Segall
- Optibrium
Limited, Cambridge Innovation Park, Denny End Road, Cambridge CB25 9GL, U.K.
| | - Gábor Csányi
- Engineering
Laboratory, University of Cambridge, Trumpington Street, Cambridge CB2 1PZ, U.K.
| |
Collapse
|
10
|
Kehrein J, Sotriffer C. Molecular Dynamics Simulations for Rationalizing Polymer Bioconjugation Strategies: Challenges, Recent Developments, and Future Opportunities. ACS Biomater Sci Eng 2024; 10:51-74. [PMID: 37466304 DOI: 10.1021/acsbiomaterials.3c00636] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/20/2023]
Abstract
The covalent modification of proteins with polymers is a well-established method for improving the pharmacokinetic properties of therapeutically valuable biologics. The conjugated polymer chains of the resulting hybrid represent highly flexible macromolecular structures. As the dynamics of such systems remain rather elusive for established experimental techniques from the field of protein structure elucidation, molecular dynamics simulations have proven as a valuable tool for studying such conjugates at an atomistic level, thereby complementing experimental studies. With a focus on new developments, this review aims to provide researchers from the polymer bioconjugation field with a concise and up to date overview of such approaches. After introducing basic principles of molecular dynamics simulations, as well as methods for and potential pitfalls in modeling bioconjugates, the review illustrates how these computational techniques have contributed to the understanding of bioconjugates and bioconjugation strategies in the recent past and how they may lead to a more rational design of novel bioconjugates in the future.
Collapse
Affiliation(s)
- Josef Kehrein
- Institute of Pharmacy and Food Chemistry, University of Würzburg, Würzburg 97074, Germany
| | - Christoph Sotriffer
- Institute of Pharmacy and Food Chemistry, University of Würzburg, Würzburg 97074, Germany
| |
Collapse
|
11
|
Demir Gİ, Tekin A. NICE-FF: A non-empirical, intermolecular, consistent, and extensible force field for nucleic acids and beyond. J Chem Phys 2023; 159:244117. [PMID: 38153156 DOI: 10.1063/5.0176641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 12/04/2023] [Indexed: 12/29/2023] Open
Abstract
A new non-empirical ab initio intermolecular force field (NICE-FF in buffered 14-7 potential form) has been developed for nucleic acids and beyond based on the dimer interaction energies (IEs) calculated at the spin component scaled-MI-second order Møller-Plesset perturbation theory. A fully automatic framework has been implemented for this purpose, capable of generating well-polished computational grids, performing the necessary ab initio calculations, conducting machine learning (ML) assisted force field (FF) parametrization, and extending existing FF parameters by incorporating new atom types. For the ML-assisted parametrization of NICE-FF, interaction energies of ∼18 000 dimer geometries (with IE < 0) were used, and the best fit gave a mean square deviation of about 0.46 kcal/mol. During this parametrization, atom types apparent in four deoxyribonucleic acid (DNA) bases have been first trained using the generated DNA base datasets. Both uracil and hypoxanthine, which contain the same atom types found in DNA bases, have been considered as test molecules. Three new atom types have been added to the DNA atom types by using IE datasets of both pyrazinamide and 9-methylhypoxanthine. Finally, the last test molecule, theophylline, has been selected, which contains already-fitted atom-type parameters. The performance of NICE-FF has been investigated on the S22 dataset, and it has been found that NICE-FF outperforms the well-known FFs by generating the most consistent IEs with the high-level ab initio ones. Moreover, NICE-FF has been integrated into our in-house developed crystal structure prediction (CSP) tool [called FFCASP (Fast and Flexible CrystAl Structure Predictor)], aiming to find the experimental crystal structures of all considered molecules. CSPs, which were performed up to 4 formula units (Z), resulted in NICE-FF being able to locate almost all the known experimental crystal structures with sufficiently low RMSD20 values to provide good starting points for density functional theory optimizations.
Collapse
Affiliation(s)
- Gözde İniş Demir
- Informatics Institute, Istanbul Technical University, 34469 Maslak, Istanbul, Türkiye
| | - Adem Tekin
- Informatics Institute, Istanbul Technical University, 34469 Maslak, Istanbul, Türkiye
- Research Institute for Fundamental Sciences (TÜBİTAK-TBAE), Kocaeli, Türkiye
| |
Collapse
|
12
|
Thürlemann M, Riniker S. Hybrid classical/machine-learning force fields for the accurate description of molecular condensed-phase systems. Chem Sci 2023; 14:12661-12675. [PMID: 38020395 PMCID: PMC10646964 DOI: 10.1039/d3sc04317g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 10/24/2023] [Indexed: 12/01/2023] Open
Abstract
Electronic structure methods offer in principle accurate predictions of molecular properties, however, their applicability is limited by computational costs. Empirical methods are cheaper, but come with inherent approximations and are dependent on the quality and quantity of training data. The rise of machine learning (ML) force fields (FFs) exacerbates limitations related to training data even further, especially for condensed-phase systems for which the generation of large and high-quality training datasets is difficult. Here, we propose a hybrid ML/classical FF model that is parametrized exclusively on high-quality ab initio data of dimers and monomers in vacuum but is transferable to condensed-phase systems. The proposed hybrid model combines our previous ML-parametrized classical model with ML corrections for situations where classical approximations break down, thus combining the robustness and efficiency of classical FFs with the flexibility of ML. Extensive validation on benchmarking datasets and experimental condensed-phase data, including organic liquids and small-molecule crystal structures, showcases how the proposed approach may promote FF development and unlock the full potential of classical FFs.
Collapse
Affiliation(s)
- Moritz Thürlemann
- Department of Chemistry and Applied Biosciences, ETH Zürich Vladimir-Prelog-Weg 2 Zürich 8093 Switzerland
| | - Sereina Riniker
- Department of Chemistry and Applied Biosciences, ETH Zürich Vladimir-Prelog-Weg 2 Zürich 8093 Switzerland
| |
Collapse
|
13
|
Lehner MT, Katzberger P, Maeder N, Schiebroek CC, Teetz J, Landrum GA, Riniker S. DASH: Dynamic Attention-Based Substructure Hierarchy for Partial Charge Assignment. J Chem Inf Model 2023; 63:6014-6028. [PMID: 37738206 PMCID: PMC10565818 DOI: 10.1021/acs.jcim.3c00800] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Indexed: 09/24/2023]
Abstract
We present a robust and computationally efficient approach for assigning partial charges of atoms in molecules. The method is based on a hierarchical tree constructed from attention values extracted from a graph neural network (GNN), which was trained to predict atomic partial charges from accurate quantum-mechanical (QM) calculations. The resulting dynamic attention-based substructure hierarchy (DASH) approach provides fast assignment of partial charges with the same accuracy as the GNN itself, is software-independent, and can easily be integrated in existing parametrization pipelines, as shown for the Open force field (OpenFF). The implementation of the DASH workflow, the final DASH tree, and the training set are available as open source/open data from public repositories.
Collapse
Affiliation(s)
| | | | - Niels Maeder
- Department of Chemistry and
Applied Biosciences, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Carl C.G. Schiebroek
- Department of Chemistry and
Applied Biosciences, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Jakob Teetz
- Department of Chemistry and
Applied Biosciences, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Gregory A. Landrum
- Department of Chemistry and
Applied Biosciences, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Sereina Riniker
- Department of Chemistry and
Applied Biosciences, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| |
Collapse
|
14
|
Zhu Q, Wu Y, Zhao S, Cieplak P, Duan Y, Luo R. Streamlining and Optimizing Strategies of Electrostatic Parameterization. J Chem Theory Comput 2023; 19:6353-6365. [PMID: 37676646 PMCID: PMC10530599 DOI: 10.1021/acs.jctc.3c00659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
Accurate characterization of electrostatic interactions is crucial in molecular simulation. Various methods and programs have been developed to obtain electrostatic parameters for additive or polarizable models to replicate electrostatic properties obtained from experimental measurements or theoretical calculations. Electrostatic potentials (ESPs), a set of physically well-defined observables from quantum mechanical (QM) calculations, are well suited for optimization efforts due to the ease of collecting a large amount of conformation-dependent data. However, a reliable set of QM ESP computed at an appropriate level of theory and atomic basis set is necessary. In addition, despite the recent development of the PyRESP program for electrostatic parameterizations of induced dipole-polarizable models, the time-consuming and error-prone input file preparation process has limited the widespread use of these protocols. This work aims to comprehensively evaluate the quality of QM ESPs derived by eight methods, including wave function methods such as Hartree-Fock (HF), second-order Møller-Plesset (MP2), and coupled cluster-singles and doubles (CCSD), as well as five hybrid density functional theory (DFT) methods, used in conjunction with 13 different basis sets. The highest theory levels CCSD/aug-cc-pV5Z (a5z) and MP2/aug-cc-pV5Z (a5z) were selected as benchmark data over two homemade data sets. The results show that the hybrid DFT method, ωB97X-D, combined with the aug-cc-pVTZ (a3z) basis set, performs well in reproducing ESPs while taking both accuracy and efficiency into consideration. Moreover, a flexible and user-friendly program called PyRESP_GEN was developed to streamline input file preparation. The restraining strengths, along with strategies for polarizable Gaussian multipole (pGM) model parameterizations, were also optimized. These findings and the program presented in this work facilitate the development and application of induced dipole-polarizable models, such as pGM models, for molecular simulations of both chemical and biological significance.
Collapse
Affiliation(s)
- Qiang Zhu
- Department of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| | - Yongxian Wu
- Department of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| | - Shiji Zhao
- Nurix Therapeutics, Inc., 1700 Owens St, Suite 205, San Francisco, California 94158, United States
| | - Piotr Cieplak
- SBP Medical Discovery Institute, La Jolla, California 92037, United States
| | - Yong Duan
- UC Davis Genome Center and Department of Biomedical Engineering, University of California, Davis, Davis, California 95616, United States
| | - Ray Luo
- Department of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| |
Collapse
|
15
|
Horton JT, Boothroyd S, Behara PK, Mobley DL, Cole DJ. A transferable double exponential potential for condensed phase simulations of small molecules. DIGITAL DISCOVERY 2023; 2:1178-1187. [PMID: 38013814 PMCID: PMC10408570 DOI: 10.1039/d3dd00070b] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 07/07/2023] [Indexed: 11/29/2023]
Abstract
The Lennard-Jones potential is the most widely-used function for the description of non-bonded interactions in transferable force fields for the condensed phase. This is not because it has an optimal functional form, but rather it is a legacy resulting from when computational expense was a major consideration and this potential was particularly convenient numerically. At present, it persists because the effort that would be required to re-write molecular modelling software and train new force fields has, until now, been prohibitive. Here, we present Smirnoff-plugins as a flexible framework to extend the Open Force Field software stack to allow custom force field functional forms. We deploy Smirnoff-plugins with the automated Open Force Field infrastructure to train a transferable, small molecule force field based on the recently-proposed double exponential functional form, on over 1000 experimental condensed phase properties. Extensive testing of the resulting force field shows improvements in transfer free energies, with acceptable conformational energetics, run times and convergence properties compared to state-of-the-art Lennard-Jones based force fields.
Collapse
Affiliation(s)
- Joshua T Horton
- School of Natural and Environmental Sciences, Newcastle University Newcastle upon Tyne NE1 7RU UK
| | | | - Pavan Kumar Behara
- Department of Pharmaceutical Sciences, University of California Irvine California 92697 USA
| | - David L Mobley
- Department of Pharmaceutical Sciences, University of California Irvine California 92697 USA
- Department of Chemistry, University of California Irvine California 92697 USA
| | - Daniel J Cole
- School of Natural and Environmental Sciences, Newcastle University Newcastle upon Tyne NE1 7RU UK
| |
Collapse
|
16
|
Kovács DP, Batatia I, Arany ES, Csányi G. Evaluation of the MACE force field architecture: From medicinal chemistry to materials science. J Chem Phys 2023; 159:044118. [PMID: 37522405 DOI: 10.1063/5.0155322] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Accepted: 06/29/2023] [Indexed: 08/01/2023] Open
Abstract
The MACE architecture represents the state of the art in the field of machine learning force fields for a variety of in-domain, extrapolation, and low-data regime tasks. In this paper, we further evaluate MACE by fitting models for published benchmark datasets. We show that MACE generally outperforms alternatives for a wide range of systems, from amorphous carbon, universal materials modeling, and general small molecule organic chemistry to large molecules and liquid water. We demonstrate the capabilities of the model on tasks ranging from constrained geometry optimization to molecular dynamics simulations and find excellent performance across all tested domains. We show that MACE is very data efficient and can reproduce experimental molecular vibrational spectra when trained on as few as 50 randomly selected reference configurations. We further demonstrate that the strictly local atom-centered model is sufficient for such tasks even in the case of large molecules and weakly interacting molecular assemblies.
Collapse
Affiliation(s)
- Dávid Péter Kovács
- Engineering Laboratory, University of Cambridge, Cambridge CB2 1PZ, United Kingdom
| | - Ilyes Batatia
- Engineering Laboratory, University of Cambridge, Cambridge CB2 1PZ, United Kingdom
- ENS Paris-Saclay, Université Paris-Saclay, 91190 Gif-sur-Yvette, France
| | - Eszter Sára Arany
- School of Clinical Medicine, University of Cambridge, Cambridge CB2 0SP, United Kingdom
| | - Gábor Csányi
- Engineering Laboratory, University of Cambridge, Cambridge CB2 1PZ, United Kingdom
| |
Collapse
|
17
|
Zhang P, Yang W. Toward a general neural network force field for protein simulations: Refining the intramolecular interaction in protein. J Chem Phys 2023; 159:024118. [PMID: 37431910 PMCID: PMC10481389 DOI: 10.1063/5.0142280] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 06/22/2023] [Indexed: 07/12/2023] Open
Abstract
Molecular dynamics (MD) is an extremely powerful, highly effective, and widely used approach to understanding the nature of chemical processes in atomic details for proteins. The accuracy of results from MD simulations is highly dependent on force fields. Currently, molecular mechanical (MM) force fields are mainly utilized in MD simulations because of their low computational cost. Quantum mechanical (QM) calculation has high accuracy, but it is exceedingly time consuming for protein simulations. Machine learning (ML) provides the capability for generating accurate potential at the QM level without increasing much computational effort for specific systems that can be studied at the QM level. However, the construction of general machine learned force fields, needed for broad applications and large and complex systems, is still challenging. Here, general and transferable neural network (NN) force fields based on CHARMM force fields, named CHARMM-NN, are constructed for proteins by training NN models on 27 fragments partitioned from the residue-based systematic molecular fragmentation (rSMF) method. The NN for each fragment is based on atom types and uses new input features that are similar to MM inputs, including bonds, angles, dihedrals, and non-bonded terms, which enhance the compatibility of CHARMM-NN to MM MD and enable the implementation of CHARMM-NN force fields in different MD programs. While the main part of the energy of the protein is based on rSMF and NN, the nonbonded interactions between the fragments and with water are taken from the CHARMM force field through mechanical embedding. The validations of the method for dipeptides on geometric data, relative potential energies, and structural reorganization energies demonstrate that the CHARMM-NN local minima on the potential energy surface are very accurate approximations to QM, showing the success of CHARMM-NN for bonded interactions. However, the MD simulations on peptides and proteins indicate that more accurate methods to represent protein-water interactions in fragments and non-bonded interactions between fragments should be considered in the future improvement of CHARMM-NN, which can increase the accuracy of approximation beyond the current mechanical embedding QM/MM level.
Collapse
Affiliation(s)
- Pan Zhang
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| | - Weitao Yang
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| |
Collapse
|
18
|
Doyle M, Bhowmick A, Wych DC, Lassalle L, Simon PS, Holton J, Sauter NK, Yachandra VK, Kern JF, Yano J, Wall ME. Water Networks in Photosystem II Using Crystalline Molecular Dynamics Simulations and Room-Temperature XFEL Serial Crystallography. J Am Chem Soc 2023; 145:14621-14635. [PMID: 37369071 PMCID: PMC10347547 DOI: 10.1021/jacs.3c01412] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Indexed: 06/29/2023]
Abstract
Structural dynamics of water and its hydrogen-bonding networks play an important role in enzyme function via the transport of protons, ions, and substrates. To gain insights into these mechanisms in the water oxidation reaction in Photosystem II (PS II), we have performed crystalline molecular dynamics (MD) simulations of the dark-stable S1 state. Our MD model consists of a full unit cell with 8 PS II monomers in explicit solvent (861 894 atoms), enabling us to compute the simulated crystalline electron density and to compare it directly with the experimental density from serial femtosecond X-ray crystallography under physiological temperature collected at X-ray free electron lasers (XFELs). The MD density reproduced the experimental density and water positions with high fidelity. The detailed dynamics in the simulations provided insights into the mobility of water molecules in the channels beyond what can be interpreted from experimental B-factors and electron densities alone. In particular, the simulations revealed fast, coordinated exchange of waters at sites where the density is strong, and water transport across the bottleneck region of the channels where the density is weak. By computing MD hydrogen and oxygen maps separately, we developed a novel Map-based Acceptor-Donor Identification (MADI) technique that yields information which helps to infer hydrogen-bond directionality and strength. The MADI analysis revealed a series of hydrogen-bond wires emanating from the Mn cluster through the Cl1 and O4 channels; such wires might provide pathways for proton transfer during the reaction cycle of PS II. Our simulations provide an atomistic picture of the dynamics of water and hydrogen-bonding networks in PS II, with implications for the specific role of each channel in the water oxidation reaction.
Collapse
Affiliation(s)
- Margaret
D. Doyle
- Molecular
Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Asmit Bhowmick
- Molecular
Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - David C. Wych
- Computer,
Computational and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
- Center
for Non-linear Studies, Los Alamos National
Laboratory, Los Alamos, New Mexico 87545, United States
| | - Louise Lassalle
- Molecular
Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Philipp S. Simon
- Molecular
Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - James Holton
- Molecular
Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
- Department
of Biochemistry and Biophysics, University
of California, San Francisco, San
Francisco, California 94158, United States
- SSRL, SLAC National Accelerator Laboratory, Menlo Park, California 94025, United States
| | - Nicholas K. Sauter
- Molecular
Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Vittal K. Yachandra
- Molecular
Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Jan F. Kern
- Molecular
Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Junko Yano
- Molecular
Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Michael E. Wall
- Computer,
Computational and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| |
Collapse
|
19
|
Nazarychev VM, Lyulin SV. The Effect of Mechanical Elongation on the Thermal Conductivity of Amorphous and Semicrystalline Thermoplastic Polyimides: Atomistic Simulations. Polymers (Basel) 2023; 15:2926. [PMID: 37447571 DOI: 10.3390/polym15132926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 06/28/2023] [Accepted: 06/29/2023] [Indexed: 07/15/2023] Open
Abstract
Over the past few decades, the enhancement of polymer thermal conductivity has attracted considerable attention in the scientific community due to its potential for the development of new thermal interface materials (TIM) for both electronic and electrical devices. The mechanical elongation of polymers may be considered as an appropriate tool for the improvement of heat transport through polymers without the necessary addition of nanofillers. Polyimides (PIs) in particular have some of the best thermal, dielectric, and mechanical properties, as well as radiation and chemical resistance. They can therefore be used as polymer binders in TIM without compromising their dielectric properties. In the present study, the effects of uniaxial deformation on the thermal conductivity of thermoplastic PIs were examined for the first time using atomistic computer simulations. We believe that this approach will be important for the development of thermal interface materials based on thermoplastic PIs with improved thermal conductivity properties. Current research has focused on the analysis of three thermoplastic PIs: two semicrystalline, namely BPDA-P3 and R-BAPB; and one amorphous, ULTEMTM. To evaluate the impact of uniaxial deformation on the thermal conductivity, samples of these PIs were deformed up to 200% at a temperature of 600 K, slightly above the melting temperatures of BPDA-P3 and R-BAPB. The thermal conductivity coefficients of these PIs increased in the glassy state and above the glass transition point. Notably, some improvement in the thermal conductivity of the amorphous polyimide ULTEMTM was achieved. Our study demonstrates that the thermal conductivity coefficient is anisotropic in different directions with respect to the deformation axis and shows a significant increase in both semicrystalline and amorphous PIs in the direction parallel to the deformation. Both types of structural ordering (self-ordering of semicrystalline PI and mechanical elongation) led to the same significant increase in thermal conductivity coefficient.
Collapse
Affiliation(s)
- Victor M Nazarychev
- Institute of Macromolecular Compounds, Russian Academy of Sciences, Bolshoi Prospect V.O. 31, 199004 St. Petersburg, Russia
| | - Sergey V Lyulin
- Institute of Macromolecular Compounds, Russian Academy of Sciences, Bolshoi Prospect V.O. 31, 199004 St. Petersburg, Russia
| |
Collapse
|
20
|
Halpern A, Bartsch LR, Ibrahim K, Harrison SA, Ahn M, Christodoulou J, Lane N. Biophysical Interactions Underpin the Emergence of Information in the Genetic Code. Life (Basel) 2023; 13:life13051129. [PMID: 37240774 DOI: 10.3390/life13051129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 04/25/2023] [Accepted: 04/30/2023] [Indexed: 05/28/2023] Open
Abstract
The genetic code conceals a 'code within the codons', which hints at biophysical interactions between amino acids and their cognate nucleotides. Yet, research over decades has failed to corroborate systematic biophysical interactions across the code. Using molecular dynamics simulations and NMR, we have analysed interactions between the 20 standard proteinogenic amino acids and 4 RNA mononucleotides in 3 charge states. Our simulations show that 50% of amino acids bind best with their anticodonic middle base in the -1 charge state common to the backbone of RNA, while 95% of amino acids interact most strongly with at least 1 of their codonic or anticodonic bases. Preference for the cognate anticodonic middle base was greater than 99% of randomised assignments. We verify a selection of our results using NMR, and highlight challenges with both techniques for interrogating large numbers of weak interactions. Finally, we extend our simulations to a range of amino acids and dinucleotides, and corroborate similar preferences for cognate nucleotides. Despite some discrepancies between the predicted patterns and those observed in biology, the existence of weak stereochemical interactions means that random RNA sequences could template non-random peptides. This offers a compelling explanation for the emergence of genetic information in biology.
Collapse
Affiliation(s)
- Aaron Halpern
- UCL Centre for Life's Origins and Evolution (CLOE), Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
| | - Lilly R Bartsch
- UCL Centre for Life's Origins and Evolution (CLOE), Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
| | - Kaan Ibrahim
- UCL Centre for Life's Origins and Evolution (CLOE), Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
| | - Stuart A Harrison
- UCL Centre for Life's Origins and Evolution (CLOE), Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
| | - Minkoo Ahn
- Department of Structural and Molecular Biology, Institute of Structural and Molecular Biology (ISMB), University College London, London WC1E 6BT, UK
| | - John Christodoulou
- Department of Structural and Molecular Biology, Institute of Structural and Molecular Biology (ISMB), University College London, London WC1E 6BT, UK
| | - Nick Lane
- UCL Centre for Life's Origins and Evolution (CLOE), Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
| |
Collapse
|
21
|
Tillotson MJ, Diamantonis NI, Buda C, Bolton LW, Müller EA. Molecular modelling of the thermophysical properties of fluids: expectations, limitations, gaps and opportunities. Phys Chem Chem Phys 2023; 25:12607-12628. [PMID: 37114325 DOI: 10.1039/d2cp05423j] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
This manuscript provides an overview of the current state of the art in terms of the molecular modelling of the thermophysical properties of fluids. It is intended to manage the expectations and serve as guidance to practising physical chemists, chemical physicists and engineers in terms of the scope and accuracy of the more commonly available intermolecular potentials along with the peculiarities of the software and methods employed in molecular simulations while providing insights on the gaps and opportunities available in this field. The discussion is focused around case studies which showcase both the precision and the limitations of frequently used workflows.
Collapse
Affiliation(s)
- Marcus J Tillotson
- Department of Chemical Engineering, Imperial College London, London, UK.
| | | | | | | | - Erich A Müller
- Department of Chemical Engineering, Imperial College London, London, UK.
| |
Collapse
|
22
|
Pinto ÉSM, Krause MJ, Dorn M, Feltes BC. The nucleotide excision repair proteins through the lens of molecular dynamics simulations. DNA Repair (Amst) 2023; 127:103510. [PMID: 37148846 DOI: 10.1016/j.dnarep.2023.103510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 04/07/2023] [Accepted: 04/23/2023] [Indexed: 05/08/2023]
Abstract
Mutations that affect the proteins responsible for the nucleotide excision repair (NER) pathway can lead to diseases such as xeroderma pigmentosum, trichothiodystrophy, Cockayne syndrome, and Cerebro-oculo-facio-skeletal syndrome. Hence, understanding their molecular behavior is needed to elucidate these diseases' phenotypes and how the NER pathway is organized and coordinated. Molecular dynamics techniques enable the study of different protein conformations, adaptable to any research question, shedding light on the dynamics of biomolecules. However, as important as they are, molecular dynamics studies focused on DNA repair pathways are still becoming more widespread. Currently, there are no review articles compiling the advancements made in molecular dynamics approaches applied to NER and discussing: (i) how this technique is currently employed in the field of DNA repair, focusing on NER proteins; (ii) which technical setups are being employed, their strengths and limitations; (iii) which insights or information are they providing to understand the NER pathway or NER-associated proteins; (iv) which open questions would be suited for this technique to answer; and (v) where can we go from here. These questions become even more crucial considering the numerous 3D structures published regarding the NER pathway's proteins in recent years. In this work, we tackle each one of these questions, revising and critically discussing the results published in the context of the NER pathway.
Collapse
Affiliation(s)
| | - Mathias J Krause
- Institute for Applied and Numerical Mathematics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Márcio Dorn
- Center for Biotechnology, Federal University of Rio Grande do Sul, RS, Brazil; Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil; National Institute of Science and Technology - Forensic Science, Porto Alegre, RS, Brazil
| | - Bruno César Feltes
- Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| |
Collapse
|
23
|
Exploring the Potential of Black Soldier Fly Larval Proteins as Bioactive Peptide Sources through in Silico Gastrointestinal Proteolysis: A Cheminformatic Investigation. Catalysts 2023. [DOI: 10.3390/catal13030605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2023] Open
Abstract
Despite their potential as a protein source for human consumption, the health benefits of black soldier fly larvae (BSFL) proteins following human gastrointestinal (GI) digestion are poorly understood. This computational study explored the potential of BSFL proteins to release health-promoting peptides after human GI digestion. Twenty-six proteins were virtually proteolyzed with GI proteases. The resultant peptides were screened for high GI absorption and non-toxicity. Shortlisted peptides were searched against the BIOPEP-UWM and Scopus databases to identify their bioactivities. The potential of the peptides as inhibitors of myeloperoxidase (MPO), NADPH oxidase (NOX), and xanthine oxidase (XO), as well as a disruptor of Keap1–Nrf2 protein–protein interaction, were predicted using molecular docking and dynamics simulation. Our results revealed that about 95% of the 5218 fragments generated from the proteolysis of BSFL proteins came from muscle proteins. Dipeptides comprised the largest group (about 25%) of fragments arising from each muscular protein. Screening of 1994 di- and tripeptides using SwissADME and STopTox tools revealed 65 unique sequences with high GI absorption and non-toxicity. A search of the databases identified 16 antioxidant peptides, 14 anti-angiotensin-converting enzyme peptides, and 17 anti-dipeptidyl peptidase IV peptides among these sequences. Results from molecular docking and dynamic simulation suggest that the dipeptide DF has the potential to inhibit Keap1–Nrf2 interaction and interact with MPO within a short time frame, whereas the dipeptide TF shows promise as an XO inhibitor. BSFL peptides were likely weak NOX inhibitors. Our in silico results suggest that upon GI digestion, BSFL proteins may yield high-GI-absorbed and non-toxic peptides with potential health benefits. This study is the first to investigate the bioactivity of peptides liberated from BSFL proteins following human GI digestion. Our findings provide a basis for further investigations into the potential use of BSFL proteins as a functional food ingredient with significant health benefits.
Collapse
|
24
|
Oliveira MP, Hünenberger PH. Influence of the Lennard-Jones Combination Rules on the Simulated Properties of Organic Liquids at Optimal Force-Field Parametrization. J Chem Theory Comput 2023; 19:2048-2063. [PMID: 36920838 PMCID: PMC10100539 DOI: 10.1021/acs.jctc.2c01170] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]
Abstract
We recently introduced the CombiFF scheme [Oliveira et al., J. Chem. Theory Comput. 2020, 16, 7525], an approach for the automated refinement of force-field parameters against experimental condensed-phase data for large compound families. Using this scheme, once the time-consuming task of target-data selection and curation has been performed, the force-field optimization itself is both straightforward and fast. As a result, CombiFF provides an ideal framework for evaluating the influence of functional-form decisions on the accuracy of a force field at an optimal level of parametrization. We already used this approach to assess the effect of using an all-atom representation compared to united-atom representations in the force field [Oliveira et al., J. Chem. Theory Comput. 2022, 18, 6757]. Here, CombiFF is applied to assess the effect of three Lennard-Jones combination rules, geometric-mean (GM), Lorentz-Berthelot (LB), or Waldman-Hagler (WH), on the simulated properties of organic liquids. The comparison is performed in terms of the experimental liquid density ρliq, vaporization enthalpy ΔHvap, surface-tension coefficient γ, static relative dielectric permittivity ϵ, and self-diffusion coefficient D. The calibrations of the three force-field variants are carried out independently against 2044 experimental values for ρliq, and ΔHvap concerning 1516 compounds. The resulting root-mean-square deviations from experiment are 30.0, 26.9, and 36.7 kg m-3 for ρliq and 2.8, 2.8, and 2.9 kJ mol-1 for ΔHvap, when applying the GM, LB, and WH combination rules, respectively. In terms of these (and the other) properties, the three combination rules perform comparatively well, with the GM and LB results being more similar to each other and slightly more accurate compared to experiment. In contrast, the use of distinct combination rules for the parameter calibration and property calculation leads to much larger errors.
Collapse
Affiliation(s)
- Marina P Oliveira
- Laboratorium für Physikalische Chemie, ETH Zürich, CH-8093 Zürich, Switzerland
| | | |
Collapse
|
25
|
Agles AA, Bourg IC. Structure-Thermodynamic Relationship of a Polysaccharide Gel (Alginate) as a Function of Water Content and Counterion Type (Na vs Ca). J Phys Chem B 2023; 127:1828-1841. [PMID: 36791328 PMCID: PMC10159261 DOI: 10.1021/acs.jpcb.2c07129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
Biofilms are the predominant mode of microbial life on Earth, and so a deep understanding of microbial communities─and their impacts on environmental processes─requires a firm understanding of biofilm properties. Because of the importance of biofilms to their microbial inhabitants, microbes have evolved different ways of engineering and reconfiguring the matrix of extracellular polymeric substances (EPS) that constitute the main non-living component of biofilms. This ability makes it difficult to distinguish between the biotic and abiotic origins of biofilm properties. An important route toward establishing this distinction has been the study of simplified models of the EPS matrix. This study builds on such efforts by using atomistic simulations to predict the nanoscale (≤10 nm scale) structure of a model EPS matrix and the sensitivity of this structure to interpolymer interactions and water content. To accomplish this, we use replica exchange molecular dynamics (REMD) simulations to generate all-atom configurations of ten 3.4 kDa alginate polymers at a range of water contents and Ca-Na ratios. Simulated systems are solvated with explicitly modeled water molecules, which allows us to capture the discrete structure of the hydrating water and to examine the thermodynamic stability of water in the gels as they are progressively dehydrated. Our primary findings are that (i) the structure of the hydrogels is highly sensitive to the identity of the charge-compensating cations, (ii) the thermodynamics of water within the gels (specific enthalpy and free energy) are, surprisingly, only weakly sensitive to cation identity, and (iii) predictions of the differential enthalpy and free energy of hydration include a short-ranged enthalpic term that promotes hydration and a longer-ranged (presumably entropic) term that promotes dehydration, where short and long ranges refer to distances shorter or longer than ∼0.6 nm between alginate strands.
Collapse
Affiliation(s)
- Avery A Agles
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - Ian C Bourg
- Department of Civil and Environmental Engineering and High Meadows Environmental Institute, Princeton University, Princeton, New Jersey 08544, United States
| |
Collapse
|
26
|
Seo B, Savoie BM. Evidence That Less Can Be More for Transferable Force Fields. J Chem Inf Model 2023; 63:1188-1195. [PMID: 36744744 DOI: 10.1021/acs.jcim.2c01163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Graph-based parameter assignment has been the basis for developing transferable force fields for molecular dynamics simulations for decades. Nevertheless, transferable force fields vary in how specifically terms are defined with respect to the molecular graph and the procedures for generating parametrization data. More-specific force-field terms increase the complexity of the force field, theoretically increasing accuracy but also increasing training data requirements. In contrast, less-specific force fields can be reused across larger regions of chemical space, theoretically reducing accuracy but also reducing the number of parameters and training data requirements. Here, the tradeoffs between force-field specificity and accuracy are quantified by parametrizing three new sets of force fields with varying levels of graph specificity, using a shared procedure for generating training data. These force fields are benchmarked for their ability to reproduce the structural features and liquid properties of 87 organic molecules at 146 distinct state points. The overall accuracy for properties that were directly trained on rapidly saturates as the graph specificity of the force-field increases. From this, we conclude there is at best a marginal benefit of using less transferable and more complex force fields with common sources of quantum-chemically derived training data. When looking at properties unseen during training, there is some evidence that the more-complex force fields even perform slightly worse. These results are rationalized by the fortuitous regularization of force fields based on less-specific and more-transferable atom types. Both the saturation in the accuracy of training properties and the marginally worse performance on off-target properties fundamentally contradict the expectation that bespoke force fields are generally more accurate, given their larger number of parameters, and suggests that increasing force-field complexity should be carefully justified against performance gains and balanced against available training data.
Collapse
Affiliation(s)
- Bumjoon Seo
- Davidson School of Chemical Engineering, Purdue University, West Lafayette, Indiana47906, United States
| | - Brett M Savoie
- Davidson School of Chemical Engineering, Purdue University, West Lafayette, Indiana47906, United States
| |
Collapse
|
27
|
Kříž K, Schmidt L, Andersson AT, Walz MM, van der Spoel D. An Imbalance in the Force: The Need for Standardized Benchmarks for Molecular Simulation. J Chem Inf Model 2023; 63:412-431. [PMID: 36630710 PMCID: PMC9875315 DOI: 10.1021/acs.jcim.2c01127] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Indexed: 01/12/2023]
Abstract
Force fields (FFs) for molecular simulation have been under development for more than half a century. As with any predictive model, rigorous testing and comparisons of models critically depends on the availability of standardized data sets and benchmarks. While such benchmarks are rather common in the fields of quantum chemistry, this is not the case for empirical FFs. That is, few benchmarks are reused to evaluate FFs, and development teams rather use their own training and test sets. Here we present an overview of currently available tests and benchmarks for computational chemistry, focusing on organic compounds, including halogens and common ions, as FFs for these are the most common ones. We argue that many of the benchmark data sets from quantum chemistry can in fact be reused for evaluating FFs, but new gas phase data is still needed for compounds containing phosphorus and sulfur in different valence states. In addition, more nonequilibrium interaction energies and forces, as well as molecular properties such as electrostatic potentials around compounds, would be beneficial. For the condensed phases there is a large body of experimental data available, and tools to utilize these data in an automated fashion are under development. If FF developers, as well as researchers in artificial intelligence, would adopt a number of these data sets, it would become easier to compare the relative strengths and weaknesses of different models and to, eventually, restore the balance in the force.
Collapse
Affiliation(s)
- Kristian Kříž
- Department
of Cell and Molecular Biology, Uppsala University, Box 596, SE-75124Uppsala, Sweden
| | - Lisa Schmidt
- Faculty
of Biosciences, University of Heidelberg, Heidelberg69117, Germany
| | - Alfred T. Andersson
- Department
of Cell and Molecular Biology, Uppsala University, Box 596, SE-75124Uppsala, Sweden
| | - Marie-Madeleine Walz
- Department
of Cell and Molecular Biology, Uppsala University, Box 596, SE-75124Uppsala, Sweden
| | - David van der Spoel
- Department
of Cell and Molecular Biology, Uppsala University, Box 596, SE-75124Uppsala, Sweden
| |
Collapse
|
28
|
Thürlemann M, Böselt L, Riniker S. Regularized by Physics: Graph Neural Network Parametrized Potentials for the Description of Intermolecular Interactions. J Chem Theory Comput 2023; 19:562-579. [PMID: 36633918 PMCID: PMC9878731 DOI: 10.1021/acs.jctc.2c00661] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Indexed: 01/13/2023]
Abstract
Simulations of molecular systems using electronic structure methods are still not feasible for many systems of biological importance. As a result, empirical methods such as force fields (FF) have become an established tool for the simulation of large and complex molecular systems. The parametrization of FF is, however, time-consuming and has traditionally been based on experimental data. Recent years have therefore seen increasing efforts to automatize FF parametrization or to replace FF with machine-learning (ML) based potentials. Here, we propose an alternative strategy to parametrize FF, which makes use of ML and gradient-descent based optimization while retaining a functional form founded in physics. Using a predefined functional form is shown to enable interpretability, robustness, and efficient simulations of large systems over long time scales. To demonstrate the strength of the proposed method, a fixed-charge and a polarizable model are trained on ab initio potential-energy surfaces. Given only information about the constituting elements, the molecular topology, and reference potential energies, the models successfully learn to assign atom types and corresponding FF parameters from scratch. The resulting models and parameters are validated on a wide range of experimentally and computationally derived properties of systems including dimers, pure liquids, and molecular crystals.
Collapse
Affiliation(s)
- Moritz Thürlemann
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Lennard Böselt
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Sereina Riniker
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| |
Collapse
|
29
|
Pirhadi E, Vanegas JM, Farin M, Schertzer JW, Yong X. Effect of Local Stress on Accurate Modeling of Bacterial Outer Membranes Using All-Atom Molecular Dynamics. J Chem Theory Comput 2023; 19:363-372. [PMID: 36579901 DOI: 10.1021/acs.jctc.2c01026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Biological membranes are fundamental components of living organisms that play an undeniable role in their survival. Molecular dynamics (MD) serves as an essential computational tool for studying biomembranes on molecular and atomistic scales. The status quo of MD simulations of biomembranes studies a nanometer-sized membrane patch periodically extended under periodic boundary conditions (PBCs). In nature, membranes are usually composed of different lipids in their two layers (referred to as leaflets). This compositional asymmetry imposes a fixed ratio of lipid numbers between the two leaflets in a periodically constrained membrane, which needs to be set appropriately. The widely adopted methods of defining a leaflet lipid ratio suffer from the lack of control over the mechanical tension of each leaflet, which could significantly influence research findings. In this study, we investigate the role of membrane-building protocol and the resulting initial stress state on the interaction between small molecules and asymmetric membranes. We model the outer membrane of Pseudomonas aeruginosa bacteria using two different building protocols and probe their interactions with the Pseudomonas quinolone signal (PQS). Our results show that differential stress could shift the position of free energy minimum for the PQS molecule between the two leaflets of the asymmetric membrane. This work provides critical insights into the relationship between the initial per-leaflet tension and the spontaneous intercalation of PQS.
Collapse
Affiliation(s)
- Emad Pirhadi
- Department of Mechanical Engineering, Binghamton University, Binghamton, New York 13902-6000, United States
| | - Juan M Vanegas
- Department of Biochemistry and Biophysics, Oregon State University, Corvallis, Oregon 97331-4003, United States
| | - Mithila Farin
- Department of Mechanical Engineering, Binghamton University, Binghamton, New York 13902-6000, United States
| | - Jeffrey W Schertzer
- Department of Biological Sciences, Binghamton University, Binghamton, New York 13902-6000, United States
| | - Xin Yong
- Department of Mechanical Engineering, Binghamton University, Binghamton, New York 13902-6000, United States
| |
Collapse
|
30
|
Madin OC, Shirts MR. Using physical property surrogate models to perform accelerated multi-fidelity optimization of force field parameters †. DIGITAL DISCOVERY 2023; 2:828-847. [PMCID: PMC10259372 DOI: 10.1039/d2dd00138a] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2022] [Accepted: 04/28/2023] [Indexed: 06/14/2023]
Abstract
Accurate representations of van der Waals dispersion–repulsion interactions play an important role in high-quality molecular dynamics simulations. Training the force field parameters used in the Lennard Jones (LJ) potential typically used to represent these interactions is challenging, generally requiring adjustment based on simulations of macroscopic physical properties. The large computational expense of these simulations, especially when many parameters must be trained simultaneously, limits the size of training data set and number of optimization steps that can be taken, often requiring modelers to perform optimizations within a local parameter region. To allow for more global LJ parameter optimization against large training sets, we introduce a multi-fidelity optimization technique which uses Gaussian process surrogate modeling to build inexpensive models of physical properties as a function of LJ parameters. This approach allows for fast evaluation of approximate objective functions, greatly accelerating searches over parameter space and enabling the use of optimization algorithms capable of searching more globally. In this study, we use an iterative framework which performs global optimization with differential evolution at the surrogate level, followed by validation at the simulation level and surrogate refinement. Using this technique on two previously studied training sets, containing up to 195 physical property targets, we refit a subset of the LJ parameters for the OpenFF 1.0.0 (Parsley) force field. We demonstrate that this multi-fidelity technique can find improved parameter sets compared to a purely simulation-based optimization by searching more broadly and escaping local minima. Additionally, this technique often finds significantly different parameter minima that have comparably accurate performance. In most cases, these parameter sets are transferable to other similar molecules in a test set. Our multi-fidelity technique provides a platform for rapid, more global optimization of molecular models against physical properties, as well as a number of opportunities for further refinement of the technique. We present a multi-fidelity method for optimizing nonbonded force field parameters against physical property data. Leveraging fast surrogate models, we accelerate the parameter search and find novel solutions that improve force field performance.![]()
Collapse
Affiliation(s)
- Owen C. Madin
- Department of Chemical & Biological Engineering, University of Colorado BoulderBoulderCOUSA80309
| | - Michael R. Shirts
- Department of Chemical & Biological Engineering, University of Colorado BoulderBoulderCOUSA80309
| |
Collapse
|
31
|
Shahbabaei M, Tang T. Molecular modeling of thin-film nanocomposite membranes for reverse osmosis water desalination. Phys Chem Chem Phys 2022; 24:29298-29327. [PMID: 36453147 DOI: 10.1039/d2cp03839k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The scarcity of freshwater resources is a major global challenge causedby population and economic growth. Water desalination using a reverse osmosis (RO) membrane is a promising technology to supply potable water from seawater and brackish water. The advancement of RO desalination highly depends on new membrane materials. Currently, the RO technology mainly relies on polyamide thin-film composite (TFC) membranes, which suffer from several drawbacks (e.g., low water permeability, permeability-selectivity tradeoff, and low fouling resistance) that hamper their real-world applications. Nanoscale fillers with specific characteristics can be used to improve the properties of TFC membranes. Embedding nanofillers into TFC membranes using interfacial polymerization allows the creation of thin-film nanocomposite (TFNC) membranes, and has become an emerging strategy in the fabrication of high-performance membranes for advanced RO water desalination. To achieve optimal design, it is indispensable to search for reliable methods that can provide fast and accurate predictions of the structural and transport properties of the TFNC membranes. However, molecular understanding of permeability-selectivity characteristics of nanofillers remains limited, partially due to the challenges in experimentally exploring microscopic behaviors of water and salt ions in confinement. Molecular modeling and simulations can fill this gap by generating molecular-level insights into the effects of nanofillers' characteristics (e.g., shape, size, surface chemistry, and density) on water permeability and ion selectivity. In this review, we summarize molecular simulations of a diverse range of nanofillers including nanotubes (carbon nanotubes, boron nitride nanotubes, and aquaporin-mimicking nanochannels) and nanosheets (graphene, graphene oxide, boron nitride sheets, molybdenum disulfide, metal and covalent organic frameworks) for water desalination applications. These simulations reveal that water permeability and salt rejection, as the major factors determining the desalination performance of TFNC membranes, significantly depend on the size, topology, density, and chemical modifications of the nanofillers. Identifying their influences and the physicochemical processes behind, via molecular modeling, is expected to yield important insights for the fabrication and optimization of the next generation high-performance TFNC membranes for RO water desalination.
Collapse
Affiliation(s)
- Majid Shahbabaei
- Department of Mechanical Engineering, University of Alberta, Edmonton, Alberta, Canada.
| | - Tian Tang
- Department of Mechanical Engineering, University of Alberta, Edmonton, Alberta, Canada.
| |
Collapse
|
32
|
D’Amore L, Hahn DF, Dotson DL, Horton JT, Anwar J, Craig I, Fox T, Gobbi A, Lakkaraju SK, Lucas X, Meier K, Mobley DL, Narayanan A, Schindler CE, Swope WC, in ’t Veld PJ, Wagner J, Xue B, Tresadern G. Collaborative Assessment of Molecular Geometries and Energies from the Open Force Field. J Chem Inf Model 2022; 62:6094-6104. [PMID: 36433835 PMCID: PMC9873353 DOI: 10.1021/acs.jcim.2c01185] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Force fields form the basis for classical molecular simulations, and their accuracy is crucial for the quality of, for instance, protein-ligand binding simulations in drug discovery. The huge diversity of small-molecule chemistry makes it a challenge to build and parameterize a suitable force field. The Open Force Field Initiative is a combined industry and academic consortium developing a state-of-the-art small-molecule force field. In this report, industry members of the consortium worked together to objectively evaluate the performance of the force fields (referred to here as OpenFF) produced by the initiative on a combined public and proprietary dataset of 19,653 relevant molecules selected from their internal research and compound collections. This evaluation was important because it was completely blind; at most partners, none of the molecules or data were used in force field development or testing prior to this work. We compare the Open Force Field "Sage" version 2.0.0 and "Parsley" version 1.3.0 with GAFF-2.11-AM1BCC, OPLS4, and SMIRNOFF99Frosst. We analyzed force-field-optimized geometries and conformer energies compared to reference quantum mechanical data. We show that OPLS4 performs best, and the latest Open Force Field release shows a clear improvement compared to its predecessors. The performance of established force fields such as GAFF-2.11 was generally worse. While OpenFF researchers were involved in building the benchmarking infrastructure used in this work, benchmarking was done entirely in-house within industrial organizations and the resulting assessment is reported here. This work assesses the force field performance using separate benchmarking steps, external datasets, and involving external research groups. This effort may also be unique in terms of the number of different industrial partners involved, with 10 different companies participating in the benchmark efforts.
Collapse
Affiliation(s)
- Lorenzo D’Amore
- Computational Chemistry, Janssen R&D, C/ Jarama 75A, 45007 Toledo, Spain
| | - David F. Hahn
- Computational Chemistry, Janssen R&D, Turnhoutseweg 30, Beerse B-2340, Belgium
| | - David L. Dotson
- The Open Force Field Initiative, Open Molecular Software Foundation, Davis, California 95616, USA
| | - Joshua T. Horton
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne NE1 7RU, UK
| | - Jamshed Anwar
- Department of Chemistry, Lancaster University, Lancaster LA1 4YW, UK
| | - Ian Craig
- Molecular Modeling & Drug Discovery, BASF SE, 67056 Ludwigshafen, Germany
| | - Thomas Fox
- Medicinal Chemistry, Boehringer Ingelheim Pharma GmbH & Co KG, 88397 Biberach/Riss, Germany
| | - Alberto Gobbi
- Genentech, Inc., 1 DNA Way, South San Francisco, California, 94080, USA
| | | | - Xavier Lucas
- Roche Pharma Research and Early Development, Therapeutic Modalities, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Grenzacherstrasse 124, 4070 Basel, Switzerland
| | - Katharina Meier
- Computational Life Science Technology Functions, Crop Science, R&D, Bayer AG, 40789 Monheim, Germany
| | - David L. Mobley
- Departments of Pharmaceutical Sciences and Chemistry, University of California 92617, Irvine, USA
| | - Arjun Narayanan
- Data and Computational Sciences, Vertex Pharmaceuticals, 50 Northern Ave, Boston, MA 02210, USA
| | | | - William C. Swope
- Genentech, Inc., 1 DNA Way, South San Francisco, California, 94080, USA
| | | | - Jeffrey Wagner
- The Open Force Field Initiative, Open Molecular Software Foundation, Davis, California, 95616, USA,Chemistry Department, The University of California at Irvine, Irvine, California, 92617, USA
| | - Bai Xue
- XtalPi Inc. Floor 3, International Biomedical Innovation Park II, No. 2 Hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, Guangdong, 518040 China
| | - Gary Tresadern
- Computational Chemistry, Janssen R&D, Turnhoutseweg 30, Beerse B-2340, Belgium
| |
Collapse
|
33
|
How does it really move? Recent progress in the investigation of protein nanosecond dynamics by NMR and simulation. Curr Opin Struct Biol 2022; 77:102459. [PMID: 36148743 DOI: 10.1016/j.sbi.2022.102459] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2022] [Revised: 07/29/2022] [Accepted: 08/05/2022] [Indexed: 12/14/2022]
Abstract
Nuclear magnetic resonance (NMR) spin relaxation experiments currently probe molecular motions on timescales from picoseconds to nanoseconds. The detailed interpretation of these motions in atomic detail benefits from complementarity with the results from molecular dynamics (MD) simulations. In this mini-review, we describe the recent developments in experimental techniques to study the backbone dynamics from 15N relaxation and side-chain dynamics from 13C relaxation, discuss the different analysis approaches from model-free to dynamics detectors, and highlight the many ways that NMR relaxation experiments and MD simulations can be used together to improve the interpretation and gain insights into protein dynamics.
Collapse
|
34
|
Horton J, Boothroyd S, Wagner J, Mitchell JA, Gokey T, Dotson DL, Behara PK, Ramaswamy VK, Mackey M, Chodera JD, Anwar J, Mobley DL, Cole DJ. Open Force Field BespokeFit: Automating Bespoke Torsion Parametrization at Scale. J Chem Inf Model 2022; 62:5622-5633. [PMID: 36351167 PMCID: PMC9709916 DOI: 10.1021/acs.jcim.2c01153] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The development of accurate transferable force fields is key to realizing the full potential of atomistic modeling in the study of biological processes such as protein-ligand binding for drug discovery. State-of-the-art transferable force fields, such as those produced by the Open Force Field Initiative, use modern software engineering and automation techniques to yield accuracy improvements. However, force field torsion parameters, which must account for many stereoelectronic and steric effects, are considered to be less transferable than other force field parameters and are therefore often targets for bespoke parametrization. Here, we present the Open Force Field QCSubmit and BespokeFit software packages that, when combined, facilitate the fitting of torsion parameters to quantum mechanical reference data at scale. We demonstrate the use of QCSubmit for simplifying the process of creating and archiving large numbers of quantum chemical calculations, by generating a dataset of 671 torsion scans for druglike fragments. We use BespokeFit to derive individual torsion parameters for each of these molecules, thereby reducing the root-mean-square error in the potential energy surface from 1.1 kcal/mol, using the original transferable force field, to 0.4 kcal/mol using the bespoke version. Furthermore, we employ the bespoke force fields to compute the relative binding free energies of a congeneric series of inhibitors of the TYK2 protein, and demonstrate further improvements in accuracy, compared to the base force field (MUE reduced from 0.560.390.77 to 0.420.280.59 kcal/mol and R2 correlation improved from 0.720.350.87 to 0.930.840.97).
Collapse
Affiliation(s)
- Joshua
T. Horton
- School
of Natural and Environmental Sciences, Newcastle
University, Newcastle
upon TyneNE1 7RU, United
Kingdom
| | - Simon Boothroyd
- Boothroyd
Scientific Consulting Ltd., 71-75 Shelton Street, LondonWC2H 9JQ, Greater London, United Kingdom
| | - Jeffrey Wagner
- The
Open Force Field Initiative, Open Molecular
Software Foundation, Davis, California95616, United States
| | - Joshua A. Mitchell
- The
Open Force Field Initiative, Open Molecular
Software Foundation, Davis, California95616, United States
| | - Trevor Gokey
- Department
of Chemistry, University of California, Irvine, California92697, United States
| | - David L. Dotson
- The
Open Force Field Initiative, Open Molecular
Software Foundation, Davis, California95616, United States
| | - Pavan Kumar Behara
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California92697, United States
| | | | - Mark Mackey
- Cresset, New Cambridge House, Bassingbourn
Road, LitlingtonSG8 0SS, Cambridgeshire, United Kingdom
| | - John D. Chodera
- Computational
& Systems Biology Program, Sloan Kettering
Institute, Memorial Sloan Kettering Cancer Center, New
York, New York10065, United States
| | - Jamshed Anwar
- Department
of Chemistry, Lancaster University, LancasterLA1 4YW, United Kingdom
| | - David L. Mobley
- Department
of Chemistry, University of California, Irvine, California92697, United States,Department
of Pharmaceutical Sciences, University of
California, Irvine, California92697, United States
| | - Daniel J. Cole
- School
of Natural and Environmental Sciences, Newcastle
University, Newcastle
upon TyneNE1 7RU, United
Kingdom,
| |
Collapse
|
35
|
Mind the Gap—Deciphering GPCR Pharmacology Using 3D Pharmacophores and Artificial Intelligence. Pharmaceuticals (Basel) 2022; 15:ph15111304. [DOI: 10.3390/ph15111304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 10/15/2022] [Accepted: 10/17/2022] [Indexed: 11/09/2022] Open
Abstract
G protein-coupled receptors (GPCRs) are amongst the most pharmaceutically relevant and well-studied protein targets, yet unanswered questions in the field leave significant gaps in our understanding of their nuanced structure and function. Three-dimensional pharmacophore models are powerful computational tools in in silico drug discovery, presenting myriad opportunities for the integration of GPCR structural biology and cheminformatics. This review highlights success stories in the application of 3D pharmacophore modeling to de novo drug design, the discovery of biased and allosteric ligands, scaffold hopping, QSAR analysis, hit-to-lead optimization, GPCR de-orphanization, mechanistic understanding of GPCR pharmacology and the elucidation of ligand–receptor interactions. Furthermore, advances in the incorporation of dynamics and machine learning are highlighted. The review will analyze challenges in the field of GPCR drug discovery, detailing how 3D pharmacophore modeling can be used to address them. Finally, we will present opportunities afforded by 3D pharmacophore modeling in the advancement of our understanding and targeting of GPCRs.
Collapse
|
36
|
Oliveira MP, Gonçalves YMH, Ol Gheta SK, Rieder SR, Horta BAC, Hünenberger PH. Comparison of the United- and All-Atom Representations of (Halo)alkanes Based on Two Condensed-Phase Force Fields Optimized against the Same Experimental Data Set. J Chem Theory Comput 2022; 18:6757-6778. [PMID: 36190354 DOI: 10.1021/acs.jctc.2c00524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The level of accuracy that can be achieved by a force field is influenced by choices made in the interaction-function representation and in the relevant simulation parameters. These choices, referred to here as functional-form variants (FFVs), include for example the model resolution, the charge-derivation procedure, the van der Waals combination rules, the cutoff distance, and the treatment of the long-range interactions. Ideally, assessing the effect of a given FFV on the intrinsic accuracy of the force-field representation requires that only the specific FFV is changed and that this change is performed at an optimal level of parametrization, a requirement that may prove extremely challenging to achieve in practice. Here, we present a first attempt at such a comparison for one specific FFV, namely the choice of a united-atom (UA) versus an all-atom (AA) resolution in a force field for saturated acyclic (halo)alkanes. Two force-field versions (UA vs AA) are optimized in an automated way using the CombiFF approach against 961 experimental values for the pure-liquid densities ρliq and vaporization enthalpies ΔHvap of 591 compounds. For the AA force field, the torsional and third-neighbor Lennard-Jones parameters are also refined based on quantum-mechanical rotational-energy profiles. The comparison between the UA and AA resolutions is also extended to properties that have not been included as parameterization targets, namely the surface-tension coefficient γ, the isothermal compressibility κT, the isobaric thermal-expansion coefficient αP, the isobaric heat capacity cP, the static relative dielectric permittivity ϵ, the self-diffusion coefficient D, the shear viscosity η, the hydration free energy ΔGwat, and the free energy of solvation ΔGche in cyclohexane. For the target properties ρliq and ΔHvap, the UA and AA resolutions reach very similar levels of accuracy after optimization. For the nine other properties, the AA representation leads to more accurate results in terms of η; comparably accurate results in terms of γ, κT, αP, ϵ, D, and ΔGche; and less accurate results in terms of cP and ΔGwat. This work also represents a first step toward the calibration of a GROMOS-compatible force field at the AA resolution.
Collapse
Affiliation(s)
- Marina P Oliveira
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Yan M H Gonçalves
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - S Kashef Ol Gheta
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Salomé R Rieder
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Bruno A C Horta
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Philippe H Hünenberger
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| |
Collapse
|
37
|
Hofstetter A, Böselt L, Riniker S. Graph-convolutional neural networks for (QM)ML/MM molecular dynamics simulations. Phys Chem Chem Phys 2022; 24:22497-22512. [PMID: 36106790 DOI: 10.1039/d2cp02931f] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
To accurately study the chemical reactions in the condensed phase or within enzymes, both quantum-mechanical description and sufficient configurational sampling are required to reach converged estimates. Here, quantum mechanics/molecular mechanics (QM/MM) molecular dynamics (MD) simulations play an important role, providing QM accuracy for the region of interest at a decreased computational cost. However, QM/MM simulations are still too expensive to study large systems on longer time scales. Recently, machine learning (ML) models have been proposed to replace the QM description. The main limitation of these models lies in the accurate description of long-range interactions present in condensed-phase systems. To overcome this issue, a recent workflow has been introduced combining a semi-empirical method (i.e. density functional tight binding (DFTB)) and a high-dimensional neural network potential (HDNNP) in a Δ-learning scheme. This approach has been shown to be capable of correctly incorporating long-range interactions within a cutoff of 1.4 nm. One of the promising alternative approaches to efficiently take long-range effects into account is the development of graph-convolutional neural networks (GCNNs) for the prediction of the potential-energy surface. In this work, we investigate the use of GCNN models - with and without a Δ-learning scheme - for (QM)ML/MM MD simulations. We show that the Δ-learning approach using a GCNN and DFTB as a baseline achieves competitive performance on our benchmarking set of solutes and chemical reactions in water. This method is additionally validated by performing prospective (QM)ML/MM MD simulations of retinoic acid in water and S-adenoslymethionine interacting with cytosine in water. The results indicate that the Δ-learning GCNN model is a valuable alternative for the (QM)ML/MM MD simulations of condensed-phase systems.
Collapse
Affiliation(s)
- Albert Hofstetter
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland.
| | - Lennard Böselt
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland.
| | - Sereina Riniker
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland.
| |
Collapse
|
38
|
Castillo-Sánchez JR, Rincent A, Gheribi AE, Harvey JP. On the transferability of classical pairwise additive atomistic force field to the description of unary and multi-component systems: applications to the solidification of Al-based alloys. Phys Chem Chem Phys 2022; 24:22605-22623. [PMID: 36102884 DOI: 10.1039/d2cp02746a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Multi-component and multiphasic materials are continually being developed for electronics, aircraft, automotive, and general applications. Integrated Computational Materials Engineering (ICME) is a multiple-length scale approach that greatly benefits from atomistic scale simulations to explore new alloys. Molecular Dynamics (MD) allows to perform large-scale simulations by using classical interatomic potentials. The main challenge of using such a classical approach is the transferability of the interatomic potentials from one structure to another when one aims to study multi-component systems. In this work, the reliability of Zr, Al-Cu, Al-Cr and Al-Zr-Ti force field potentials is examined. It has been found that current interatomic potentials are not completely transferable due to the structure dependence from their parameterization. Besides that, they provide an appropriate description of unary and binary systems, notably for liquids, isotropic solids, and partially isotropic compounds. For solidification purposes, it has been found that coherent primary solidification of the FCC-phase in pure Al is highly dependent on the formalism to tune interatomic interactions. For Al-Cr alloys, the icosahedral short-range ordering (ISRO) increased by adding Cr to the melts. The different steps of solidification (formation of nuclei, effective germination of the α-Al phase and end of solidification) have been related to the evolution of the ISRO. The addition of Cr in melts prevented undercooling via icosahedral-enhanced nucleation of the α-Al phase. Precipitation of primary intermetallics in hyper-peritectic Al-Cr alloys was also tested. Contrary to classical thermodynamics predictions, α-Al phase was the primary precipitate for these alloys. This implies that Cr supersaturated the α-Al phase rather than forming intermetallic phases due to the high cooling rates.
Collapse
Affiliation(s)
- Juan-Ricardo Castillo-Sánchez
- Centre for Research in Computational Thermochemistry (CRCT), Department of Chemical Engineering, Polytechnique Montréal, C.P. 6079, Succursale "Downtown", Montréal, Québec H3C 3A7, Canada.
| | - Antoine Rincent
- Centre for Research in Computational Thermochemistry (CRCT), Department of Chemical Engineering, Polytechnique Montréal, C.P. 6079, Succursale "Downtown", Montréal, Québec H3C 3A7, Canada.
| | - Aïmen E Gheribi
- Centre for Research in Computational Thermochemistry (CRCT), Department of Chemical Engineering, Polytechnique Montréal, C.P. 6079, Succursale "Downtown", Montréal, Québec H3C 3A7, Canada.
| | - Jean-Philippe Harvey
- Centre for Research in Computational Thermochemistry (CRCT), Department of Chemical Engineering, Polytechnique Montréal, C.P. 6079, Succursale "Downtown", Montréal, Québec H3C 3A7, Canada.
| |
Collapse
|
39
|
Ricci E, Minelli M, De Angelis MG. Modelling Sorption and Transport of Gases in Polymeric Membranes across Different Scales: A Review. MEMBRANES 2022; 12:membranes12090857. [PMID: 36135877 PMCID: PMC9502097 DOI: 10.3390/membranes12090857] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 08/24/2022] [Accepted: 08/27/2022] [Indexed: 06/02/2023]
Abstract
Professor Giulio C. Sarti has provided outstanding contributions to the modelling of fluid sorption and transport in polymeric materials, with a special eye on industrial applications such as membrane separation, due to his Chemical Engineering background. He was the co-creator of innovative theories such as the Non-Equilibrium Theory for Glassy Polymers (NET-GP), a flexible tool to estimate the solubility of pure and mixed fluids in a wide range of polymers, and of the Standard Transport Model (STM) for estimating membrane permeability and selectivity. In this review, inspired by his rigorous and original approach to representing membrane fundamentals, we provide an overview of the most significant and up-to-date modeling tools available to estimate the main properties governing polymeric membranes in fluid separation, namely solubility and diffusivity. The paper is not meant to be comprehensive, but it focuses on those contributions that are most relevant or that show the potential to be relevant in the future. We do not restrict our view to the field of macroscopic modelling, which was the main playground of professor Sarti, but also devote our attention to Molecular and Multiscale Hierarchical Modeling. This work proposes a critical evaluation of the different approaches considered, along with their limitations and potentiality.
Collapse
Affiliation(s)
- Eleonora Ricci
- Department of Civil, Chemical, Environmental and Materials Engineering (DICAM), Alma Mater Studiorum—University of Bologna, 40126 Bologna, Italy
| | - Matteo Minelli
- Department of Civil, Chemical, Environmental and Materials Engineering (DICAM), Alma Mater Studiorum—University of Bologna, 40126 Bologna, Italy
| | - Maria Grazia De Angelis
- Institute for Materials and Processes, School of Engineering, University of Edinburgh, Edinburgh EH9 3FB, UK
| |
Collapse
|
40
|
Rieder SR, Ries BJ, Kubincová A, Champion C, Barros EP, Hünenberger PH, Riniker S. Leveraging the Sampling Efficiency of RE-EDS in OpenMM Using a Shifted Reaction-Field With an Atom-Based Cutoff. J Chem Phys 2022; 157:104117. [DOI: 10.1063/5.0107935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Replica-exchange enveloping distribution sampling (RE-EDS) is a pathway-independent multistate free-energy method, currently implemented in the GROMOS software package for molecular dynamics (MD) simulations. It has a high intrinsic sampling efficiency as the interactions between the unperturbed particles have to be calculated only once for multiple end-states. As a result, RE-EDS is an attractive method for the calculation of relative solvation and binding free energies. An essential requirement for reaching this high efficiency is the separability of the nonbonded interactions into solute-solute, solute-environment, and environment-environment contributions. Such a partitioning is trivial when using a Coulomb term with a reaction-field (RF) correction to model the electrostatic interactions, but not when using lattice- sum schemes. To avoid cutoff artifacts, the RF correction is typically used in combination with a charge-group based cutoff, which is not supported by most small-molecule force fields and other MD engines. To address this issue, we investigate the combination of RE-EDS simulations with a recently introduced RF scheme including a shifting function that enables the rigorous calculation of RF electrostatics with atom-based cutoffs. The resulting approach is validated by calculating solvation free energies with the generalized AMBER force field (GAFF) in water and chloroform using both the GROMOS software package and a proof-of-concept implementation in OpenMM.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Sereina Riniker
- Laboratory of Physical Chemistry, ETH Zurich D-CHAB, Switzerland
| |
Collapse
|
41
|
Nandy A, Adamji H, Kastner DW, Vennelakanti V, Nazemi A, Liu M, Kulik HJ. Using Computational Chemistry To Reveal Nature’s Blueprints for Single-Site Catalysis of C–H Activation. ACS Catal 2022. [DOI: 10.1021/acscatal.2c02096] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Affiliation(s)
- Aditya Nandy
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Husain Adamji
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - David W. Kastner
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Vyshnavi Vennelakanti
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Azadeh Nazemi
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Mingjie Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J. Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
42
|
Rieder SR, Ries B, Schaller K, Champion C, Barros EP, Hünenberger PH, Riniker S. Replica-Exchange Enveloping Distribution Sampling Using Generalized AMBER Force-Field Topologies: Application to Relative Hydration Free-Energy Calculations for Large Sets of Molecules. J Chem Inf Model 2022; 62:3043-3056. [PMID: 35675713 PMCID: PMC9241072 DOI: 10.1021/acs.jcim.2c00383] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
![]()
Free-energy differences
between pairs of end-states can be estimated
based on molecular dynamics (MD) simulations using standard pathway-dependent
methods such as thermodynamic integration (TI), free-energy perturbation,
or Bennett’s acceptance ratio. Replica-exchange enveloping
distribution sampling (RE-EDS), on the other hand, allows for the
sampling of multiple end-states in a single simulation without the
specification of any pathways. In this work, we use the RE-EDS method
as implemented in GROMOS together with generalized AMBER force-field
(GAFF) topologies, converted to a GROMOS-compatible format with a
newly developed GROMOS++ program amber2gromos, to
compute relative hydration free energies for a series of benzene derivatives.
The results obtained with RE-EDS are compared to the experimental
data as well as calculated values from the literature. In addition,
the estimated free-energy differences in water and in vacuum are compared
to values from TI calculations carried out with GROMACS. The hydration
free energies obtained using RE-EDS for multiple molecules are found
to be in good agreement with both the experimental data and the results
calculated using other free-energy methods. While all considered free-energy
methods delivered accurate results, the RE-EDS calculations required
the least amount of total simulation time. This work serves as a validation
for the use of GAFF topologies with the GROMOS simulation package
and the RE-EDS approach. Furthermore, the performance of RE-EDS for
a large set of 28 end-states is assessed with promising results.
Collapse
Affiliation(s)
- Salomé R Rieder
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Benjamin Ries
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Kay Schaller
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Candide Champion
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Emilia P Barros
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Philippe H Hünenberger
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Sereina Riniker
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| |
Collapse
|
43
|
Better force fields start with better data: A data set of cation dipeptide interactions. Sci Data 2022; 9:327. [PMID: 35715420 PMCID: PMC9205945 DOI: 10.1038/s41597-022-01297-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Accepted: 03/18/2022] [Indexed: 11/08/2022] Open
Abstract
We present a data set from a first-principles study of amino-methylated and acetylated (capped) dipeptides of the 20 proteinogenic amino acids - including alternative possible side chain protonation states and their interactions with selected divalent cations (Ca2+, Mg2+ and Ba2+). The data covers 21,909 stationary points on the respective potential-energy surfaces in a wide relative energy range of up to 4 eV (390 kJ/mol). Relevant properties of interest, like partial charges, were derived for the conformers. The motivation was to provide a solid data basis for force field parameterization and further applications like machine learning or benchmarking. In particular the process of creating all this data on the same first-principles footing, i.e. density-functional theory calculations employing the generalized gradient approximation with a van der Waals correction, makes this data suitable for first principles data-driven force field development. To make the data accessible across domain borders and to machines, we formalized the metadata in an ontology.
Collapse
|
44
|
Pascazio R, Zaccaria F, van Beek B, Infante I. Classical Force-Field Parameters for CsPbBr 3 Perovskite Nanocrystals. THE JOURNAL OF PHYSICAL CHEMISTRY. C, NANOMATERIALS AND INTERFACES 2022; 126:9898-9908. [PMID: 35747512 PMCID: PMC9207923 DOI: 10.1021/acs.jpcc.2c00600] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 04/26/2022] [Indexed: 06/15/2023]
Abstract
Understanding the chemico-physical properties of colloidal semiconductor nanocrystals (NCs) requires exploration of the dynamic processes occurring at the NC surfaces, in particular at the ligand-NC interface. Classical molecular dynamics (MD) simulations under realistic conditions are a powerful tool to acquire this knowledge because they have good accuracy and are computationally cheap, provided that a set of force-field (FF) parameters is available. In this work, we employed a stochastic algorithm, the adaptive rate Monte Carlo method, to optimize FF parameters of cesium lead halide perovskite (CsPbBr3) NCs passivated with typical organic molecules used in the synthesis of these materials: oleates, phosphonates, sulfonates, and primary and quaternary ammonium ligands. The optimized FF parameters have been obtained against MD reference trajectories computed at the density functional theory level on small NC model systems. We validated our parameters through a comparison of a wide range of nonfitted properties to experimentally available values. With the exception of the NC-phosphonate case, the transferability of the FF model has been successfully tested on realistically sized systems (>5 nm) comprising thousands of passivating organic ligands and solvent molecules, just as those used in experiments.
Collapse
Affiliation(s)
- Roberta Pascazio
- Department
of Nanochemistry, Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
- Dipartimento
di Chimica e Chimica Industriale, Università
degli Studi di Genova, Via Dodecaneso 31, 16146 Genova, Italy
| | - Francesco Zaccaria
- Department
of Nanochemistry, Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
| | - Bas van Beek
- Department
of Theoretical Chemistry, Faculty of Science, Vrije Universiteit Amsterdam, de Boelelaan 1083, 1081 HV Amsterdam, The Netherlands
| | - Ivan Infante
- Department
of Nanochemistry, Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
- BCMaterials,
Basque Center for Materials, Applications, and Nanostructures, UPV/EHU Science Park, Leioa 48940, Spain
- Ikerbasque,
Basque Foundation for Science, Bilbao 48009, Spain
| |
Collapse
|
45
|
Paiva VDA, Gomes IDS, Monteiro CR, Mendonça MV, Martins PM, Santana CA, Gonçalves-Almeida V, Izidoro SC, Melo-Minardi RCD, Silveira SDA. Protein structural bioinformatics: An overview. Comput Biol Med 2022; 147:105695. [DOI: 10.1016/j.compbiomed.2022.105695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 06/01/2022] [Accepted: 06/02/2022] [Indexed: 11/27/2022]
|
46
|
Yang Z, Hajlasz N, Kulik HJ. Computational Modeling of Conformer Stability in Benenodin-1, a Thermally Actuated Lasso Peptide Switch. J Phys Chem B 2022; 126:3398-3406. [PMID: 35481742 DOI: 10.1021/acs.jpcb.2c00762] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Benenodin-1 is a thermally actuated lasso peptide rotaxane switch with two primary translational isomers that differ in the relative position of the residue Gln15. The conversion from one conformer to the other involves substantial enthalpy-entropy compensation: one conformer is energetically favored and the other is entropically favored. Here, we take a multi-scale quantum mechanical (QM) and classical molecular dynamic (MD) approach to reveal residue-specific sources of these differences in stability. QM reveals that the two benenodin-1 conformers involve distinct hydrogen bonding networks, with the enthalpically favored conformer having more intra-peptide hydrogen bonds between the Gln15 side chain and nearby residues. The evaluation of configurational entropy over the MD-sampled geometries reveals that the entropically favored conformer has enhanced conformational flexibility. By computing the by-residue-sum entropies, we identify the role of Gln15 and neighboring Glu14 in mediating the entropic variation during the switching process. These computational insights help explain the effects of Glu14Ala and Gln15Ala mutations on the conformational population of benenodin-1 observed experimentally.
Collapse
Affiliation(s)
- Zhongyue Yang
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Natalia Hajlasz
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
47
|
Sundararaman R, Vigil-Fowler D, Schwarz K. Improving the Accuracy of Atomistic Simulations of the Electrochemical Interface. Chem Rev 2022; 122:10651-10674. [PMID: 35522135 PMCID: PMC10127457 DOI: 10.1021/acs.chemrev.1c00800] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Atomistic simulation of the electrochemical double layer is an ambitious undertaking, requiring quantum mechanical description of electrons, phase space sampling of liquid electrolytes, and equilibration of electrolytes over nanosecond time scales. All models of electrochemistry make different trade-offs in the approximation of electrons and atomic configurations, from the extremes of classical molecular dynamics of a complete interface with point-charge atoms to correlated electronic structure methods of a single electrode configuration with no dynamics or electrolyte. Here, we review the spectrum of simulation techniques suitable for electrochemistry, focusing on the key approximations and accuracy considerations for each technique. We discuss promising approaches, such as enhanced sampling techniques for atomic configurations and computationally efficient beyond density functional theory (DFT) electronic methods, that will push electrochemical simulations beyond the present frontier.
Collapse
Affiliation(s)
- Ravishankar Sundararaman
- Department of Materials Science and Engineering, Rensselaer Polytechnic Institute, 110 8th Street, Troy, New York 12180, United States
| | - Derek Vigil-Fowler
- Materials, Chemical, and Computational Science Directorate, National Renewable Energy Laboratory, Golden, Colorado 80401, United States
| | - Kathleen Schwarz
- Material Measurement Laboratory, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| |
Collapse
|
48
|
Blanco MA. Computational models for studying physical instabilities in high concentration biotherapeutic formulations. MAbs 2022; 14:2044744. [PMID: 35282775 PMCID: PMC8928847 DOI: 10.1080/19420862.2022.2044744] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Computational prediction of the behavior of concentrated protein solutions is particularly advantageous in early development stages of biotherapeutics when material availability is limited and a large set of formulation conditions needs to be explored. This review provides an overview of the different computational paradigms that have been successfully used in modeling undesirable physical behaviors of protein solutions with a particular emphasis on high-concentration drug formulations. This includes models ranging from all-atom simulations, coarse-grained representations to macro-scale mathematical descriptions used to study physical instability phenomena of protein solutions such as aggregation, elevated viscosity, and phase separation. These models are compared and summarized in the context of the physical processes and their underlying assumptions and limitations. A detailed analysis is also given for identifying protein interaction processes that are explicitly or implicitly considered in the different modeling approaches and particularly their relations to various formulation parameters. Lastly, many of the shortcomings of existing computational models are discussed, providing perspectives and possible directions toward an efficient computational framework for designing effective protein formulations.
Collapse
Affiliation(s)
- Marco A. Blanco
- Materials and Biophysical Characterization, Analytical R & D, Merck & Co., Inc, Kenilworth, NJ USA
| |
Collapse
|
49
|
Madin OC, Boothroyd S, Messerly RA, Fass J, Chodera JD, Shirts MR. Bayesian-Inference-Driven Model Parametrization and Model Selection for 2CLJQ Fluid Models. J Chem Inf Model 2022; 62:874-889. [PMID: 35129974 PMCID: PMC9217127 DOI: 10.1021/acs.jcim.1c00829] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A high level of physical detail in a molecular model improves its ability to perform high accuracy simulations but can also significantly affect its complexity and computational cost. In some situations, it is worthwhile to add complexity to a model to capture properties of interest; in others, additional complexity is unnecessary and can make simulations computationally infeasible. In this work, we demonstrate the use of Bayesian inference for molecular model selection, using Monte Carlo sampling techniques accelerated with surrogate modeling to evaluate the Bayes factor evidence for different levels of complexity in the two-centered Lennard-Jones + quadrupole (2CLJQ) fluid model. Examining three nested levels of model complexity, we demonstrate that the use of variable quadrupole and bond length parameters in this model framework is justified only for some chemistries. Through this process, we also get detailed information about the distributions and correlation of parameter values, enabling improved parametrization and parameter analysis. We also show how the choice of parameter priors, which encode previous model knowledge, can have substantial effects on the selection of models, penalizing careless introduction of additional complexity. We detail the computational techniques used in this analysis, providing a roadmap for future applications of molecular model selection via Bayesian inference and surrogate modeling.
Collapse
Affiliation(s)
- Owen C. Madin
- Department of Chemical & Biological Engineering, University of Colorado Boulder, Boulder, CO 80309
| | - Simon Boothroyd
- Boothroyd Scientific Consulting Ltd., 71-75 Shelton Street, London, Greater London, United Kingdom, WC2H 9JQ
| | | | - Josh Fass
- Computational & Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065
| | - John D. Chodera
- Department of Chemical & Biological Engineering, University of Colorado Boulder, Boulder, CO 80309
| | - Michael R. Shirts
- Department of Chemical & Biological Engineering, University of Colorado Boulder, Boulder, CO 80309
| |
Collapse
|
50
|
Thürlemann M, Böselt L, Riniker S. Learning Atomic Multipoles: Prediction of the Electrostatic Potential with Equivariant Graph Neural Networks. J Chem Theory Comput 2022; 18:1701-1710. [PMID: 35112866 DOI: 10.1021/acs.jctc.1c01021] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The accurate description of electrostatic interactions remains a challenging problem for classical potential-energy functions. The commonly used fixed partial-charge approximation fails to reproduce the electrostatic potential at short range due to its insensitivity to conformational changes and anisotropic effects. At the same time, possibly more accurate machine-learned (ML) potentials struggle with the long-range behavior due to their inherent locality ansatz. Employing a multipole expansion offers in principle an exact treatment of the electrostatic potential such that the long-range and short-range electrostatic interactions can be treated simultaneously with high accuracy. However, such an expansion requires the calculation of the electron density using computationally expensive quantum-mechanical (QM) methods. Here, we introduce an equivariant graph neural network (GNN) to address this issue. The proposed model predicts atomic multipoles up to the quadrupole, circumventing the need for expensive QM computations. By using an equivariant architecture, the model enforces the correct symmetry by design without relying on local reference frames. The GNN reproduces the electrostatic potential of various systems with high fidelity. Possible uses for such an approach include the separate treatment of long-range interactions in ML potentials, the analysis of electrostatic potential surfaces, and static multipoles in polarizable force fields.
Collapse
Affiliation(s)
- Moritz Thürlemann
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Lennard Böselt
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Sereina Riniker
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| |
Collapse
|