1
|
Notari E, Wood CW, Michel J. Assessment of the Topology and Oligomerisation States of Coiled Coils Using Metadynamics with Conformational Restraints. J Chem Theory Comput 2025; 21:3260-3276. [PMID: 40042175 PMCID: PMC11948332 DOI: 10.1021/acs.jctc.4c01695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2024] [Revised: 02/04/2025] [Accepted: 02/17/2025] [Indexed: 03/26/2025]
Abstract
Coiled-coil proteins provide an excellent scaffold for multistate de novo protein design due to their established sequence-to-structure relationships and ability to switch conformations in response to external stimuli, such as changes in pH or temperature. However, the computational design of multistate coiled-coil protein assemblies is challenging, as it requires accurate estimates of the free energy differences between multiple alternative coiled-coil conformations. Here, we demonstrate how this challenge can be tackled using metadynamics simulations with orientational, positional and conformational restraints. We show that, even for subtle sequence variations, our protocol can predict the preferred topology of coiled-coil dimers and trimers, the preferred oligomerization states of coiled-coil dimers, trimers, and tetramers, as well as the switching behavior of a pH-dependent multistate system. Our approach provides a method for predicting the stability of coiled-coil designs and offers a new framework for computing binding free energies in protein-protein and multiprotein complexes.
Collapse
Affiliation(s)
- Evangelia Notari
- EaStCHEM
School of Chemistry, University of Edinburgh, David Brewster Road, Edinburgh EH9 3FJ, U.K.
| | - Christopher W. Wood
- School
of Biological Sciences, University of Edinburgh, Roger Land Building, Edinburgh EH9 3FF, U.K.
| | - Julien Michel
- EaStCHEM
School of Chemistry, University of Edinburgh, David Brewster Road, Edinburgh EH9 3FJ, U.K.
| |
Collapse
|
2
|
Yadav AK, Prakash MV, Bandyopadhyay P. Physics-Based Machine Learning to Predict Hydration Free Energies for Small Molecules with a Minimal Number of Descriptors: Interpretable and Accurate. J Phys Chem B 2025; 129:1640-1647. [PMID: 39841935 DOI: 10.1021/acs.jpcb.4c07090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2025]
Abstract
Hydration free energy (HFE) of molecules is a fundamental property having importance throughout chemistry and biology. Calculation of the HFE can be challenging and expensive with classical molecular dynamics simulation-based approaches. Machine learning (ML) models are increasingly being used to predict HFE. Although the accuracy of ML models for data sets for small molecules is impressive, these models suffer from lack of interpretability. In this work, we have developed a physics-based ML model with only six descriptors, which is both accurate and fully interpretable, and applied it to a database for small molecule HFE, FreeSolv. We evaluated the electrostatic energy by an approximate closed form of the Generalized Born (GB) model and polar surface area. In addition, we have log P and hydrogen bond acceptor and donors as descriptors along with the number of rotatable bonds. We have used different ML models, such as random forest and extreme gradient boosting. The best result from these models has a mean absolute error of only 0.74 kcal/mol. The main power of this model is that the descriptors have clear physical meaning, and it was found that the descriptor describing the electrostatics and the polar surface area, followed by the hydrogen bond donors and acceptors, are the most important factors for the calculation of hydration free energy.
Collapse
Affiliation(s)
- Ajeet Kumar Yadav
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| | - Marvin V Prakash
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| | - Pradipta Bandyopadhyay
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| |
Collapse
|
3
|
Piskorz T, Perez-Chirinos L, Qiao B, Sasselli IR. Tips and Tricks in the Modeling of Supramolecular Peptide Assemblies. ACS OMEGA 2024; 9:31254-31273. [PMID: 39072142 PMCID: PMC11270692 DOI: 10.1021/acsomega.4c02628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 06/17/2024] [Accepted: 06/19/2024] [Indexed: 07/30/2024]
Abstract
Supramolecular peptide assemblies (SPAs) hold promise as materials for nanotechnology and biomedicine. Although their investigation often entails adapting experimental techniques from their protein counterparts, SPAs are fundamentally distinct from proteins, posing unique challenges for their study. Computational methods have emerged as indispensable tools for gaining deeper insights into SPA structures at the molecular level, surpassing the limitations of experimental techniques, and as screening tools to reduce the experimental search space. However, computational studies have grappled with issues stemming from the absence of standardized procedures and relevant crystal structures. Fundamental disparities between SPAs and protein simulations, such as the absence of experimentally validated initial structures and the importance of the simulation size, number of molecules, and concentration, have compounded these challenges. Understanding the roles of various parameters and the capabilities of different models and simulation setups remains an ongoing endeavor. In this review, we aim to provide readers with guidance on the parameters to consider when conducting SPA simulations, elucidating their potential impact on outcomes and validity.
Collapse
Affiliation(s)
| | - Laura Perez-Chirinos
- Center
for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramón 182, 20014 Donostia-San Sebastián, Spain
| | - Baofu Qiao
- Department
of Natural Sciences, Baruch College, City
University of New York, New York, New York 10010, United States
| | - Ivan R. Sasselli
- Centro
de Física de Materiales (CFM), CSIC-UPV/EHU, Paseo Manuel de Lardizabal 5, 20018 San Sebastián, Spain
| |
Collapse
|
4
|
Katzberger P, Riniker S. A general graph neural network based implicit solvation model for organic molecules in water. Chem Sci 2024; 15:10794-10802. [PMID: 39027274 PMCID: PMC11253111 DOI: 10.1039/d4sc02432j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Accepted: 05/24/2024] [Indexed: 07/20/2024] Open
Abstract
The dynamical behavior of small molecules in their environment can be studied with classical molecular dynamics (MD) simulations to gain deeper insight on an atomic level and thus complement and rationalize the interpretation of experimental findings. Such approaches are of great value in various areas of research, e.g., in the development of new therapeutics. The accurate description of solvation effects in such simulations is thereby key and has in consequence been an active field of research since the introduction of MD. So far, the most accurate approaches involve computationally expensive explicit solvent simulations, while widely applied models using an implicit solvent description suffer from reduced accuracy. Recently, machine learning (ML) approaches that provide a probabilistic representation of solvation effects have been proposed as potential alternatives. However, the associated computational costs and minimal or lack of transferability render them unusable in practice. Here, we report the first example of a transferable ML-based implicit solvent model trained on a diverse set of 3 000 000 molecular structures that can be applied to organic small molecules for simulations in water. Extensive testing against reference calculations demonstrated that the model delivers on par accuracy with explicit solvent simulations while providing an up to 18-fold increase in sampling rate.
Collapse
Affiliation(s)
- Paul Katzberger
- Department of Chemistry and Applied Biosciences, ETH Zürich Vladimir-Prelog-Weg 2 8093 Zürich Switzerland
| | - Sereina Riniker
- Department of Chemistry and Applied Biosciences, ETH Zürich Vladimir-Prelog-Weg 2 8093 Zürich Switzerland
| |
Collapse
|
5
|
Cao X, Hummel MH, Wang Y, Simmerling C, Coutsias EA. Exact Analytical Algorithm for the Solvent-Accessible Surface Area and Derivatives in Implicit Solvent Molecular Simulations on GPUs. J Chem Theory Comput 2024; 20:4456-4468. [PMID: 38780181 PMCID: PMC11530138 DOI: 10.1021/acs.jctc.3c01366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]
Abstract
In this paper, we present differentiable solvent-accessible surface area (dSASA), an exact geometric method to calculate SASA analytically along with atomic derivatives on GPUs. The atoms in a molecule are first assigned to tetrahedra in groups of four atoms by Delaunay tetrahedralization adapted for efficient GPU implementation, and the SASA values for atoms and molecules are calculated based on the tetrahedralization information and inclusion-exclusion method. The SASA values from the numerical icosahedral-based method can be reproduced with >98% accuracy for both proteins and RNAs. Having been implemented on GPUs and incorporated into AMBER, we can apply dSASA to implicit solvent molecular dynamics simulations with the inclusion of this nonpolar term. The current GPU version of GB/SA simulations has been accelerated up to nearly 20-fold compared to the CPU version, outperforming LCPO, a commonly used, fast algorithm for calculating SASA, as the system size increases. While we focus on the accuracy of the SASA calculations for proteins and nucleic acids, we also demonstrate stable GB/SA MD mini-protein simulations.
Collapse
Affiliation(s)
- Xin Cao
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794, United States
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| | - Michelle H Hummel
- Sandia National Laboratories, Albuquerque, New Mexico 87123, United States
| | - Yuzhang Wang
- Department of Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
| | - Carlos Simmerling
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794, United States
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| | - Evangelos A Coutsias
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794, United States
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, United States
| |
Collapse
|
6
|
Varenyk Y, Theodorakis PE, Pham DQH, Li MS, Krupa P. Exploring Structural Insights of Aβ42 and α-Synuclein Monomers and Heterodimer: A Comparative Study Using Implicit and Explicit Solvent Simulations. J Phys Chem B 2024; 128:4655-4669. [PMID: 38700150 PMCID: PMC11103699 DOI: 10.1021/acs.jpcb.4c00503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 04/10/2024] [Accepted: 04/16/2024] [Indexed: 05/05/2024]
Abstract
Protein misfolding, aggregation, and fibril formation play a central role in the development of severe neurological disorders, including Alzheimer's and Parkinson's diseases. The structural stability of mature fibrils in these diseases is of great importance, as organisms struggle to effectively eliminate amyloid plaques. To address this issue, it is crucial to investigate the early stages of fibril formation when monomers aggregate into small, toxic, and soluble oligomers. However, these structures are inherently disordered, making them challenging to study through experimental approaches. Recently, it has been shown experimentally that amyloid-β 42 (Aβ42) and α-synuclein (α-Syn) can coassemble. This has motivated us to investigate the interaction between their monomers as a first step toward exploring the possibility of forming heterodimeric complexes. In particular, our study involves the utilization of various Amber and CHARMM force-fields, employing both implicit and explicit solvent models in replica exchange and conventional simulation modes. This comprehensive approach allowed us to assess the strengths and weaknesses of these solvent models and force fields in comparison to experimental and theoretical findings, ensuring the highest level of robustness. Our investigations revealed that Aβ42 and α-Syn monomers can indeed form stable heterodimers, and the resulting heterodimeric model exhibits stronger interactions compared to the Aβ42 dimer. The binding of α-Syn to Aβ42 reduces the propensity of Aβ42 to adopt fibril-prone conformations and induces significant changes in its conformational properties. Notably, in AMBER-FB15 and CHARMM36m force fields with the use of explicit solvent, the presence of Aβ42 significantly increases the β-content of α-Syn, consistent with the experiments showing that Aβ42 triggers α-Syn aggregation. Our analysis clearly shows that although the use of implicit solvent resulted in too large compactness of monomeric α-Syn, structural properties of monomeric Aβ42 and the heterodimer were preserved in explicit-solvent simulations. We anticipate that our study sheds light on the interaction between α-Syn and Aβ42 proteins, thus providing the atom-level model required to assess the initial stage of aggregation mechanisms related to Alzheimer's and Parkinson's diseases.
Collapse
Affiliation(s)
- Yuliia Varenyk
- Institute
of Physics Polish Academy of Sciences, Al. Lotnikow 32/46, 02-668 Warsaw, Poland
- Department
of Theoretical Chemistry, University of
Vienna, Vienna 1090, Austria
| | | | - Dinh Q. H. Pham
- Institute
of Physics Polish Academy of Sciences, Al. Lotnikow 32/46, 02-668 Warsaw, Poland
| | - Mai Suan Li
- Institute
of Physics Polish Academy of Sciences, Al. Lotnikow 32/46, 02-668 Warsaw, Poland
| | - Paweł Krupa
- Institute
of Physics Polish Academy of Sciences, Al. Lotnikow 32/46, 02-668 Warsaw, Poland
| |
Collapse
|
7
|
Dinic J, Tirrell MV. Effects of Charge Sequence Pattern and Lysine-to-Arginine Substitution on the Structural Stability of Bioinspired Polyampholytes. Biomacromolecules 2024; 25:2838-2851. [PMID: 38567844 PMCID: PMC11094733 DOI: 10.1021/acs.biomac.4c00002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2024] [Revised: 03/17/2024] [Accepted: 03/18/2024] [Indexed: 05/14/2024]
Abstract
A comprehensive study focusing on the combined influence of the charge sequence pattern and the type of positively charged amino acids on the formation of secondary structures in sequence-specific polyampholytes is presented. The sequences of interest consisting exclusively of ionizable amino acids (lysine, K; arginine, R; and glutamic acid, E) are (EKEK)5, (EKKE)5, (ERER)5, (ERRE)5, and (EKER)5. The stability of the secondary structure was examined at three pH values in the presence of urea and NaCl. The results presented here underscore the combined prominent effects of the charge sequence pattern and the type of positively charged monomers on secondary structure formation. Additionally, (ERRE)5 readily aggregated across a wide range of pH. In contrast, sequences with the same charge pattern, (EKKE)5, as well as the sequences with the equivalent amino acid content, (ERER)5, exhibited no aggregate formation under equivalent pH and concentration conditions.
Collapse
Affiliation(s)
- Jelena Dinic
- Pritzker
School of Molecular Engineering, University
of Chicago, Chicago, Illinois 60637, United States
- Center
for Molecular Engineering and Materials Science Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
| | - Matthew V. Tirrell
- Pritzker
School of Molecular Engineering, University
of Chicago, Chicago, Illinois 60637, United States
- Center
for Molecular Engineering and Materials Science Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
| |
Collapse
|
8
|
Cao X, Hummel MH, Wang Y, Simmerling C, Coutsias EA. Exact analytical algorithm for solvent accessible surface area and derivatives in implicit solvent molecular simulations on GPUs. ARXIV 2024:arXiv:2401.10462v2. [PMID: 38313200 PMCID: PMC10836080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/06/2024]
Abstract
In this paper, we present dSASA (differentiable SASA), an exact geometric method to calculate solvent accessible surface area (SASA) analytically along with atomic derivatives on GPUs. The atoms in a molecule are first assigned to tetrahedra in groups of four atoms by Delaunay tetrahedrization adapted for efficient GPU implementation and the SASA values for atoms and molecules are calculated based on the tetrahedrization information and inclusion-exclusion method. The SASA values from the numerical icosahedral-based method can be reproduced with more than 98% accuracy for both proteins and RNAs. Having been implemented on GPUs and incorporated into the software Amber, we can apply dSASA to implicit solvent molecular dynamics simulations with inclusion of this nonpolar term. The current GPU version of GB/SA simulations has been accelerated up to nearly 20-fold compared to the CPU version, outperforming LCPO, a commonly used, fast algorithm for calculating SASA, as the system size increases. While we focus on the accuracy of the SASA calculations for proteins and nucleic acids, we also demonstrate stable GB/SA MD mini-protein simulations.
Collapse
Affiliation(s)
- Xin Cao
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794, United States
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794, United States
| | | | - Yuzhang Wang
- Department of Chemistry, Stony Brook University, Stony Brook, NY 11794, United States
| | - Carlos Simmerling
- Department of Chemistry, Stony Brook University, Stony Brook, NY 11794, United States
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794, United States
| | - Evangelos A Coutsias
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794, United States
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794, United States
| |
Collapse
|
9
|
Greener JG. Differentiable simulation to develop molecular dynamics force fields for disordered proteins. Chem Sci 2024; 15:4897-4909. [PMID: 38550690 PMCID: PMC10966991 DOI: 10.1039/d3sc05230c] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Accepted: 02/08/2024] [Indexed: 11/11/2024] Open
Abstract
Implicit solvent force fields are computationally efficient but can be unsuitable for running molecular dynamics on disordered proteins. Here I improve the a99SB-disp force field and the GBNeck2 implicit solvent model to better describe disordered proteins. Differentiable molecular simulations with 5 ns trajectories are used to jointly optimise 108 parameters to better match explicit solvent trajectories. Simulations with the improved force field better reproduce the radius of gyration and secondary structure content seen in experiments, whilst showing slightly degraded performance on folded proteins and protein complexes. The force field, called GB99dms, reproduces the results of a small molecule binding study and improves agreement with experiment for the aggregation of amyloid peptides. GB99dms, which can be used in OpenMM, is available at https://github.com/greener-group/GB99dms. This work is the first to show that gradients can be obtained directly from nanosecond-length differentiable simulations of biomolecules and highlights the effectiveness of this approach to training whole force fields to match desired properties.
Collapse
Affiliation(s)
- Joe G Greener
- Medical Research Council Laboratory of Molecular Biology Cambridge CB2 0QH UK
| |
Collapse
|
10
|
Anila S, Samsonov SA. Benchmarking Water Models in Molecular Dynamics of Protein-Glycosaminoglycan Complexes. J Chem Inf Model 2024; 64:1691-1703. [PMID: 38410841 PMCID: PMC10934818 DOI: 10.1021/acs.jcim.4c00030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 02/08/2024] [Accepted: 02/13/2024] [Indexed: 02/28/2024]
Abstract
Glycosaminoglycans (GAGs) made of repeating disaccharide units intricately engage with proteins, playing a crucial role in the spatial organization of the extracellular matrix (ECM) and the transduction of biological signals in cells to modulate a number of biochemical processes. Exploring protein-GAG interactions reveals several challenges for their analysis, namely, the highly charged and periodic nature of GAGs, their multipose binding, and the abundance of the interfacial water molecules in the protein-GAG complexes. Most of the studies on protein-GAG interactions are conducted using the TIP3P water model, and there are no data on the effect of various water models on the results obtained in molecular dynamics (MD) simulations of protein-GAG complexes. Hence, it is essential to perform a systematic analysis of different water models in MD simulations for these systems. In this work, we aim to evaluate the properties of the protein-GAG complexes in MD simulations using different explicit: TIP3P, SPC/E, TIP4P, TIP4PEw, OPC, and TIP5P and implicit: IGB = 1, 2, 5, 7, and 8 water models to find out which of them are best suited to study the dynamics of protein-GAG complexes. The FF14SB and GLYCAM06 force fields were used for the proteins and GAGs, respectively. The interactions of several GAG types, such as heparin, chondroitin sulfate, and hyaluronic acid with basic fibroblast growth factor, cathepsin K, and CD44 receptor, respectively, are investigated. The observed variations in different descriptors used to study the binding in these complexes emphasize the relevance of the choice of water models for the MD simulation of these complexes.
Collapse
Affiliation(s)
- Sebastian Anila
- Faculty of Chemistry, University of Gdańsk, ul. Wita Stwosza 63, 80-308 Gdańsk, Poland
| | - Sergey A. Samsonov
- Faculty of Chemistry, University of Gdańsk, ul. Wita Stwosza 63, 80-308 Gdańsk, Poland
| |
Collapse
|
11
|
Fossat MJ, Posey AE, Pappu RV. Uncovering the Contributions of Charge Regulation to the Stability of Single Alpha Helices. Chemphyschem 2023; 24:e202200746. [PMID: 36599672 PMCID: PMC10734359 DOI: 10.1002/cphc.202200746] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 12/07/2022] [Indexed: 01/06/2023]
Abstract
The single alpha helix (SAH) is a recurring motif in biology. The consensus sequence has a di-block architecture that includes repeats of four consecutive glutamate residues followed by four consecutive lysine residues. Measurements show that the overall helicity of sequences with consensus E4 K4 repeats is insensitive to a wide range of pH values. Here, we use the recently introduced q-canonical ensemble, which allows us to decouple measurements of charge state and conformation, to explain the observed insensitivity of SAH helicity to pH. We couple the outputs from separate measurements of charge and conformation with atomistic simulations to derive residue-specific quantifications of preferences for being in an alpha helix and for the ionizable residues to be charged vs. uncharged. We find a clear preference for accommodating uncharged Glu residues within internal positions of SAH-forming sequences. The stabilities of alpha helical conformations increase with the number of E4 K4 repeats and so do the numbers of accessible charge states that are compatible with forming conformations of high helical content. There is conformational buffering whereby charge state heterogeneity buffers against large-scale conformational changes thus making the overall helicity insensitive to large changes in pH. Further, the results clearly argue against a single, rod-like alpha helical conformation being the only or even dominant conformation in the ensembles of so-called SAH sequences.
Collapse
Affiliation(s)
| | | | - Rohit V. Pappu
- Department of Biomedical Engineering and the Center for Biomolecular Condensates, James McKelvey School of Engineering, Washington University in St. Louis, St. Louis, MO 63130
| |
Collapse
|
12
|
Marcisz M, Samsonov SA. Solvent Model Benchmark for Molecular Dynamics of Glycosaminoglycans. J Chem Inf Model 2023; 63:2147-2157. [PMID: 36989082 PMCID: PMC10091405 DOI: 10.1021/acs.jcim.2c01472] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/30/2023]
Abstract
In computational studies of glycosaminoglycans (GAGs), a group of anionic, periodic linear polysaccharides, so far there has been very little discussion about the role of solvent models in the molecular dynamics simulations of these molecules. Predominantly, the TIP3P water model is commonly used as one of the most popular explicit water models in general. However, there are numerous alternative explicit and implicit water models that are neglected in the computational research of GAGs. Since solvent-mediated interactions are particularly important for GAG dynamic and structural properties, it would be of great interest for the GAG community to establish the solvent model that is suited the best in terms of the quality of theoretically obtained GAG parameters and, at the same time, would be reasonably demanding in terms of computational resources required. In this study, heparin (HP) was simulated using five implicit and six explicit solvent models with the aim to find out how different solvent models influence HP's molecular descriptors in the molecular dynamics simulations. Here, we initiate the search for the most appropriate solvent representation for GAG systems and we hope to encourage other groups to contribute to this highly relevant subject.
Collapse
Affiliation(s)
- Mateusz Marcisz
- Faculty of Chemistry, University of Gdańsk, ul. Wita Stwosza 63, 80-308 Gdańsk, Poland
- Intercollegiate Faculty of Biotechnology of UG and MUG, ul. Abrahama 58, 80-307 Gdańsk, Poland
| | - Sergey A Samsonov
- Faculty of Chemistry, University of Gdańsk, ul. Wita Stwosza 63, 80-308 Gdańsk, Poland
| |
Collapse
|
13
|
Yao S, Van R, Pan X, Park JH, Mao Y, Pu J, Mei Y, Shao Y. Machine learning based implicit solvent model for aqueous-solution alanine dipeptide molecular dynamics simulations. RSC Adv 2023; 13:4565-4577. [PMID: 36760282 PMCID: PMC9900604 DOI: 10.1039/d2ra08180f] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 01/20/2023] [Indexed: 02/05/2023] Open
Abstract
Inspired by the recent work from Noé and coworkers on the development of machine learning based implicit solvent model for the simulation of solvated peptides [Chen et al., J. Chem. Phys., 2021, 155, 084101], here we report another investigation of the possibility of using machine learning (ML) techniques to "derive" an implicit solvent model directly from explicit solvent molecular dynamics (MD) simulations. For alanine dipeptide, a machine learning potential (MLP) based on the DeepPot-SE representation of the molecule was trained to capture its interactions with its average solvent environment configuration (ASEC). The predicted forces on the solute deviated only by an RMSD of 0.4 kcal mol-1 Å-1 from the reference values, and the MLP-based free energy surface differed from that obtained from explicit solvent MD simulations by an RMSD of less than 0.9 kcal mol-1. Our MLP training protocol could also accurately reproduce combined quantum mechanical molecular mechanical (QM/MM) forces on the quantum mechanical (QM) solute in ASEC environment, thus enabling the development of accurate ML-based implicit solvent models for ab initio-QM MD simulations. Such ML-based implicit solvent models for QM calculations are cost-effective in both the training stage, where the use of ASEC reduces the number of data points to be labelled, and the inference stage, where the MLP can be evaluated at a relatively small additional cost on top of the QM calculation of the solute.
Collapse
Affiliation(s)
- Songyuan Yao
- Department of Chemistry and Biochemistry, University of Oklahoma Norman OK 73019 USA
| | - Richard Van
- Department of Chemistry and Biochemistry, University of Oklahoma Norman OK 73019 USA
| | - Xiaoliang Pan
- Department of Chemistry and Biochemistry, University of Oklahoma Norman OK 73019 USA
| | - Ji Hwan Park
- School of Computer Science, University of Oklahoma Norman OK 73019 USA
| | - Yuezhi Mao
- Department of Chemistry and Biochemistry, San Diego State University San Diego CA 92182 USA
| | - Jingzhi Pu
- Department of Chemistry and Chemical Biology, Indiana University-Purdue University Indianapolis Indianapolis IN 46202 USA
| | - Ye Mei
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University Shanghai 200062 China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai Shanghai 200062 China
- Collaborative Innovation Center of Extreme Optics, Shanxi University Taiyuan Shanxi 030006 China
| | - Yihan Shao
- Department of Chemistry and Biochemistry, University of Oklahoma Norman OK 73019 USA
| |
Collapse
|
14
|
Sethio D, Poongavanam V, Xiong R, Tyagi M, Duy Vo D, Lindh R, Kihlberg J. Simulation Reveals the Chameleonic Behavior of Macrocycles. J Chem Inf Model 2023; 63:138-146. [PMID: 36563083 PMCID: PMC9832480 DOI: 10.1021/acs.jcim.2c01093] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Conformational analysis is central to the design of bioactive molecules. It is particularly challenging for macrocycles due to noncovalent transannular interactions, steric interactions, and ring strain that are often coupled. Herein, we simulated the conformations of five macrocycles designed to express a progression of increasing complexity in environment-dependent intramolecular interactions and verified the results against NMR measurements in chloroform and dimethyl sulfoxide. Molecular dynamics using an explicit solvent model, but not the Monte Carlo method with implicit solvation, handled both solvents correctly. Refinement of conformations at the ab initio level was fundamental to reproducing the experimental observations─standard state-of-the-art molecular mechanics force fields were insufficient. Our simulations correctly predicted the intramolecular interactions between side chains and the macrocycle and revealed an unprecedented solvent-induced conformational switch of the macrocyclic ring. Our results provide a platform for the rational, prospective design of molecular chameleons that adapt to the properties of the environment.
Collapse
|