1
|
Khan MZI, Ren JN, Cao C, Ye HYX, Wang H, Guo YM, Yang JR, Chen JZ. Comprehensive hepatotoxicity prediction: ensemble model integrating machine learning and deep learning. Front Pharmacol 2024; 15:1441587. [PMID: 39234116 PMCID: PMC11373136 DOI: 10.3389/fphar.2024.1441587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Accepted: 07/24/2024] [Indexed: 09/06/2024] Open
Abstract
Background Chemicals may lead to acute liver injuries, posing a serious threat to human health. Achieving the precise safety profile of a compound is challenging due to the complex and expensive testing procedures. In silico approaches will aid in identifying the potential risk of drug candidates in the initial stage of drug development and thus mitigating the developmental cost. Methods In current studies, QSAR models were developed for hepatotoxicity predictions using the ensemble strategy to integrate machine learning (ML) and deep learning (DL) algorithms using various molecular features. A large dataset of 2588 chemicals and drugs was randomly divided into training (80%) and test (20%) sets, followed by the training of individual base models using diverse machine learning or deep learning based on three different kinds of descriptors and fingerprints. Feature selection approaches were employed to proceed with model optimizations based on the model performance. Hybrid ensemble approaches were further utilized to determine the method with the best performance. Results The voting ensemble classifier emerged as the optimal model, achieving an excellent prediction accuracy of 80.26%, AUC of 82.84%, and recall of over 93% followed by bagging and stacking ensemble classifiers method. The model was further verified by an external test set, internal 10-fold cross-validation, and rigorous benchmark training, exhibiting much better reliability than the published models. Conclusion The proposed ensemble model offers a dependable assessment with a good performance for the prediction regarding the risk of chemicals and drugs to induce liver damage.
Collapse
Affiliation(s)
| | - Jia-Nan Ren
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| | - Cheng Cao
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
- Polytechnic Institute, Zhejiang University, Hangzhou, China
| | - Hong-Yu-Xiang Ye
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| | - Hao Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| | - Ya-Min Guo
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| | - Jin-Rong Yang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
- Polytechnic Institute, Zhejiang University, Hangzhou, China
| | - Jian-Zhong Chen
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| |
Collapse
|
2
|
Wang L, Behara PK, Thompson MW, Gokey T, Wang Y, Wagner JR, Cole DJ, Gilson MK, Shirts MR, Mobley DL. The Open Force Field Initiative: Open Software and Open Science for Molecular Modeling. J Phys Chem B 2024; 128:7043-7067. [PMID: 38989715 DOI: 10.1021/acs.jpcb.4c01558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
Force fields are a key component of physics-based molecular modeling, describing the energies and forces in a molecular system as a function of the positions of the atoms and molecules involved. Here, we provide a review and scientific status report on the work of the Open Force Field (OpenFF) Initiative, which focuses on the science, infrastructure and data required to build the next generation of biomolecular force fields. We introduce the OpenFF Initiative and the related OpenFF Consortium, describe its approach to force field development and software, and discuss accomplishments to date as well as future plans. OpenFF releases both software and data under open and permissive licensing agreements to enable rapid application, validation, extension, and modification of its force fields and software tools. We discuss lessons learned to date in this new approach to force field development. We also highlight ways that other force field researchers can get involved, as well as some recent successes of outside researchers taking advantage of OpenFF tools and data.
Collapse
Affiliation(s)
- Lily Wang
- Open Force Field, Open Molecular Software Foundation, Davis, California 95616, United States
| | - Pavan Kumar Behara
- Center for Neurotherapeutics, University of California, Irvine, California 92697, United States
| | - Matthew W Thompson
- Open Force Field, Open Molecular Software Foundation, Davis, California 95616, United States
| | - Trevor Gokey
- Department of Chemistry, University of California, Irvine, California 92697, United States
| | - Yuanqing Wang
- Simons Center for Computational Physical Chemistry and Center for Data Science, New York, New York 10004, United States
| | - Jeffrey R Wagner
- Open Force Field, Open Molecular Software Foundation, Davis, California 95616, United States
| | - Daniel J Cole
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne NE1 7RU, United Kingdom
| | - Michael K Gilson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, The University of California at San Diego, La Jolla, California 92093, United States
| | - Michael R Shirts
- Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, Colorado 80305, United States
| | - David L Mobley
- Department of Chemistry, University of California, Irvine, California 92697, United States
- Department of Pharmaceutical Sciences, University of California, Irvine, California 92697, United States
| |
Collapse
|
3
|
Thürlemann M, Böselt L, Riniker S. Regularized by Physics: Graph Neural Network Parametrized Potentials for the Description of Intermolecular Interactions. J Chem Theory Comput 2023; 19:562-579. [PMID: 36633918 PMCID: PMC9878731 DOI: 10.1021/acs.jctc.2c00661] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Indexed: 01/13/2023]
Abstract
Simulations of molecular systems using electronic structure methods are still not feasible for many systems of biological importance. As a result, empirical methods such as force fields (FF) have become an established tool for the simulation of large and complex molecular systems. The parametrization of FF is, however, time-consuming and has traditionally been based on experimental data. Recent years have therefore seen increasing efforts to automatize FF parametrization or to replace FF with machine-learning (ML) based potentials. Here, we propose an alternative strategy to parametrize FF, which makes use of ML and gradient-descent based optimization while retaining a functional form founded in physics. Using a predefined functional form is shown to enable interpretability, robustness, and efficient simulations of large systems over long time scales. To demonstrate the strength of the proposed method, a fixed-charge and a polarizable model are trained on ab initio potential-energy surfaces. Given only information about the constituting elements, the molecular topology, and reference potential energies, the models successfully learn to assign atom types and corresponding FF parameters from scratch. The resulting models and parameters are validated on a wide range of experimentally and computationally derived properties of systems including dimers, pure liquids, and molecular crystals.
Collapse
Affiliation(s)
- Moritz Thürlemann
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Lennard Böselt
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Sereina Riniker
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| |
Collapse
|
4
|
Bajorath J, Chávez-Hernández AL, Duran-Frigola M, Fernández-de Gortari E, Gasteiger J, López-López E, Maggiora GM, Medina-Franco JL, Méndez-Lucio O, Mestres J, Miranda-Quintana RA, Oprea TI, Plisson F, Prieto-Martínez FD, Rodríguez-Pérez R, Rondón-Villarreal P, Saldívar-Gonzalez FI, Sánchez-Cruz N, Valli M. Chemoinformatics and artificial intelligence colloquium: progress and challenges in developing bioactive compounds. J Cheminform 2022; 14:82. [PMID: 36461094 PMCID: PMC9716667 DOI: 10.1186/s13321-022-00661-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Accepted: 11/25/2022] [Indexed: 12/03/2022] Open
Abstract
We report the main conclusions of the first Chemoinformatics and Artificial Intelligence Colloquium, Mexico City, June 15-17, 2022. Fifteen lectures were presented during a virtual public event with speakers from industry, academia, and non-for-profit organizations. Twelve hundred and ninety students and academics from more than 60 countries. During the meeting, applications, challenges, and opportunities in drug discovery, de novo drug design, ADME-Tox (absorption, distribution, metabolism, excretion and toxicity) property predictions, organic chemistry, peptides, and antibiotic resistance were discussed. The program along with the recordings of all sessions are freely available at https://www.difacquim.com/english/events/2022-colloquium/ .
Collapse
Affiliation(s)
- Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Friedrich-Hirzebruch-Allee 5/6, 53113, Bonn, Germany
| | - Ana L Chávez-Hernández
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, National Autonomous University of Mexico, 04510, Mexico City, Mexico
| | - Miquel Duran-Frigola
- Ersilia Open Source Initiative, Cambridge, UK
- Joint IRB-BSC-CRG Programme in Computational Biology, Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain
| | - Eli Fernández-de Gortari
- Nanosafety Laboratory, International Iberian Nanotechnology Laboratory, 4715-330, Braga, Portugal
| | - Johann Gasteiger
- Computer-Chemie-Centrum, University of Erlangen-Nuremberg, Erlangen, Germany
| | - Edgar López-López
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, National Autonomous University of Mexico, 04510, Mexico City, Mexico
- Department of Pharmacology, Center for Research and Advanced Studies of the National Polytechnic Institute (CINVESTAV), 07360, Mexico City, Mexico
| | | | - José L Medina-Franco
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, National Autonomous University of Mexico, 04510, Mexico City, Mexico.
| | | | - Jordi Mestres
- Chemotargets SL, Baldiri Reixac 4, Parc Cientific de Barcelona (PCB), 08028, Barcelona, Catalonia, Spain
- Research Group on Systems Pharmacology, Research Program on Biomedical Informatics (GRIB), IMIM Hospital del Mar Medical Research Institute and University Pompeu Fabra, Parc de Recerca Biomedica (PRBB), 08003, Barcelona, Catalonia, Spain
| | | | - Tudor I Oprea
- Department of Internal Medicine, University of New Mexico School of Medicine, Albuquerque, NM, 87131, USA
- Department of Rheumatology and Inflammation Research, Institute of Medicine, Sahlgrenska Academy at Gothenburg University, 40530, Gothenburg, Sweden
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200, Copenhagen, Denmark
- Roivant Discovery Sciences, Inc., 451 D Street, Boston, MA, 02210, USA
| | - Fabien Plisson
- Department of Biotechnology and Biochemistry, Center for Research and Advanced Studies of the National Polytechnic Institute (CINVESTAV-IPN), Irapuato Unit, 36824, Irapuato, Gto, Mexico
| | | | | | - Paola Rondón-Villarreal
- Universidad de Santander, Facultad de Ciencias Médicas y de la Salud, Instituto de Investigación Masira, Calle 70 No. 55-210, 680003, Santander, Bucaramanga, Colombia
| | - Fernanda I Saldívar-Gonzalez
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, National Autonomous University of Mexico, 04510, Mexico City, Mexico
| | - Norberto Sánchez-Cruz
- Chemotargets SL, Baldiri Reixac 4, Parc Cientific de Barcelona (PCB), 08028, Barcelona, Catalonia, Spain
- Instituto de Química, Unidad Mérida, Universidad Nacional Autónoma de México, Carretera Mérida-Tetiz Km. 4.5, Yucatán, 97357, Ucú, Mexico
| | - Marilia Valli
- Nuclei of Bioassays, Biosynthesis and Ecophysiology of Natural Products (NuBBE), Department of Organic Chemistry, Institute of Chemistry, São Paulo State University-UNESP, Araraquara, Brazil
| |
Collapse
|
5
|
Wang Y, Fass J, Kaminow B, Herr JE, Rufa D, Zhang I, Pulido I, Henry M, Bruce Macdonald HE, Takaba K, Chodera JD. End-to-end differentiable construction of molecular mechanics force fields. Chem Sci 2022; 13:12016-12033. [PMID: 36349096 PMCID: PMC9600499 DOI: 10.1039/d2sc02739a] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 09/05/2022] [Indexed: 01/07/2023] Open
Abstract
Molecular mechanics (MM) potentials have long been a workhorse of computational chemistry. Leveraging accuracy and speed, these functional forms find use in a wide variety of applications in biomolecular modeling and drug discovery, from rapid virtual screening to detailed free energy calculations. Traditionally, MM potentials have relied on human-curated, inflexible, and poorly extensible discrete chemical perception rules (atom types) for applying parameters to small molecules or biopolymers, making it difficult to optimize both types and parameters to fit quantum chemical or physical property data. Here, we propose an alternative approach that uses graph neural networks to perceive chemical environments, producing continuous atom embeddings from which valence and nonbonded parameters can be predicted using invariance-preserving layers. Since all stages are built from smooth neural functions, the entire process-spanning chemical perception to parameter assignment-is modular and end-to-end differentiable with respect to model parameters, allowing new force fields to be easily constructed, extended, and applied to arbitrary molecules. We show that this approach is not only sufficiently expressive to reproduce legacy atom types, but that it can learn to accurately reproduce and extend existing molecular mechanics force fields. Trained with arbitrary loss functions, it can construct entirely new force fields self-consistently applicable to both biopolymers and small molecules directly from quantum chemical calculations, with superior fidelity than traditional atom or parameter typing schemes. When adapted to simultaneously fit partial charge models, espaloma delivers high-quality partial atomic charges orders of magnitude faster than current best-practices with low inaccuracy. When trained on the same quantum chemical small molecule dataset used to parameterize the Open Force Field ("Parsley") openff-1.2.0 small molecule force field augmented with a peptide dataset, the resulting espaloma model shows superior accuracy vis-á-vis experiments in computing relative alchemical free energy calculations for a popular benchmark. This approach is implemented in the free and open source package espaloma, available at https://github.com/choderalab/espaloma.
Collapse
Affiliation(s)
- Yuanqing Wang
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer CenterNew York 10065NYUSA,Physiology, Biophysics and System Biology PhD Program, Weill Cornell Medical College, Cornell UniversityNew York 10065NYUSA,MFA Program in Creative Writing, Division of Humanities and Arts, City College of New York, City University of New YorkNew York 10031NYUSA
| | - Josh Fass
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer CenterNew York 10065NYUSA,Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Medical College, Cornell UniversityNew York 10065NYUSA
| | - Benjamin Kaminow
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer CenterNew York 10065NYUSA,Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Medical College, Cornell UniversityNew York 10065NYUSA
| | - John E. Herr
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer CenterNew York 10065NYUSA
| | - Dominic Rufa
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer CenterNew York 10065NYUSA,Tri-Institutional PhD Program in Chemical Biology, Weill Cornell Medical College, Cornell UniversityNew York 10065NYUSA
| | - Ivy Zhang
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer CenterNew York 10065NYUSA,Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Medical College, Cornell UniversityNew York 10065NYUSA
| | - Iván Pulido
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer CenterNew York 10065NYUSA
| | - Mike Henry
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer CenterNew York 10065NYUSA
| | - Hannah E. Bruce Macdonald
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer CenterNew York 10065NYUSA
| | - Kenichiro Takaba
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer CenterNew York 10065NYUSA,Pharmaceutical Research Center, Advanced Drug Discovery, Asahi Kasei Pharma CorporationShizuoka 410-2321Japan
| | - John D. Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer CenterNew York 10065NYUSA
| |
Collapse
|
6
|
Oliveira MP, Gonçalves YMH, Ol Gheta SK, Rieder SR, Horta BAC, Hünenberger PH. Comparison of the United- and All-Atom Representations of (Halo)alkanes Based on Two Condensed-Phase Force Fields Optimized against the Same Experimental Data Set. J Chem Theory Comput 2022; 18:6757-6778. [PMID: 36190354 DOI: 10.1021/acs.jctc.2c00524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The level of accuracy that can be achieved by a force field is influenced by choices made in the interaction-function representation and in the relevant simulation parameters. These choices, referred to here as functional-form variants (FFVs), include for example the model resolution, the charge-derivation procedure, the van der Waals combination rules, the cutoff distance, and the treatment of the long-range interactions. Ideally, assessing the effect of a given FFV on the intrinsic accuracy of the force-field representation requires that only the specific FFV is changed and that this change is performed at an optimal level of parametrization, a requirement that may prove extremely challenging to achieve in practice. Here, we present a first attempt at such a comparison for one specific FFV, namely the choice of a united-atom (UA) versus an all-atom (AA) resolution in a force field for saturated acyclic (halo)alkanes. Two force-field versions (UA vs AA) are optimized in an automated way using the CombiFF approach against 961 experimental values for the pure-liquid densities ρliq and vaporization enthalpies ΔHvap of 591 compounds. For the AA force field, the torsional and third-neighbor Lennard-Jones parameters are also refined based on quantum-mechanical rotational-energy profiles. The comparison between the UA and AA resolutions is also extended to properties that have not been included as parameterization targets, namely the surface-tension coefficient γ, the isothermal compressibility κT, the isobaric thermal-expansion coefficient αP, the isobaric heat capacity cP, the static relative dielectric permittivity ϵ, the self-diffusion coefficient D, the shear viscosity η, the hydration free energy ΔGwat, and the free energy of solvation ΔGche in cyclohexane. For the target properties ρliq and ΔHvap, the UA and AA resolutions reach very similar levels of accuracy after optimization. For the nine other properties, the AA representation leads to more accurate results in terms of η; comparably accurate results in terms of γ, κT, αP, ϵ, D, and ΔGche; and less accurate results in terms of cP and ΔGwat. This work also represents a first step toward the calibration of a GROMOS-compatible force field at the AA resolution.
Collapse
Affiliation(s)
- Marina P Oliveira
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Yan M H Gonçalves
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - S Kashef Ol Gheta
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Salomé R Rieder
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Bruno A C Horta
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Philippe H Hünenberger
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCI, CH-8093 Zürich, Switzerland
| |
Collapse
|
7
|
Yang JR, Chen Q, Wang H, Hu XY, Guo YM, Chen JZ. Reliable CA-(Q)SAR generation based on entropy weight optimized by grid search and correction factors. Comput Biol Med 2022; 146:105573. [DOI: 10.1016/j.compbiomed.2022.105573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 03/31/2022] [Accepted: 04/26/2022] [Indexed: 11/03/2022]
|
8
|
He X, Walker B, Man VH, Ren P, Wang J. Recent progress in general force fields of small molecules. Curr Opin Struct Biol 2022; 72:187-193. [PMID: 34942567 PMCID: PMC8860847 DOI: 10.1016/j.sbi.2021.11.011] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2021] [Revised: 09/10/2021] [Accepted: 11/16/2021] [Indexed: 02/03/2023]
Abstract
Recent advances in computational hardware and free energy algorithms enable a broader application of molecular simulation of binding interactions between receptors and small-molecule ligands. The underlying molecular mechanics force fields (FFs) for small molecules have also achieved advancements in accuracy, user-friendliness, and speed during the past several years (2018-2020). Besides the expansion of chemical space coverage of ligand-like molecules among major popular classical additive FFs and polarizable FFs, new charge models have been proposed for better accuracy and transferability, new chemical perception of avoiding predefined atom types have been applied, and new automated parameterization toolkits, including machine learning approaches, have been developed for users' convenience.
Collapse
Affiliation(s)
- Xibing He
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA, 15261, USA
| | - Brandon Walker
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, TX, 78712, USA
| | - Viet H Man
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA, 15261, USA
| | - Pengyu Ren
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, TX, 78712, USA.
| | - Junmei Wang
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA, 15261, USA.
| |
Collapse
|
9
|
Alford RF, Samanta R, Gray JJ. Diverse Scientific Benchmarks for Implicit Membrane Energy Functions. J Chem Theory Comput 2021; 17:5248-5261. [PMID: 34310137 DOI: 10.1021/acs.jctc.0c00646] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Energy functions are fundamental to biomolecular modeling. Their success depends on robust physical formalisms, efficient optimization, and high-resolution data for training and validation. Over the past 20 years, progress in each area has advanced soluble protein energy functions. Yet, energy functions for membrane proteins lag behind due to sparse and low-quality data, leading to overfit tools. To overcome this challenge, we assembled a suite of 12 tests on independent data sets varying in size, diversity, and resolution. The tests probe an energy function's ability to capture membrane protein orientation, stability, sequence, and structure. Here, we present the tests and use the franklin2019 energy function to demonstrate them. We then identify areas for energy function improvement and discuss potential future integration with machine-learning-based optimization methods. The tests are available through the Rosetta Benchmark Server (https://benchmark.graylab.jhu.edu/) and GitHub (https://github.com/rfalford12/Implicit-Membrane-Energy-Function-Benchmark).
Collapse
Affiliation(s)
- Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States
| | - Rituparna Samanta
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States.,Program in Molecular Biophysics, Johns Hopkins University, Baltimore, Maryland, United States
| |
Collapse
|
10
|
Kashefolgheta S, Wang S, Acree WE, Hünenberger PH. Evaluation of nine condensed-phase force fields of the GROMOS, CHARMM, OPLS, AMBER, and OpenFF families against experimental cross-solvation free energies. Phys Chem Chem Phys 2021; 23:13055-13074. [PMID: 34105547 PMCID: PMC8207520 DOI: 10.1039/d1cp00215e] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Accepted: 04/28/2021] [Indexed: 12/02/2022]
Abstract
Experimental solvation free energies are nowadays commonly included as target properties in the validation of condensed-phase force fields, sometimes even in their calibration. In a previous article [Kashefolgheta et al., J. Chem. Theory. Comput., 2020, 16, 7556-7580], we showed how the involved comparison between experimental and simulation results could be made more systematic by considering a full matrix of cross-solvation free energies . For a set of N molecules that are all in the liquid state under ambient conditions, such a matrix encompasses N×N entries for considering each of the N molecules either as solute (A) or as solvent (B). In the quoted study, a cross-solvation matrix of 25 × 25 experimental value was introduced, considering 25 small molecules representative for alkanes, chloroalkanes, ethers, ketones, esters, alcohols, amines, and amides. This experimental data was used to compare the relative accuracies of four popular condensed-phase force fields, namely GROMOS-2016H66, OPLS-AA, AMBER-GAFF, and CHARMM-CGenFF. In the present work, the comparison is extended to five additional force fields, namely GROMOS-54A7, GROMOS-ATB, OPLS-LBCC, AMBER-GAFF2, and OpenFF. Considering these nine force fields, the correlation coefficients between experimental values and simulation results range from 0.76 to 0.88, the root-mean-square errors (RMSEs) from 2.9 to 4.8 kJ mol-1, and average errors (AVEEs) from -1.5 to +1.0 kJ mol-1. In terms of RMSEs, GROMOS-2016H66 and OPLS-AA present the best accuracy (2.9 kJ mol-1), followed by OPLS-LBCC, AMBER-GAFF2, AMBER-GAFF, and OpenFF (3.3 to 3.6 kJ mol-1), and then by GROMOS-54A7, CHARM-CGenFF, and GROMOS-ATB (4.0 to 4.8 kJ mol-1). These differences are statistically significant but not very pronounced, and are distributed rather heterogeneously over the set of compounds within the different force fields.
Collapse
Affiliation(s)
- Sadra Kashefolgheta
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCICH-8093 ZürichSwitzerland+41 44 632 55 03
| | - Shuzhe Wang
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCICH-8093 ZürichSwitzerland+41 44 632 55 03
| | - William E. Acree
- Department of Chemistry, University of North Texas1155 Union Circle Drive #305070DentonTexas 76203USA
| | - Philippe H. Hünenberger
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Hönggerberg, HCICH-8093 ZürichSwitzerland+41 44 632 55 03
| |
Collapse
|
11
|
Oliveira MP, Andrey M, Rieder SR, Kern L, Hahn DF, Riniker S, Horta BAC, Hünenberger PH. Systematic Optimization of a Fragment-Based Force Field against Experimental Pure-Liquid Properties Considering Large Compound Families: Application to Saturated Haloalkanes. J Chem Theory Comput 2020; 16:7525-7555. [DOI: 10.1021/acs.jctc.0c00683] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Marina P. Oliveira
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Maurice Andrey
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Salomé R. Rieder
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Leyla Kern
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| | - David F. Hahn
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Sereina Riniker
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| | - Bruno A. C. Horta
- Instituto de Química, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21941-909, Brazil
| | - Philippe H. Hünenberger
- Laboratorium für Physikalische Chemie, ETH Zürich, ETH-Honggerberg, HCI, CH-8093 Zürich, Switzerland
| |
Collapse
|
12
|
Schauperl M, Kantonen SM, Wang LP, Gilson MK. Data-driven analysis of the number of Lennard-Jones types needed in a force field. Commun Chem 2020; 3:173. [PMID: 34295996 PMCID: PMC8294475 DOI: 10.1038/s42004-020-00395-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Accepted: 09/28/2020] [Indexed: 01/12/2023] Open
Abstract
Force fields used in molecular simulations contain numerical parameters, such as Lennard-Jones (LJ) parameters, which are assigned to the atoms in a molecule based on a classification of their chemical environments. The number of classes, or types, should be no more than needed to maximize agreement with experiment, as parsimony avoids overfitting and simplifies parameter optimization. However, types have historically been crafted based largely on chemical intuition, so current force fields may contain more types than needed. In this study, we seek the minimum number of LJ parameter types needed to represent key properties of organic liquids. We find that highly competitive force field accuracy is obtained with minimalist sets of LJ types; e.g. two H types and one type apiece for C, O, and N atoms. We also find that the fitness surface has multiple minima, which can lead to local trapping of the optimizer.
Collapse
Affiliation(s)
- Michael Schauperl
- Skaggs School of Pharmacy and Pharmaceutical Sciences, 9500 Gilman Drive, MC0751, University of California, San Diego, CA 92093-0751 USA
| | - Sophie M Kantonen
- Skaggs School of Pharmacy and Pharmaceutical Sciences, 9500 Gilman Drive, MC0751, University of California, San Diego, CA 92093-0751 USA
| | - Lee-Ping Wang
- Department of Chemistry, University of California, Davis, CA 95616 USA
| | - Michael K Gilson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, 9500 Gilman Drive, MC0751, University of California, San Diego, CA 92093-0751 USA
| |
Collapse
|
13
|
Sohrabi S, Khedri M, Maleki R, Keshavarz Moraveji M. Molecular engineering of the last-generation CNTs in smart cancer therapy by grafting PEG-PLGA-riboflavin. RSC Adv 2020; 10:40637-40648. [PMID: 35519185 PMCID: PMC9057702 DOI: 10.1039/d0ra07500k] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Accepted: 10/23/2020] [Indexed: 12/12/2022] Open
Abstract
In this work, the effect of environment and additives on the self-assembly and delivery of doxorubicin (DOX) have been studied. A microfluidic system with better control over molecular interactions and high surface to volume ratio has superior performance in comparison to the bulk system. Moreover, carbon nanotube (CNT) and CNT-doped structures have a high surface area to incorporate the DOX molecules into a polymer and the presence of functional groups can influence the polymer-drug interactions. In this work, the interactions of DOX with both the polymeric complex and the nanotube structure have been investigated. For quantification of the interactions, H-bonding, gyration radius, root-mean-square deviation (RMSD), Gibbs free energy, radial distribution function (RDF), energy, and Solvent Accessible Surface Area (SASA) analyses have been performed. The most stable micelle-DOX interaction is attributed to the presence of BCN in the microfluidic system according to the gyration radius and RMSD. Meanwhile, for DOX-doped CNT interaction the phosphorus-doped CNT in the microfluidic system is more stable. The highest electrostatic interaction can be seen between polymeric micelles and DOX in the presence of BCN. For nanotube-drug interaction, phosphorus-doped carbon nanotubes in the microfluidic system have the largest electrostatic interaction with the DOX. RDF results show that in the microfluidic system, nanotube-DOX affinity is larger than that of nanotube-micelle.
Collapse
Affiliation(s)
- Somayeh Sohrabi
- Department of Chemical Engineering, Amirkabir University of Technology (Tehran Polytechnic) 424 Hafez Avenue Tehran 1591634311 Iran
| | - Mohammad Khedri
- Department of Chemical Engineering, Amirkabir University of Technology (Tehran Polytechnic) 424 Hafez Avenue Tehran 1591634311 Iran
| | - Reza Maleki
- Computational Biology and Chemistry Group (CBCG), Universal Scientific Education and Research Network (USERN) Tehran Iran
| | - Mostafa Keshavarz Moraveji
- Department of Chemical Engineering, Amirkabir University of Technology (Tehran Polytechnic) 424 Hafez Avenue Tehran 1591634311 Iran
| |
Collapse
|
14
|
Recent progress on cheminformatics approaches to epigenetic drug discovery. Drug Discov Today 2020; 25:2268-2276. [PMID: 33010481 DOI: 10.1016/j.drudis.2020.09.021] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Revised: 08/31/2020] [Accepted: 09/17/2020] [Indexed: 12/16/2022]
Abstract
The ability of epigenetic markers to affect genome function has enabled transformative changes in drug discovery, especially in cancer and other emerging therapeutic areas. Concordant with the introduction of the term 'epi-informatics', the size of the epigenetically relevant chemical space has grown substantially and so did the number of applications of cheminformatic methods to epigenetics. Recent progress in epi-informatics has improved our understanding of the structure-epigenetic activity relationships and boosted the development of models predicting novel epigenetic agents. Herein, we review the advances in computational approaches to drug discovery of small molecules with epigenetic modulation profiles, summarize the current chemogenomics data available for epigenetic targets, and provide a perspective on the greater utility of biomedical knowledge mining as a means to advance the epigenetic drug discovery.
Collapse
|
15
|
van der Spoel D. Systematic design of biomolecular force fields. Curr Opin Struct Biol 2020; 67:18-24. [PMID: 32980615 DOI: 10.1016/j.sbi.2020.08.006] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Revised: 08/24/2020] [Accepted: 08/25/2020] [Indexed: 02/07/2023]
Abstract
Force fields for the study of biomolecules have been developed in a predominantly organic manner by regular updates over half a century. Together with better algorithms and advances in computer technology, force fields have improved to yield more robust predictions. However, there are also indications to suggest that intramolecular energy functions have not become better and that there still is room for improvement. In this review, systematic efforts toward development of novel force fields from scratch are described. This includes an estimate of the complexity of the problem and the prerequisites in the form of data and algorithms. It is suggested that in order to make progress, an effort is needed to standardize reference data and force field validation benchmarks.
Collapse
|
16
|
Prieto-Martínez FD, Medina-Franco JL. Current advances on the development of BET inhibitors: insights from computational methods. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2020; 122:127-180. [PMID: 32951810 DOI: 10.1016/bs.apcsb.2020.06.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Epigenetics was coined almost 70 years ago for the description of heritable phenotype without altering DNA sequences. Research on the field has uncovered significant roles of such mechanisms, that account for the biogenesis of several diseases. Further studies have led the way for drug development which targets epi-enzymes, mainly for cancer treatment. Of the numerous epi-targets involved with histone acetylation, bromodomains have captured the spotlight of drug discovery focused on novel therapies. However, due to high sequence identity, the development of potent and selective inhibitors poses a significant challenge. Herein, we discuss recent computational developments on BET inhibitors and other methods that may be applied for drug discovery in general. As a proof-of-concept, we discuss a virtual screening to identify novel BET inhibitors based on coumarin derivatives. From public data, we identified putative structure-activity relationships of coumarin scaffold and propose R-group modifications for BET selectivity. Results showed that the optimization and design of novel coumarins could be further explored.
Collapse
Affiliation(s)
- Fernando D Prieto-Martínez
- Department of Pharmacy, School of Chemistry, National Autonomous University of Mexico, Mexico City, Mexico
| | - José L Medina-Franco
- Department of Pharmacy, School of Chemistry, National Autonomous University of Mexico, Mexico City, Mexico
| |
Collapse
|
17
|
Wei W, Champion C, Barigye SJ, Liu Z, Labute P, Moitessier N. Use of Extended-Hückel Descriptors for Rapid and Accurate Predictions of Conjugated Torsional Energy Barriers. J Chem Inf Model 2020; 60:3534-3545. [DOI: 10.1021/acs.jcim.0c00440] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Wanlei Wei
- Department of Chemistry, McGill University, 801 Sherbrooke St. W., Montreal H3A 0B8, Québec, Canada
| | - Candide Champion
- Chemical Computing Group Incorporation, 1010 Sherbrooke St. W., Montreal H3A 2R7, Québec, Canada
| | - Stephen J. Barigye
- Department of Chemistry, McGill University, 801 Sherbrooke St. W., Montreal H3A 0B8, Québec, Canada
| | - Zhaomin Liu
- Department of Chemistry, McGill University, 801 Sherbrooke St. W., Montreal H3A 0B8, Québec, Canada
| | - Paul Labute
- Chemical Computing Group Incorporation, 1010 Sherbrooke St. W., Montreal H3A 2R7, Québec, Canada
| | - Nicolas Moitessier
- Department of Chemistry, McGill University, 801 Sherbrooke St. W., Montreal H3A 0B8, Québec, Canada
| |
Collapse
|
18
|
Lazzari F, Salvadori A, Mancini G, Barone V. Molecular Perception for Visualization and Computation: The Proxima Library. J Chem Inf Model 2020; 60:2668-2672. [PMID: 32271572 PMCID: PMC7997373 DOI: 10.1021/acs.jcim.0c00076] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Proxima is a molecular perception library designed with a double purpose: to be used with immersive molecular viewers (thus providing any required feature not supported by third party libraries) and to be integrated in workflow managers thus providing the functionalities needed for the first steps of molecular modeling studies. It thus stands at the boundary between visualization and computation. The purpose of the present article is to provide a general introduction to the first release of Proxima, describe its most significant features, and highlight its performance by means of some case studies. The current version of Proxima is available for evaluation purposes at https://bitbucket.org/sns-smartlab/proxima/src/master/.
Collapse
Affiliation(s)
- Federico Lazzari
- Scuola Normale Superiore, Piazza dei Cavalieri, 7-56126 Pisa, Italy
| | - Andrea Salvadori
- Scuola Normale Superiore, Piazza dei Cavalieri, 7-56126 Pisa, Italy
| | - Giordano Mancini
- Scuola Normale Superiore, Piazza dei Cavalieri, 7-56126 Pisa, Italy
| | - Vincenzo Barone
- Scuola Normale Superiore, Piazza dei Cavalieri, 7-56126 Pisa, Italy
| |
Collapse
|
19
|
Thompson MW, Gilmer JB, Matsumoto RA, Quach CD, Shamaprasad P, Yang AH, Iacovella CR, Cabe CM, Cummings PT. Towards Molecular Simulations that are Transparent, Reproducible, Usable By Others, and Extensible (TRUE). Mol Phys 2020; 118:e1742938. [PMID: 33100401 PMCID: PMC7576934 DOI: 10.1080/00268976.2020.1742938] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 03/05/2020] [Indexed: 10/24/2022]
Abstract
Systems composed of soft matter (e.g., liquids, polymers, foams, gels, colloids, and most biological materials) are ubiquitous in science and engineering, but molecular simulations of such systems pose particular computational challenges, requiring time and/or ensemble-averaged data to be collected over long simulation trajectories for property evaluation. Performing a molecular simulation of a soft matter system involves multiple steps, which have traditionally been performed by researchers in a "bespoke" fashion, resulting in many published soft matter simulations not being reproducible based on the information provided in the publications. To address the issue of reproducibility and to provide tools for computational screening, we have been developing the open-source Molecular Simulation and Design Framework (MoSDeF) software suite. In this paper, we propose a set of principles to create Transparent, Reproducible, Usable by others, and Extensible (TRUE) molecular simulations. MoSDeF facilitates the publication and dissemination of TRUE simulations by automating many of the critical steps in molecular simulation, thus enhancing their reproducibility. We provide several examples of TRUE molecular simulations: All of the steps involved in creating, running and extracting properties from the simulations are distributed on open-source platforms (within MoSDeF and on GitHub), thus meeting the definition of TRUE simulations.
Collapse
Affiliation(s)
- Matthew W Thompson
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, TN, USA
- Multiscale Modeling and Simulation Center, Vanderbilt University, Nashville, TN, USA
| | - Justin B Gilmer
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, TN, USA
- Multiscale Modeling and Simulation Center, Vanderbilt University, Nashville, TN, USA
| | - Ray A Matsumoto
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, TN, USA
- Multiscale Modeling and Simulation Center, Vanderbilt University, Nashville, TN, USA
| | - Co D Quach
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, TN, USA
- Multiscale Modeling and Simulation Center, Vanderbilt University, Nashville, TN, USA
| | - Parashara Shamaprasad
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, TN, USA
- Multiscale Modeling and Simulation Center, Vanderbilt University, Nashville, TN, USA
| | - Alexander H Yang
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, TN, USA
- Multiscale Modeling and Simulation Center, Vanderbilt University, Nashville, TN, USA
| | - Christopher R Iacovella
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, TN, USA
- Multiscale Modeling and Simulation Center, Vanderbilt University, Nashville, TN, USA
| | - Clare M Cabe
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, TN, USA
- Multiscale Modeling and Simulation Center, Vanderbilt University, Nashville, TN, USA
| | - Peter T Cummings
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, TN, USA
- Multiscale Modeling and Simulation Center, Vanderbilt University, Nashville, TN, USA
| |
Collapse
|
20
|
Gygli G, Pleiss J. Simulation Foundry: Automated and F.A.I.R. Molecular Modeling. J Chem Inf Model 2020; 60:1922-1927. [DOI: 10.1021/acs.jcim.0c00018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Gudrun Gygli
- Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, 70569 Stuttgart, Germany
| | - Juergen Pleiss
- Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, 70569 Stuttgart, Germany
| |
Collapse
|
21
|
Kantonen SM, Muddana HS, Schauperl M, Henriksen NM, Wang LP, Gilson MK. Data-Driven Mapping of Gas-Phase Quantum Calculations to General Force Field Lennard-Jones Parameters. J Chem Theory Comput 2020; 16:1115-1127. [PMID: 31917572 PMCID: PMC7101068 DOI: 10.1021/acs.jctc.9b00713] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Molecular dynamics simulations are helpful tools for a range of applications, ranging from drug discovery to protein structure determination. The successful use of this technology largely depends on the potential function, or force field, used to determine the potential energy at each configuration of the system. Most force fields encode all of the relevant parameters to be used in distinct atom types, each associated with parameters for all parts of the force field, typically bond stretches, angle bends, torsions, and nonbonded terms accounting for van der Waals and electrostatic interactions. Much attention has been paid to the nonbonded parameters and their derivation, which are important in particular due to their governance of noncovalent interactions, such as protein-ligand binding. Parametrization involves adjusting the nonbonded parameters to minimize the error between simulation results and experimental properties, such as heats of vaporization and densities of neat liquids. In this setting, determining the best set of atom types is far from trivial, and the large number of parameters to be fit for the atom types in a typical force field can make it difficult to approach a true optimum. Here, we utilize a previously described Minimal Basis Iterative Stockholder (MBIS) method to carry out an atoms-in-molecules partitioning of electron densities. Information from these atomic densities is then mapped to Lennard-Jones parameters using a set of mapping parameters much smaller than the typical number of atom types in a force field. This approach is advantageous in two ways: it eliminates atom types by allowing each atom to have unique Lennard-Jones parameters, and it greatly reduces the number of parameters to be optimized. We show that this approach yields results comparable to those obtained with the typed GAFF 1.7 force field, even when trained on a relatively small amount of experimental data.
Collapse
Affiliation(s)
- Sophie M Kantonen
- Skaggs School of Pharmacy and Pharmaceutical Sciences , University of California San Diego , 9500 Gilman Drive , La Jolla , California 92093-0736 , United States
| | - Hari S Muddana
- Skaggs School of Pharmacy and Pharmaceutical Sciences , University of California San Diego , 9500 Gilman Drive , La Jolla , California 92093-0736 , United States
- OpenEye Scientific Software, Inc. , 9 Bisbee Court, Suite D , Santa Fe , New Mexico 87508 , United States
| | - Michael Schauperl
- Skaggs School of Pharmacy and Pharmaceutical Sciences , University of California San Diego , 9500 Gilman Drive , La Jolla , California 92093-0736 , United States
| | - Niel M Henriksen
- Skaggs School of Pharmacy and Pharmaceutical Sciences , University of California San Diego , 9500 Gilman Drive , La Jolla , California 92093-0736 , United States
- AtomWise, Inc. , 717 Market Street, Suite 800 , San Francisco , California 94103 , United States
| | - Lee-Ping Wang
- Department of Chemistry , University of California Davis , One Shields Avenue , Davis , California 95616 , United States
| | - Michael K Gilson
- Skaggs School of Pharmacy and Pharmaceutical Sciences , University of California San Diego , 9500 Gilman Drive , La Jolla , California 92093-0736 , United States
| |
Collapse
|
22
|
Summers AZ, Gilmer JB, Iacovella CR, Cummings PT, MCabe C. MoSDeF, a Python Framework Enabling Large-Scale Computational Screening of Soft Matter: Application to Chemistry-Property Relationships in Lubricating Monolayer Films. J Chem Theory Comput 2020; 16:1779-1793. [DOI: 10.1021/acs.jctc.9b01183] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
23
|
Orioli S, Larsen AH, Bottaro S, Lindorff-Larsen K. How to learn from inconsistencies: Integrating molecular simulations with experimental data. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2020; 170:123-176. [PMID: 32145944 DOI: 10.1016/bs.pmbts.2019.12.006] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Molecular simulations and biophysical experiments can be used to provide independent and complementary insights into the molecular origin of biological processes. A particularly useful strategy is to use molecular simulations as a modeling tool to interpret experimental measurements, and to use experimental data to refine our biophysical models. Thus, explicit integration and synergy between molecular simulations and experiments is fundamental for furthering our understanding of biological processes. This is especially true in the case where discrepancies between measured and simulated observables emerge. In this chapter, we provide an overview of some of the core ideas behind methods that were developed to improve the consistency between experimental information and numerical predictions. We distinguish between situations where experiments are used to refine our understanding and models of specific systems, and situations where experiments are used more generally to refine transferable models. We discuss different philosophies and attempt to unify them in a single framework. Until now, such integration between experiments and simulations have mostly been applied to equilibrium data, and we discuss more recent developments aimed to analyze time-dependent or time-resolved data.
Collapse
Affiliation(s)
- Simone Orioli
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark; Structural Biophysics, Niels Bohr Institute, Faculty of Science, University of Copenhagen, Copenhagen, Denmark
| | - Andreas Haahr Larsen
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark; Structural Biophysics, Niels Bohr Institute, Faculty of Science, University of Copenhagen, Copenhagen, Denmark
| | - Sandro Bottaro
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark; Atomistic Simulations Laboratory, Istituto Italiano di Tecnologia, Genova, Italy
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
24
|
Grazioli G, Martin RW, Butts CT. Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods. Front Mol Biosci 2019; 6:42. [PMID: 31245383 PMCID: PMC6581705 DOI: 10.3389/fmolb.2019.00042] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2019] [Accepted: 05/20/2019] [Indexed: 01/23/2023] Open
Abstract
Simulations of intrinsically disordered proteins (IDPs) pose numerous challenges to comparative analysis, prominently including highly dynamic conformational states and a lack of well-defined secondary structure. Machine learning (ML) algorithms are especially effective at discriminating among high-dimensional inputs whose differences are extremely subtle, making them well suited to the study of IDPs. In this work, we apply various ML techniques, including support vector machines (SVM) and clustering, as well as related methods such as principal component analysis (PCA) and protein structure network (PSN) analysis, to the problem of uncovering differences between configurational data from molecular dynamics simulations of two variants of the same IDP. We examine molecular dynamics (MD) trajectories of wild-type amyloid beta (Aβ1−40) and its “Arctic” variant (E22G), systems that play a central role in the etiology of Alzheimer's disease. Our analyses demonstrate ways in which ML and related approaches can be used to elucidate subtle differences between these proteins, including transient structure that is poorly captured by conventional metrics.
Collapse
Affiliation(s)
- Gianmarc Grazioli
- California Institute for Telecommunications and Information Technology (Calit2), University of California, Irvine, Irvine, CA, United States.,Department of Chemistry, University of California, Irvine, Irvine, CA, United States
| | - Rachel W Martin
- Department of Chemistry, University of California, Irvine, Irvine, CA, United States.,Department of Molecular Biology and Biochemistry, University of California, Irvine, Irvine, CA, United States
| | - Carter T Butts
- California Institute for Telecommunications and Information Technology (Calit2), University of California, Irvine, Irvine, CA, United States.,Department of Computer Science, University of California, Irvine, Irvine, CA, United States.,Department of Sociology, Statistics, and Electrical Engineering and Computer Science, University of California, Irvine, Irvine, CA, United States
| |
Collapse
|
25
|
Visscher KM, Geerke DP. Deriving Force-Field Parameters from First Principles Using a Polarizable and Higher Order Dispersion Model. J Chem Theory Comput 2019; 15:1875-1883. [PMID: 30763086 PMCID: PMC6581419 DOI: 10.1021/acs.jctc.8b01105] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Indexed: 11/30/2022]
Abstract
In this work we propose a strategy based on quantum mechanical (QM) calculations to parametrize a polarizable force field for use in molecular dynamics (MD) simulations. We investigate the use of multiple atoms-in-molecules (AIM) strategies to partition QM determined molecular electron densities into atomic subregions. The partitioned atomic densities are subsequently used to compute atomic dispersion coefficients from effective exchange-hole-dipole moment (XDM) calculations. In order to derive values for the repulsive van der Waals parameters from first principles, we use a simple volume relation to scale effective atomic radii. Explicit inclusion of higher order dispersion coefficients was tested for a series of alkanes, and we show that combining C6 and C8 attractive terms together with a C11 repulsive potential yields satisfying models when used in combination with our van der Waals parameters and electrostatic and bonded parameters as directly obtained from quantum calculations as well. This result highlights that explicit inclusion of higher order dispersion terms could be viable in simulation, and it suggests that currently available QM analysis methods allow for first-principles parametrization of molecular mechanics models.
Collapse
Affiliation(s)
- Koen M. Visscher
- AIMMS Division of Molecular Toxicology,
Department of Chemistry and Pharmaceutical Sciences, Faculty of Science, Vrije Universiteit Amsterdam, De Boelelaan 1108, 1081 HZ Amsterdam, the Netherlands
| | - Daan P. Geerke
- AIMMS Division of Molecular Toxicology,
Department of Chemistry and Pharmaceutical Sciences, Faculty of Science, Vrije Universiteit Amsterdam, De Boelelaan 1108, 1081 HZ Amsterdam, the Netherlands
| |
Collapse
|