1
|
Pala D, Clark DE. Caught between a ROCK and a hard place: current challenges in structure-based drug design. Drug Discov Today 2024; 29:104106. [PMID: 39029868 DOI: 10.1016/j.drudis.2024.104106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 06/27/2024] [Accepted: 07/13/2024] [Indexed: 07/21/2024]
Abstract
The discipline of structure-based drug design (SBDD) is several decades old and it is tempting to think that the proliferation of experimental structures for many drug targets might make computer-aided drug design (CADD) straightforward. However, this is far from true. In this review, we illustrate some of the challenges that CADD scientists face every day in their work, even now. We use Rho-associated protein kinase (ROCK), and public domain structures and data, as an example to illustrate some of the challenges we have experienced during our project targeting this protein. We hope that this will help to prevent unrealistic expectations of what CADD can accomplish and to educate non-CADD scientists regarding the challenges still facing their CADD colleagues.
Collapse
Affiliation(s)
- Daniele Pala
- Medicinal Chemistry and Drug Design Technologies Department, Chiesi Farmaceutici S.p.A, Research Center, Largo Belloli 11/a, 43122 Parma, Italy
| | - David E Clark
- Charles River, 6-9 Spire Green Centre, Flex Meadow, Harlow CM19 5TR, UK.
| |
Collapse
|
2
|
Fraczkiewicz R, Quoc Nguyen H, Wu N, Kausch-Busies N, Grimbs S, Sommer K, Ter Laak A, Günther J, Wagner B, Reutlinger M. Best of both worlds: An expansion of the state of the art pK a model with data from three industrial partners. Mol Inform 2024:e202400088. [PMID: 39031889 DOI: 10.1002/minf.202400088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Revised: 05/11/2024] [Accepted: 05/12/2024] [Indexed: 07/22/2024]
Abstract
In a unique collaboration between Simulations Plus and several industrial partners, we were able to develop a new version 11.0 of the previously published in silico pKa model, S+pKa, with considerably improved prediction accuracy. The model's training set was vastly expanded by large amounts of experimental data obtained from F. Hoffmann-La Roche AG, Genentech Inc., and the Crop Science division of Bayer AG. The previous v7.0 of S+pKa was trained on data from public sources and the Pharmaceutical division of Bayer AG. The model has shown dramatic improvements in predictive accuracy when externally validated on three new contributor compound sets. Less expected was v11.0's improvement in prediction on new compounds developed at Bayer Pharma after v7.0 was released (2013-2023), even without contributing additional data to v11.0. We illustrate chemical space coverage by chemistries encountered in the five domains, public and industrial, outline model construction, and discuss factors contributing to model's success.
Collapse
Affiliation(s)
| | - Huy Quoc Nguyen
- Genentech Inc., Discovery Chemistry, 1 DNA Way, South San Francisco, CA 94080, USA
| | - Newton Wu
- Genentech Inc., Discovery Chemistry, 1 DNA Way, South San Francisco, CA 94080, USA
| | | | - Sergio Grimbs
- Bayer AG, Research & Development, Crop Science, 40789, Monheim, Germany
| | - Kai Sommer
- Bayer AG, Research & Development, Crop Science, 40789, Monheim, Germany
| | - Antonius Ter Laak
- Bayer AG, Research & Development, Pharmaceuticals, 13353, Berlin, Germany
| | - Judith Günther
- Bayer AG, Research & Development, Pharmaceuticals, 13353, Berlin, Germany
| | - Björn Wagner
- F. Hoffmann-La Roche AG, Roche Pharma Research and Early Development, 4070, Basel, Switzerland
| | - Michael Reutlinger
- F. Hoffmann-La Roche AG, Roche Pharma Research and Early Development, 4070, Basel, Switzerland
| |
Collapse
|
3
|
Abarbanel OD, Hutchison GR. QupKake: Integrating Machine Learning and Quantum Chemistry for Micro-p Ka Predictions. J Chem Theory Comput 2024. [PMID: 38832803 DOI: 10.1021/acs.jctc.4c00328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2024]
Abstract
Accurate prediction of micro-pKa values is crucial for understanding and modulating the acidity and basicity of organic molecules, with applications in drug discovery, materials science, and environmental chemistry. This work introduces QupKake, a novel method that combines graph neural network models with semiempirical quantum mechanical (QM) features to achieve exceptional accuracy and generalization in micro-pKa prediction. QupKake outperforms state-of-the-art models on a variety of benchmark data sets, with root-mean-square errors between 0.5 and 0.8 pKa units on five external test sets. Feature importance analysis reveals the crucial role of QM features in both the reaction site enumeration and micro-pKa prediction models. QupKake represents a significant advancement in micro-pKa prediction, offering a powerful tool for various applications in chemistry and beyond.
Collapse
Affiliation(s)
- Omri D Abarbanel
- Department of Chemistry, University of Pittsburgh, 219 Parkman Avenue, Pittsburgh, Pennsylvania 15260, United States
| | - Geoffrey R Hutchison
- Department of Chemistry, University of Pittsburgh, 219 Parkman Avenue, Pittsburgh, Pennsylvania 15260, United States
- Department of Chemical and Petroleum Engineering, University of Pittsburgh, 3700 O'Hara Street, Pittsburgh, Pennsylvania 15261, United States
| |
Collapse
|
4
|
Champion C, Hünenberger PH, Riniker S. Multistate Method to Efficiently Account for Tautomerism and Protonation in Alchemical Free-Energy Calculations. J Chem Theory Comput 2024; 20:4350-4362. [PMID: 38742760 PMCID: PMC11137823 DOI: 10.1021/acs.jctc.4c00370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 04/25/2024] [Accepted: 04/29/2024] [Indexed: 05/16/2024]
Abstract
The majority of drug-like molecules contain at least one ionizable group, and many common drug scaffolds are subject to tautomeric equilibria. Thus, these compounds are found in a mixture of protonation and/or tautomeric states at physiological pH. Intrinsically, standard classical molecular dynamics (MD) simulations cannot describe such equilibria between states, which negatively impacts the prediction of key molecular properties in silico. Following the formalism described by de Oliveira and co-workers (J. Chem. Theory Comput. 2019, 15, 424-435) to consider the influence of all states on the binding process based on alchemical free-energy calculations, we demonstrate in this work that the multistate method replica-exchange enveloping distribution sampling (RE-EDS) is well suited to describe molecules with multiple protonation and/or tautomeric states in a single simulation. We apply our methodology to a series of eight inhibitors of factor Xa with two protonation states and a series of eight inhibitors of glycogen synthase kinase 3β (GSK3β) with two tautomeric states. In particular, we show that given a sufficient phase-space overlap between the states, RE-EDS is computationally more efficient than standard pairwise free-energy methods.
Collapse
Affiliation(s)
- Candide Champion
- Department of Chemistry and Applied
Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Philippe H. Hünenberger
- Department of Chemistry and Applied
Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Sereina Riniker
- Department of Chemistry and Applied
Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| |
Collapse
|
5
|
Tran TTV, Tayara H, Chong KT. Recent Studies of Artificial Intelligence on In Silico Drug Absorption. J Chem Inf Model 2023; 63:6198-6211. [PMID: 37819031 DOI: 10.1021/acs.jcim.3c00960] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
Absorption is an important area of research in pharmacochemistry and drug development, because the drug has to be absorbed before any drug effects can occur. Furthermore, the ADMET (Absorption, Distribution, Metabolism, Excretion, and Toxicity) profile of drugs can be directly and considerably altered by modulating factors affecting absorption. Many drugs in development fail because of poor absorption. The research and continuous efforts of researchers in recent years have brought many successes and promises in drug absorption property prediction, especially in silico, which helps to reduce the time and cost significantly for screening undesirable drug candidates. In this report, we explicitly provide an overview of recent in silico studies on predicting absorption properties, especially from 2019 to the present, using artificial intelligence. Additionally, we have collected and investigated public databases that support absorption prediction research. On those grounds, we also proposed the challenges and development directions of absorption prediction in the future. We hope this review can provide researchers with valuable guidelines on absorption prediction to facilitate the development of newer approaches in drug discovery.
Collapse
Affiliation(s)
- Thi Tuyet Van Tran
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea
- Faculty of Information Technology, An Giang University, Long Xuyen 880000, Vietnam
- Vietnam National University, Ho Chi Minh City, Ho Chi Minh 700000, Vietnam
| | - Hilal Tayara
- School of International Engineering and Science, Jeonbuk National University, Jeonju 54896, Republic of Korea
| | - Kil To Chong
- Advances Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, Republic of Korea
| |
Collapse
|
6
|
Fischer TL, Bödecker M, Schweer SM, Dupont J, Lepère V, Zehnacker-Rentien A, Suhm MA, Schröder B, Henkes T, Andrada DM, Balabin RM, Singh HK, Bhattacharyya HP, Sarma M, Käser S, Töpfer K, Vazquez-Salazar LI, Boittier ED, Meuwly M, Mandelli G, Lanzi C, Conte R, Ceotto M, Dietrich F, Cisternas V, Gnanasekaran R, Hippler M, Jarraya M, Hochlaf M, Viswanathan N, Nevolianis T, Rath G, Kopp WA, Leonhard K, Mata RA. The first HyDRA challenge for computational vibrational spectroscopy. Phys Chem Chem Phys 2023; 25:22089-22102. [PMID: 37610422 DOI: 10.1039/d3cp01216f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
Vibrational spectroscopy in supersonic jet expansions is a powerful tool to assess molecular aggregates in close to ideal conditions for the benchmarking of quantum chemical approaches. The low temperatures achieved as well as the absence of environment effects allow for a direct comparison between computed and experimental spectra. This provides potential benchmarking data which can be revisited to hone different computational techniques, and it allows for the critical analysis of procedures under the setting of a blind challenge. In the latter case, the final result is unknown to modellers, providing an unbiased testing opportunity for quantum chemical models. In this work, we present the spectroscopic and computational results for the first HyDRA blind challenge. The latter deals with the prediction of water donor stretching vibrations in monohydrates of organic molecules. This edition features a test set of 10 systems. Experimental water donor OH vibrational wavenumbers for the vacuum-isolated monohydrates of formaldehyde, tetrahydrofuran, pyridine, tetrahydrothiophene, trifluoroethanol, methyl lactate, dimethylimidazolidinone, cyclooctanone, trifluoroacetophenone and 1-phenylcyclohexane-cis-1,2-diol are provided. The results of the challenge show promising predictive properties in both purely quantum mechanical approaches as well as regression and other machine learning strategies.
Collapse
Affiliation(s)
- Taija L Fischer
- Institut für Physikalische Chemie, Universität Göttingen, Tammannstraße 6, Göttingen, Germany.
| | - Margarethe Bödecker
- Institut für Physikalische Chemie, Universität Göttingen, Tammannstraße 6, Göttingen, Germany.
| | - Sophie M Schweer
- Institut für Physikalische Chemie, Universität Göttingen, Tammannstraße 6, Göttingen, Germany.
| | - Jennifer Dupont
- Institut des Sciences Moléculaires dOrsay, Université Paris-Saclay, CNRS, 91405 Orsay, France
| | - Valéria Lepère
- Institut des Sciences Moléculaires dOrsay, Université Paris-Saclay, CNRS, 91405 Orsay, France
| | - Anne Zehnacker-Rentien
- Institut des Sciences Moléculaires dOrsay, Université Paris-Saclay, CNRS, 91405 Orsay, France
| | - Martin A Suhm
- Institut für Physikalische Chemie, Universität Göttingen, Tammannstraße 6, Göttingen, Germany.
| | - Benjamin Schröder
- Institut für Physikalische Chemie, Universität Göttingen, Tammannstraße 6, Göttingen, Germany.
| | - Tobias Henkes
- Department of Physics and Materials Science, University of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Diego M Andrada
- Institute for Inorganic Chemistry, Saarland University, 66123 Saarbrücken, Germany
| | - Roman M Balabin
- Bond Street Holdings, Long Point Road, KN-1002 Henville Building 9, Charlestown, KN10 Nevis, St. Kitts and Nevis
| | - Haobam Kisan Singh
- Department of Chemistry, Indian Institute of Technology Guwahati, Assam-781039, India
| | | | - Manabendra Sarma
- Department of Chemistry, Indian Institute of Technology Guwahati, Assam-781039, India
| | - Silvan Käser
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, 4056 Basel, Switzerland
| | - Kai Töpfer
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, 4056 Basel, Switzerland
| | - Luis I Vazquez-Salazar
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, 4056 Basel, Switzerland
| | - Eric D Boittier
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, 4056 Basel, Switzerland
| | - Markus Meuwly
- Department of Chemistry, University of Basel, Klingelbergstrasse 80, 4056 Basel, Switzerland
| | - Giacomo Mandelli
- Dipartimento di Chimica, Università degli Studi di Milano, via C. Golgi 19, 20133 Milano, Italy
| | - Cecilia Lanzi
- Dipartimento di Chimica, Università degli Studi di Milano, via C. Golgi 19, 20133 Milano, Italy
| | - Riccardo Conte
- Dipartimento di Chimica, Università degli Studi di Milano, via C. Golgi 19, 20133 Milano, Italy
| | - Michele Ceotto
- Dipartimento di Chimica, Università degli Studi di Milano, via C. Golgi 19, 20133 Milano, Italy
| | - Fabian Dietrich
- Department of Physics Science, Universidad de La Frontera, Francisco Salazar 01145, Temuco, Chile
| | - Vicente Cisternas
- Department of Physics Science, Universidad de La Frontera, Francisco Salazar 01145, Temuco, Chile
| | - Ramachandran Gnanasekaran
- Vellore Institute of Technology, School of Advanced Sciences (SAS), ChemistryDivision, Chennai 600 027, India
| | - Michael Hippler
- Department of Chemistry, University of Sheffield, Sheffield S3 7HF, UK
| | - Mahmoud Jarraya
- U. Gustave Eiffel, COSYS/IMSE, 5 BD Descartes 77454, Champs-sur-Marne, France
| | - Majdi Hochlaf
- U. Gustave Eiffel, COSYS/IMSE, 5 BD Descartes 77454, Champs-sur-Marne, France
| | - Narasimhan Viswanathan
- Institute of Technical Thermodynamics, RWTH Aachen University, Schinkelstraße 8, D-52072 Aachen, Germany
| | - Thomas Nevolianis
- Institute of Technical Thermodynamics, RWTH Aachen University, Schinkelstraße 8, D-52072 Aachen, Germany
| | - Gabriel Rath
- Institute of Technical Thermodynamics, RWTH Aachen University, Schinkelstraße 8, D-52072 Aachen, Germany
| | - Wassja A Kopp
- Institute of Technical Thermodynamics, RWTH Aachen University, Schinkelstraße 8, D-52072 Aachen, Germany
| | - Kai Leonhard
- Institute of Technical Thermodynamics, RWTH Aachen University, Schinkelstraße 8, D-52072 Aachen, Germany
| | - Ricardo A Mata
- Institut für Physikalische Chemie, Universität Göttingen, Tammannstraße 6, Göttingen, Germany.
| |
Collapse
|
7
|
Fraczkiewicz R, Waldman M. p K50─A Rigorous Indicator of Individual Functional Group Acidity/Basicity in Multiprotic Compounds. J Chem Inf Model 2023; 63:3198-3208. [PMID: 37104727 DOI: 10.1021/acs.jcim.3c00187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
In this work, we show that the apparent pKa measured by standard titration experiments is an insufficient measure of acidity or basicity of organic functional groups in multiprotic compounds─a frequent aspect of lead optimization in pharmaceutical research. We show that the use of the apparent pKa in this context may result in costly mistakes. To properly represent the group's true acidity/basicity, we propose pK50─a single-proton midpoint measure derived from a statistical thermodynamics treatment of multiprotic ionization. We show that pK50, which may be directly measured in specialized NMR titration experiments, is superior in tracking the functional group's acidity/basicity across congeneric series of related compounds and converges to the well familiar ionization constant in the monoprotic case.
Collapse
Affiliation(s)
- Robert Fraczkiewicz
- Simulations Plus, Inc., 42505 10th Street West, Lancaster, California 93534, United States
| | - Marvin Waldman
- Simulations Plus, Inc., 42505 10th Street West, Lancaster, California 93534, United States
| |
Collapse
|
8
|
Johnston RC, Yao K, Kaplan Z, Chelliah M, Leswing K, Seekins S, Watts S, Calkins D, Chief Elk J, Jerome SV, Repasky MP, Shelley JC. Epik: p Ka and Protonation State Prediction through Machine Learning. J Chem Theory Comput 2023; 19:2380-2388. [PMID: 37023332 DOI: 10.1021/acs.jctc.3c00044] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/08/2023]
Abstract
Epik version 7 is a software program that uses machine learning for predicting the pKa values and protonation state distribution of complex, druglike molecules. Using an ensemble of atomic graph convolutional neural networks (GCNNs) trained on over 42,000 pKa values across broad chemical space from both experimental and computed origins, the model predicts pKa values with 0.42 and 0.72 pKa unit median absolute and root mean square errors, respectively, across seven test sets. Epik version 7 also generates protonation states and recovers 95% of the most populated protonation states compared to previous versions. Requiring on average only 47 ms per ligand, Epik version 7 is rapid and accurate enough to evaluate protonation states for crucial molecules and prepare ultra-large libraries of compounds to explore vast regions of chemical space. The simplicity and time required for the training allow for the generation of highly accurate models customized to a program's specific chemistry.
Collapse
Affiliation(s)
- Ryne C Johnston
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| | - Kun Yao
- Schrödinger, Inc., 1540 Broadway Street, 24th Floor, New York, New York 10036, United States
| | - Zachary Kaplan
- Schrödinger, Inc., 1540 Broadway Street, 24th Floor, New York, New York 10036, United States
| | - Monica Chelliah
- Schrödinger, Inc., 1540 Broadway Street, 24th Floor, New York, New York 10036, United States
| | - Karl Leswing
- Schrödinger, Inc., 1540 Broadway Street, 24th Floor, New York, New York 10036, United States
| | - Sean Seekins
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| | - Shawn Watts
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| | - David Calkins
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| | - Jackson Chief Elk
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| | - Steven V Jerome
- Schrödinger, Inc., 9171 Towne Centre Drive, San Diego, California 92122, United States
| | - Matthew P Repasky
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| | - John C Shelley
- Schrödinger, Inc., 101 SW Main Street, Suite 1300, Portland, Oregon 97204, United States
| |
Collapse
|
9
|
Sutikdja LW, Nguyen HVL, Jelisavac D, Stahl W, Mouhib H. Benchmarking quantum chemical methods for accurate gas-phase structure predictions of carbonyl compounds: the case of ethyl butyrate. Phys Chem Chem Phys 2023; 25:7688-7696. [PMID: 36857713 PMCID: PMC10015624 DOI: 10.1039/d2cp05774c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
Abstract
High-resolution spectroscopy techniques play a pivotal role to validate and efficiently benchmark available methods from quantum chemistry. In this work, we analyzed the microwave spectrum of ethyl butyrate within the scope of a systematic investigation to benchmark state-of-the-art exchange-correlation functionals and ab initio methods, to accurately predict the lowest energy conformers of carbonyl compounds in their isolated state. Under experimental conditions, we observed two distinct conformers, one of Cs and one of C1 symmetry. As reported earlier in the cases of some ethyl and methyl alkynoates, structural optimizations of the most abundant conformer that exhibits a C1 symmetry proved extremely challenging for several quantum chemical levels. To probe the sensitivity of different methods and basis sets, we use the identified soft-degree of freedom in proximity to the carbonyl group as an order parameter. The results of our study provide useful insight for spectroscopists to select an adapted method for structure prediction of carbonyl compounds based on their available computational resources, suggesting a reasonable trade-off between accuracy and CPU cost. At the same time, our observations and the resulting sets of highly accurate experimental constants from high-resolution spectroscopy experiments give an appeal to theoretical groups to look further into this seemingly simple family of chemical compounds, which may prove useful for the further development and parametrization of theoretical methods in computational chemistry.
Collapse
Affiliation(s)
- Lilian W Sutikdja
- Institute of Physical Chemistry, RWTH Aachen University, Landoltweg 2, D-52074, Aachen, Germany
| | - Ha Vinh Lam Nguyen
- Univ Paris Est Creteil and Université Paris Cité, CNRS, LISA, F-94010, Créteil, France. .,Institut Universitaire de France (IUF), F-75231, Paris cedex 05, France
| | - Dragan Jelisavac
- Institute of Physical Chemistry, RWTH Aachen University, Landoltweg 2, D-52074, Aachen, Germany
| | - Wolfgang Stahl
- Institute of Physical Chemistry, RWTH Aachen University, Landoltweg 2, D-52074, Aachen, Germany
| | - Halima Mouhib
- Department of Computer Science, VU Bioinformatics, Vrije Universiteit Amsterdam, De Boelelaan 1111, 1081 HV, Amsterdam, The Netherlands.
| |
Collapse
|
10
|
Wu J, Wang J, Wu Z, Zhang S, Deng Y, Kang Y, Cao D, Hsieh CY, Hou T. ALipSol: An Attention-Driven Mixture-of-Experts Model for Lipophilicity and Solubility Prediction. J Chem Inf Model 2022; 62:5975-5987. [PMID: 36417544 DOI: 10.1021/acs.jcim.2c01290] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Lipophilicity (logD) and aqueous solubility (logSw) play a central role in drug development. The accurate prediction of these properties remains to be solved due to data scarcity. Current methodologies neglect the intrinsic relationships between physicochemical properties and usually ignore the ionization effects. Here, we propose an attention-driven mixture-of-experts (MoE) model named ALipSol, which explicitly reproduces the hierarchy of task relationships. We adopt the principle of divide-and-conquer by breaking down the complex end point (logD or logSw) into simpler ones (acidic pKa, basic pKa, and logP) and allocating a specific expert network for each subproblem. Subsequently, we implement transfer learning to extract knowledge from related tasks, thus alleviating the dilemma of limited data. Additionally, we substitute the gating network with an attention mechanism to better capture the dynamic task relationships on a per-example basis. We adopt local fine-tuning and consensus prediction to further boost model performance. Extensive evaluation experiments verify the success of the ALipSol model, which achieves RMSE improvement of 8.04%, 2.49%, 8.57%, 12.8%, and 8.60% on the Lipop, ESOL, AqSolDB, external logD, and external logS data sets, respectively, compared with Attentive FP and the state-of-the-art in silico tools. In particular, our model yields more significant advantages (Welch's t-test) for small training data, implying its high robustness and generalizability. The interpretability analysis proves that the atom contributions learned by ALipSol are more reasonable compared with the vanilla Attentive FP, and the substitution effects in benzene derivatives agreed well with empirical constants, revealing the potential of our model to extract useful patterns from data and provide guidance for lead optimization.
Collapse
Affiliation(s)
- Jialu Wu
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058Zhejiang, P. R. China.,CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018Zhejiang, P. R. China
| | - Junmei Wang
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, Pennsylvania15261, United States
| | - Zhenxing Wu
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058Zhejiang, P. R. China.,CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018Zhejiang, P. R. China
| | - Shengyu Zhang
- Tencent Quantum Laboratory, Tencent, Shenzhen, 518057Guangdong, P. R. China
| | - Yafeng Deng
- CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018Zhejiang, P. R. China
| | - Yu Kang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058Zhejiang, P. R. China
| | - Dongsheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha, 410004Hunan, P. R. China
| | - Chang-Yu Hsieh
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058Zhejiang, P. R. China
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058Zhejiang, P. R. China
| |
Collapse
|
11
|
Wu J, Wan Y, Wu Z, Zhang S, Cao D, Hsieh CY, Hou T. MF-SuP-pKa: Multi-fidelity modeling with subgraph pooling mechanism for pKa prediction. Acta Pharm Sin B 2022. [DOI: 10.1016/j.apsb.2022.11.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
12
|
Wu J, Kang Y, Pan P, Hou T. Machine learning methods for pK a prediction of small molecules: Advances and challenges. Drug Discov Today 2022; 27:103372. [PMID: 36167281 DOI: 10.1016/j.drudis.2022.103372] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 08/15/2022] [Accepted: 09/21/2022] [Indexed: 11/27/2022]
Abstract
The acid-base dissociation constant (pKa) is a fundamental property influencing many ADMET properties of small molecules. However, rapid and accurate pKa prediction remains a great challenge. In this review, we outline the current advances in machine-learning-based QSAR models for pKa prediction, including descriptor-based and graph-based approaches, and summarize their pros and cons. Moreover, we highlight the current challenges and future directions regarding experimental data, crucial factors influencing pKa and in silico prediction tools. We hope that this review can provide a practical guidance for the follow-up studies.
Collapse
Affiliation(s)
- Jialu Wu
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Yu Kang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Peichen Pan
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, China.
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, China.
| |
Collapse
|
13
|
Rodriguez SA, Tran JV, Sabatino SJ, Paluch AS. Predicting octanol/water partition coefficients and pKa for the SAMPL7 challenge using the SM12, SM8 and SMD solvation models. J Comput Aided Mol Des 2022; 36:687-705. [PMID: 36117236 DOI: 10.1007/s10822-022-00474-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Accepted: 08/29/2022] [Indexed: 11/29/2022]
Abstract
Blind predictions of octanol/water partition coefficients and pKa at 298.15 K for 22 drug-like compounds were made for the SAMPL7 challenge. Octanol/water partition coefficients were predicted from solvation free energies computed using electronic structure calculations with the SM12, SM8 and SMD solvation models. Within these calculations we compared the use of gas- and solution-phase optimized geometries of the solute. Based on these calculations we found that in general the use of solution phase-optimized geometries increases the affinity of the solutes for water as compared to octanol, with the use of gas-phase optimized geometries resulting in the better agreement with experiment. The pKa is computed using the direct approach, scaled solvent-accessible surface model, and the inclusion of an explicit water molecule, where the latter two methods have previously been shown to offer improved predictions as compared to the direct approach. We find that the use of an explicit water molecule provides superior predictions, and that the predicted macroscopic pKa is sensitive to the employed microstates.
Collapse
Affiliation(s)
- Sergio A Rodriguez
- Instituto de Ciencias Químicas, Facultad de Agronomía y Agroindustrias, Universidad Nacional de Santiago del Estero, CONICET, Santiago del Estero, Argentina
| | - Jasmine Vy Tran
- Department of Chemical, Paper, and Biomedical Engineering, Miami University, Oxford, OH, 45056, USA
| | - Spencer J Sabatino
- Department of Chemical, Paper, and Biomedical Engineering, Miami University, Oxford, OH, 45056, USA
| | - Andrew S Paluch
- Department of Chemical, Paper, and Biomedical Engineering, Miami University, Oxford, OH, 45056, USA.
| |
Collapse
|
14
|
Mayr F, Wieder M, Wieder O, Langer T. Improving Small Molecule pKa Prediction Using Transfer Learning With Graph Neural Networks. Front Chem 2022; 10:866585. [PMID: 35721000 PMCID: PMC9204323 DOI: 10.3389/fchem.2022.866585] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 04/04/2022] [Indexed: 11/13/2022] Open
Abstract
Enumerating protonation states and calculating microstate pKa values of small molecules is an important yet challenging task for lead optimization and molecular modeling. Commercial and non-commercial solutions have notable limitations such as restrictive and expensive licenses, high CPU/GPU hour requirements, or the need for expert knowledge to set up and use. We present a graph neural network model that is trained on 714,906 calculated microstate pKa predictions from molecules obtained from the ChEMBL database. The model is fine-tuned on a set of 5,994 experimental pKa values significantly improving its performance on two challenging test sets. Combining the graph neural network model with Dimorphite-DL, an open-source program for enumerating ionization states, we have developed the open-source Python package pkasolver, which is able to generate and enumerate protonation states and calculate pKa values with high accuracy.
Collapse
|
15
|
Zhang Y, Zhao Z, Wang K, Lyu K, Yao C, Li L, Shen X, Liu T, Guo X, Li H, Wang W, Lai TT. Molecular docking assisted exploration on solubilization of poorly soluble drug remdesivir in sulfobutyl ether-tycyclodextrin. AAPS OPEN 2022; 8:9. [PMID: 35498163 PMCID: PMC9035334 DOI: 10.1186/s41120-022-00054-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2021] [Accepted: 03/02/2022] [Indexed: 11/23/2022] Open
Abstract
Objective To study structure-specific solubilization effect of Sulfobutyl ether-β-cyclodextrin (SBE-β-CD) on Remdesivir (RDV) and to understand the experimental clathration with the aid of quantum mechanics (QM), molecular docking and molecular dynamics (MD) calculations. Methods The experiment was carried out by phase solubility method at various pH and temperatures, while the concentration of Remdesivir in the solution was determined by HPLC. The complexation mechanism and the pH dependence of drug loading were investigated following a novel procedure combining QM, MD and molecular docking, based on accurate pKa predictions. Results The phase solubility and solubilization effect of RDV in SBE-β-CD were explored kinetically and thermodynamically for each assessed condition. An optimal drug / SBE-β-CD feeding molar ratio was determined stoichiometrically for RDV solubility in pH1.7 solution. The supersaturated solubility was examined over time after pH of the solution was adjusted from 1.7 to 3.5. A possible hypothesis was raised to elucidate the experimentally observed stabilization of supersaturation based on the proposed RDV Cation A /SBE-β-CD pocket conformations. Conclusion The computational explorations conformed to the experimentally determined phase solubilization and well elucidated the mechanism of macroscopic clathration between RDV and SBE-β-CD from the perspective of microscopic molecular calculations. Graphical Abstract ![]()
Collapse
|
16
|
Grosjean H, Işık M, Aimon A, Mobley D, Chodera J, von Delft F, Biggin PC. SAMPL7 protein-ligand challenge: A community-wide evaluation of computational methods against fragment screening and pose-prediction. J Comput Aided Mol Des 2022; 36:291-311. [PMID: 35426591 PMCID: PMC9010448 DOI: 10.1007/s10822-022-00452-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Accepted: 03/22/2022] [Indexed: 11/01/2022]
Abstract
A novel crystallographic fragment screening data set was generated and used in the SAMPL7 challenge for protein-ligands. The SAMPL challenges prospectively assess the predictive power of methods involved in computer-aided drug design. Application of various methods to fragment molecules are now widely used in the search for new drugs. However, there is little in the way of systematic validation specifically for fragment-based approaches. We have performed a large crystallographic high-throughput fragment screen against the therapeutically relevant second bromodomain of the Pleckstrin-homology domain interacting protein (PHIP2) that revealed 52 different fragments bound across 4 distinct sites, 47 of which were bound to the pharmacologically relevant acetylated lysine (Kac) binding site. These data were used to assess computational screening, binding pose prediction and follow-up enumeration. All submissions performed randomly for screening. Pose prediction success rates (defined as less than 2 Å root mean squared deviation against heavy atom crystal positions) ranged between 0 and 25% and only a very few follow-up compounds were deemed viable candidates from a medicinal-chemistry perspective based on a common molecular descriptors analysis. The tight deadlines imposed during the challenge led to a small number of submissions suggesting that the accuracy of rapidly responsive workflows remains limited. In addition, the application of these methods to reproduce crystallographic fragment data still appears to be very challenging. The results show that there is room for improvement in the development of computational tools particularly when applied to fragment-based drug design.
Collapse
Affiliation(s)
- Harold Grosjean
- Structural Bioinformatics and Computational Biochemistry, Department of Biochemistry, South Parks Road, OX1 3QU, Oxford, UK
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, OX11 0QX, Didcot, UK
| | - Mehtap Işık
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, 10065, New York, NY, USA
| | - Anthony Aimon
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, OX11 0QX, Didcot, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, OX11 0FA, Didcot, UK
| | - David Mobley
- Department of Pharmaceutical Sciences, Department of Chemistry, University of California, 92617, Irvine, California, USA
| | - John Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, 10065, New York, NY, USA
| | - Frank von Delft
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, OX11 0QX, Didcot, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, OX11 0FA, Didcot, UK
- Centre for Medicines Discovery, University of Oxford, Old Road Campus, Roosevelt Drive, OX3 7DQ, Headington, UK
- Structural Genomics Consortium, University of Oxford, Old Road Campus, Roosevelt Drive, OX3 7DQ, Headington, UK
| | - Philip C Biggin
- Structural Bioinformatics and Computational Biochemistry, Department of Biochemistry, South Parks Road, OX1 3QU, Oxford, UK.
| |
Collapse
|
17
|
Recent Developments of Computational Methods for pKa Prediction Based on Electronic Structure Theory with Solvation Models. J 2021. [DOI: 10.3390/j4040058] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
The protonation/deprotonation reaction is one of the most fundamental processes in solutions and biological systems. Compounds with dissociative functional groups change their charge states by protonation/deprotonation. This change not only significantly alters the physical properties of a compound itself, but also has a profound effect on the surrounding molecules. In this paper, we review our recent developments of the methods for predicting the Ka, the equilibrium constant for protonation reactions or acid dissociation reactions. The pKa, which is a logarithm of Ka, is proportional to the reaction Gibbs energy of the protonation reaction, and the reaction free energy can be determined by electronic structure calculations with solvation models. The charge of the compound changes before and after protonation; therefore, the solvent effect plays an important role in determining the reaction Gibbs energy. Here, we review two solvation models: the continuum model, and the integral equation theory of molecular liquids. Furthermore, the reaction Gibbs energy calculations for the protonation reactions require special attention to the handling of dissociated protons. An efficient method for handling the free energy of dissociated protons will also be reviewed.
Collapse
|
18
|
Navo CD, Jiménez-Osés G. Computer Prediction of p K a Values in Small Molecules and Proteins. ACS Med Chem Lett 2021; 12:1624-1628. [PMID: 34795846 DOI: 10.1021/acsmedchemlett.1c00435] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Accurately determining the acid dissociation constants (K a or their logarithmic form, pK a) of small molecules and large biomolecules has proven to be pivotal for the study different biological processes and developing new drugs. This Viewpoint summarizes some of the most common methodologies and recent advances described for pK a prediction using computational techniques when experimental values are not easily accessible such as in proteins and/or for the screening of large libraries of new compounds.
Collapse
Affiliation(s)
- Claudio D. Navo
- CIC bioGUNE, Basque Research and Technology Alliance (BRTA), Bizkaia Technology Park, Building 800, Derio 48160, Spain
| | - Gonzalo Jiménez-Osés
- CIC bioGUNE, Basque Research and Technology Alliance (BRTA), Bizkaia Technology Park, Building 800, Derio 48160, Spain
- Ikerbasque, Basque Foundation for Science, 48013 Bilbao, Spain
| |
Collapse
|
19
|
SAMPL7 blind challenge: quantum-mechanical prediction of partition coefficients and acid dissociation constants for small drug-like molecules. J Comput Aided Mol Des 2021; 35:841-851. [PMID: 34164769 DOI: 10.1007/s10822-021-00402-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 06/17/2021] [Indexed: 02/02/2023]
Abstract
The physicochemical properties of a drug molecule determine the therapeutic effectiveness of the drug. Thus, the development of fast and accurate theoretical approaches for the prediction of such properties is inevitable. The participation to the SAMPL7 challenge is based on the estimation of logP coefficients and pKa values of small drug-like sulfonamide derivatives. Thereby, quantum mechanical calculations were carried out in order to calculate the free energy of solvation and the transfer energy of 22 drug-like compounds in different environments (water and n-octanol) by employing the SMD solvation model. For logP calculations, we studied eleven different methodologies to calculate the transfer free energies, the lowest RMSE value was obtained for the M06L/def2-TZVP//M06L/def2-SVP level of theory. On the other hand, we employed an isodesmic reaction scheme within the macro pKa framework; this was based on selecting reference molecules similar to the SAMPL7 challenge molecules. Consequently, highly well correlated pKa values were obtained with the M062X/6-311+G(2df,2p)//M052X/6-31+G(d,p) level of theory.
Collapse
|
20
|
Bergazin TD, Tielker N, Zhang Y, Mao J, Gunner MR, Francisco K, Ballatore C, Kast SM, Mobley DL. Evaluation of log P, pK a, and log D predictions from the SAMPL7 blind challenge. J Comput Aided Mol Des 2021; 35:771-802. [PMID: 34169394 PMCID: PMC8224998 DOI: 10.1007/s10822-021-00397-3] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Accepted: 06/05/2021] [Indexed: 12/16/2022]
Abstract
The Statistical Assessment of Modeling of Proteins and Ligands (SAMPL) challenges focuses the computational modeling community on areas in need of improvement for rational drug design. The SAMPL7 physical property challenge dealt with prediction of octanol-water partition coefficients and pKa for 22 compounds. The dataset was composed of a series of N-acylsulfonamides and related bioisosteres. 17 research groups participated in the log P challenge, submitting 33 blind submissions total. For the pKa challenge, 7 different groups participated, submitting 9 blind submissions in total. Overall, the accuracy of octanol-water log P predictions in the SAMPL7 challenge was lower than octanol-water log P predictions in SAMPL6, likely due to a more diverse dataset. Compared to the SAMPL6 pKa challenge, accuracy remains unchanged in SAMPL7. Interestingly, here, though macroscopic pKa values were often predicted with reasonable accuracy, there was dramatically more disagreement among participants as to which microscopic transitions produced these values (with methods often disagreeing even as to the sign of the free energy change associated with certain transitions), indicating far more work needs to be done on pKa prediction methods.
Collapse
Affiliation(s)
| | - Nicolas Tielker
- Physikalische Chemie III, Technische Universität Dortmund, Otto-Hahn-Str. 4a, 44227, Dortmund, Germany
| | - Yingying Zhang
- Department of Physics, The Graduate Center, City University of New York, New York, 10016, USA
| | - Junjun Mao
- Department of Physics, City College of New York, New York, 10031, USA
| | - M R Gunner
- Department of Physics, The Graduate Center, City University of New York, New York, 10016, USA.,Department of Physics, City College of New York, New York, 10031, USA
| | - Karol Francisco
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, Ja Jolla, CA, 92093-0756, USA
| | - Carlo Ballatore
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, Ja Jolla, CA, 92093-0756, USA
| | - Stefan M Kast
- Physikalische Chemie III, Technische Universität Dortmund, Otto-Hahn-Str. 4a, 44227, Dortmund, Germany
| | - David L Mobley
- Department of Pharmaceutical Sciences, University of California, Irvine, Irvine, CA, 92697, USA. .,Department of Chemistry, University of California, Irvine, Irvine, CA, 92697, USA.
| |
Collapse
|
21
|
Pracht P, Grimme S. Efficient Quantum-Chemical Calculations of Acid Dissociation Constants from Free-Energy Relationships. J Phys Chem A 2021; 125:5681-5692. [PMID: 34142841 DOI: 10.1021/acs.jpca.1c03463] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The calculation of acid dissociation constants (pKa) is an important task in computational chemistry and chemoinformatics. Theoretically and with minimal empiricism, this is possible from computed acid dissociation free energies via so-called linear free-energy relationships. In this study some modifications are introduced to the latter, providing a straightforward, broadly applicable protocol with an adjustable degree of sophistication for quantum chemistry-based calculations of pKa in water. It targets a wide pKa range (∼70 units) and medium-sized, flexible molecules. Herein, a focus is set on the recently published r2SCAN-3c and related efficient composite density functionals and the semiempirical GFN2-xTB method, including a newly introduced energy correction for heterolytic dissociation, both in combination with implicit solvation models. The performance is evaluated in comparison with experimental data, showing mean errors often smaller than a targeted 1 pKa unit accuracy. Larger deviations are observed only upon inclusion of challenging highly negative (<-5) or positive (>15) pKa values. Among all those tested, it is found that B97-3c is the best performing functional, although rather independently of the density functional theory (DFT) method used; low root-mean-square errors of 0.8-1.0 pKa units for typical drugs are obtained. For optimal performance, it is recommended to employ DFT functional specific free-energy relationship parameters. Additionally, a significant conformational dependence of the pKa values is revealed and quantified for some nonrigid drug molecules.
Collapse
Affiliation(s)
- Philipp Pracht
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Stefan Grimme
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| |
Collapse
|