1
|
Pezzola S, Venanzi M, Conte V, Sabuzi F, Galloni P. New Insights in the Computational pKb Determination of Primary Amines and Anilines. Chemphyschem 2024; 25:e202400550. [PMID: 38798156 DOI: 10.1002/cphc.202400550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Revised: 05/22/2024] [Accepted: 05/22/2024] [Indexed: 05/29/2024]
Abstract
Extensive research has already provided reliable methods for the in silico prediction of pKa, while a trustworthy strategy for pKb determination is still being sought. Indeed, the approaches previously exploited for computing pKa have shown their weakness in predicting pKb. In the light of the exceptional reliability demonstrated in the pKa calculation of a wide panel of organic acids, in this work, we exploited our "easy to use methodology", based on the direct approach, to predict the pKb of primary amines. Herein, CAM-B3LYP was compared to WB97XD and B3PW91, exploring the solvation model based on density (SMD) and the polarizable continuum model (PCM), in the presence of two explicit water molecules. Noteworthy, CAM-B3LYP and WB97XD returned completely different solvent accessible surfaces (SAS) and electron potential maps (EPM) for the bases and the conjugated acids, independently from the nature of the substituents. Once again, CAM-B3LYP/SMD/2H2O method confirmed its remarkable reliability, leading to a minimum average error (MAE) lower than 0.3. This outstanding result strengthens the trustworthiness of our method, already successfully applied to predict the pKa of different substituted phenols and carboxylic acids. Thus, our "easy-to-use" process can predict also the pKb of primary ammines and anilines, always ensuring consistent outputs.
Collapse
Affiliation(s)
- Silvia Pezzola
- Department of Chemical Science and Technologies, University of Rome Tor Vergata, Address Via della Ricerca Scientifica snc, 00133, Roma
| | - Mariano Venanzi
- Department of Chemical Science and Technologies, University of Rome Tor Vergata, Address Via della Ricerca Scientifica snc, 00133, Roma
| | - Valeria Conte
- Department of Chemical Science and Technologies, University of Rome Tor Vergata, Address Via della Ricerca Scientifica snc, 00133, Roma
| | - Federica Sabuzi
- Department of Chemical Science and Technologies, University of Rome Tor Vergata, Address Via della Ricerca Scientifica snc, 00133, Roma
| | - Pierluca Galloni
- Department of Chemical Science and Technologies, University of Rome Tor Vergata, Address Via della Ricerca Scientifica snc, 00133, Roma
| |
Collapse
|
2
|
Sanchez AJ, Maier S, Raghavachari K. Leveraging DFT and Molecular Fragmentation for Chemically Accurate p Ka Prediction Using Machine Learning. J Chem Inf Model 2024; 64:712-723. [PMID: 38301279 DOI: 10.1021/acs.jcim.3c01923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2024]
Abstract
We present a quantum mechanical/machine learning (ML) framework based on random forest to accurately predict the pKas of complex organic molecules using inexpensive density functional theory (DFT) calculations. By including physics-based features from low-level DFT calculations and structural features from our connectivity-based hierarchy (CBH) fragmentation protocol, we can correct the systematic error associated with DFT. The generalizability and performance of our model are evaluated on two benchmark sets (SAMPL6 and Novartis). We believe the carefully curated input of physics-based features lessens the model's data dependence and need for complex deep learning architectures, without compromising the accuracy of the test sets. As a point of novelty, our work extends the applicability of CBH, employing it for the generation of viable molecular descriptors for ML.
Collapse
Affiliation(s)
- Alec J Sanchez
- Department of Chemistry, Indiana University?, Bloomington, Indiana 47405, United States
| | - Sarah Maier
- Department of Chemistry, Indiana University?, Bloomington, Indiana 47405, United States
| | - Krishnan Raghavachari
- Department of Chemistry, Indiana University?, Bloomington, Indiana 47405, United States
| |
Collapse
|
3
|
Brzeski J, Ciesielska A, Makowski M. Theoretical Study on the Alkylimino-Substituted Sulfonamides with Potential Biological Activity. J Phys Chem B 2023; 127:6620-6627. [PMID: 37478052 PMCID: PMC10405214 DOI: 10.1021/acs.jpcb.3c01965] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Revised: 06/14/2023] [Indexed: 07/23/2023]
Abstract
Antibiotics play a key role in the fight against bacterial diseases. However, bacteria quickly learn how to minimize the effects of antibiotics and strengthen their resistance. Thus, the fight against them becomes more and more difficult and there is a constant search for new bactericidal compounds. It is important in this type of search to determine the basic properties of compounds such as pKa, hydrogen bond formation, or hydrophobicity. Here, we present the results of our in silico study of five sulfonamide derivatives differing in alkylamine substituent length. Based on our results, we propose a model of three possible pKa values for each of the studied compounds. Interestingly, the use of Muckerman's approach for pKa determination exhibits that theoretical and experimental results are in very good agreement. Intramolecular hydrogen bond formation affects pKa. The strength of the H-bond interaction increases from ethyl to butylamine and then decreases with the elongation of the alkylamine chain. The obtained partition coefficients (expressed here in the value of log P) increase with the number of carbon atoms in the alkylamine chain following Lipinski's rule of five. The presented results provide important structural, physicochemical, and thermodynamic information that allows for the understanding of the influence of some sulfonamides and their possible activity.
Collapse
Affiliation(s)
- Jakub Brzeski
- Faculty of Chemistry, University of Gdańsk, Wita Stwosza 63, 80-308 Gdańsk, Poland
| | | | - Mariusz Makowski
- Faculty of Chemistry, University of Gdańsk, Wita Stwosza 63, 80-308 Gdańsk, Poland
| |
Collapse
|
4
|
Ogata S, Uranagase M. Protonation of Strained Epoxy Resin under Wet Conditions via First-Principles Calculations Using the H +-Shift Method. J Phys Chem B 2023; 127:2629-2638. [PMID: 36917503 DOI: 10.1021/acs.jpcb.3c00401] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]
Abstract
A significant challenge in adhesive bonding is the accelerated breaking of stretched adhesives under wet conditions, which is known as cohesive failure. One group of commonly used adhesives consists of the amine-cured epoxy resins. Based on deprotonation free-energy calculations of the unstrained resin in water, it has recently been proposed that these adhesives can undergo failure through breakage originating at the protonated amine group under neutral or acidic conditions [J. Phys. Chem. B 2021, 125, 8989-8996]. In this study, we comprehensively investigated the degree of protonation of the amine group under both stretched and compressed conditions by devising a robust first-principles protonation calculation method applicable to strained materials. It was found that the amine group was partially protonated in neutral water at 298 K and that the amine group was protonated when the epoxy resin was stretched to a greater extent in water, and vice versa. These findings support the physicochemical cause of cohesive failure due to protonation of the amine group in the stretched amine-cured epoxy resins.
Collapse
Affiliation(s)
- Shuji Ogata
- Graduate School of Engineering, Nagoya Institute of Technology, Gokiso-cho, Showa-ku, Nagoya 466-8555, Japan
| | - Masayuki Uranagase
- Graduate School of Engineering, Nagoya Institute of Technology, Gokiso-cho, Showa-ku, Nagoya 466-8555, Japan
| |
Collapse
|
5
|
Chang Y, Hawkins BA, Du JJ, Groundwater PW, Hibbs DE, Lai F. A Guide to In Silico Drug Design. Pharmaceutics 2022; 15:pharmaceutics15010049. [PMID: 36678678 PMCID: PMC9867171 DOI: 10.3390/pharmaceutics15010049] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Revised: 12/16/2022] [Accepted: 12/17/2022] [Indexed: 12/28/2022] Open
Abstract
The drug discovery process is a rocky path that is full of challenges, with the result that very few candidates progress from hit compound to a commercially available product, often due to factors, such as poor binding affinity, off-target effects, or physicochemical properties, such as solubility or stability. This process is further complicated by high research and development costs and time requirements. It is thus important to optimise every step of the process in order to maximise the chances of success. As a result of the recent advancements in computer power and technology, computer-aided drug design (CADD) has become an integral part of modern drug discovery to guide and accelerate the process. In this review, we present an overview of the important CADD methods and applications, such as in silico structure prediction, refinement, modelling and target validation, that are commonly used in this area.
Collapse
Affiliation(s)
- Yiqun Chang
- Sydney Pharmacy School, Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2006, Australia
| | - Bryson A. Hawkins
- Sydney Pharmacy School, Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2006, Australia
| | - Jonathan J. Du
- Department of Biochemistry, Emory University School of Medicine, Atlanta, GA 30322, USA
| | - Paul W. Groundwater
- Sydney Pharmacy School, Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2006, Australia
| | - David E. Hibbs
- Sydney Pharmacy School, Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2006, Australia
| | - Felcia Lai
- Sydney Pharmacy School, Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2006, Australia
- Correspondence:
| |
Collapse
|
6
|
Oe M, Suzuki K, Miki K, Mu H, Ohe K. Steric Control in Activator-Induced Nucleophilic Quencher Detachment-Based Probes: High-Contrast Imaging of Aldehyde Dehydrogenase 1A1 in Cancer Stem Cells. Chempluschem 2022; 87:e202200319. [PMID: 36416250 DOI: 10.1002/cplu.202200319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 10/17/2022] [Indexed: 01/31/2023]
Abstract
Turn-on fluorescence probes can visualize the enzyme activity with high contrast. We have established a new turn-on mechanism, activator-induced nucleophilic quencher detachment (AiQd), and developed AiQd-based turn-on fluorescence probes for the detection of enzymes. Herein, we demonstrate that the precise steric control efficiently quenches the fluorescence of AiQd-based turn-on probes before the enzymatic transformation. Theoretical calculation appropriately predicted the ratio of the fluorescence-quenched closed-ring form of probes. βC5S-A, which has a sterically demanding methyl group at the β-position of a fluorescence-quenching nucleophilic mercapto group, showed a low background signal. βC5S-A responded to aldehyde dehydrogenase 1A1 (ALDH1A1) with high selectivity, thereby enabling high-contrast live imaging of cancer stem cells (signal-to-noise ratio >10). The ALDH1A1-responsiveness of βC5S-A was not significantly affected by amino acids and biological thiols, such as cysteine and glutathione.
Collapse
Affiliation(s)
- Masahiro Oe
- Department of Energy and Hydrocarbon Chemistry, Graduate School of Engineering, Kyoto University, Katsura, Nishikyo-ku, 615-8510, Kyoto, Japan
| | - Kanae Suzuki
- Department of Energy and Hydrocarbon Chemistry, Graduate School of Engineering, Kyoto University, Katsura, Nishikyo-ku, 615-8510, Kyoto, Japan
| | - Koji Miki
- Department of Energy and Hydrocarbon Chemistry, Graduate School of Engineering, Kyoto University, Katsura, Nishikyo-ku, 615-8510, Kyoto, Japan
| | - Huiying Mu
- Department of Energy and Hydrocarbon Chemistry, Graduate School of Engineering, Kyoto University, Katsura, Nishikyo-ku, 615-8510, Kyoto, Japan
| | - Kouichi Ohe
- Department of Energy and Hydrocarbon Chemistry, Graduate School of Engineering, Kyoto University, Katsura, Nishikyo-ku, 615-8510, Kyoto, Japan
| |
Collapse
|
7
|
Wu J, Kang Y, Pan P, Hou T. Machine learning methods for pK a prediction of small molecules: Advances and challenges. Drug Discov Today 2022; 27:103372. [PMID: 36167281 DOI: 10.1016/j.drudis.2022.103372] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 08/15/2022] [Accepted: 09/21/2022] [Indexed: 11/27/2022]
Abstract
The acid-base dissociation constant (pKa) is a fundamental property influencing many ADMET properties of small molecules. However, rapid and accurate pKa prediction remains a great challenge. In this review, we outline the current advances in machine-learning-based QSAR models for pKa prediction, including descriptor-based and graph-based approaches, and summarize their pros and cons. Moreover, we highlight the current challenges and future directions regarding experimental data, crucial factors influencing pKa and in silico prediction tools. We hope that this review can provide a practical guidance for the follow-up studies.
Collapse
Affiliation(s)
- Jialu Wu
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Yu Kang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Peichen Pan
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, China.
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, China.
| |
Collapse
|
8
|
Bilska-Markowska M, Jankowski W, Hoffmann M, Kaźmierczak M. Design and Synthesis of New α-hydroxy β-fluoro/β-trifluoromethyl and Unsaturated Phosphonates from Carbohydrate-Derived Building Blocks via Pudovik and Horner–Wadsworth–Emmons Reactions. Molecules 2022; 27:molecules27175404. [PMID: 36080169 PMCID: PMC9457578 DOI: 10.3390/molecules27175404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 08/21/2022] [Accepted: 08/22/2022] [Indexed: 11/16/2022] Open
Abstract
Herein, we present the application of fluorinated carbohydrate-derived building blocks for α-hydroxy β-fluoro/β-trifluoromethyl and unsaturated phosphonates synthesis. Pudovik and Horner–Wadsworth–Emmons reactions were applied to achieve this goal. The proposed pathway of the key reactions is supported by the experimental results, as well as quantum chemical calculations. The structure of the products was established by spectroscopic (1D, 2D NMR) and spectrometric (MS) techniques. Based on our data received, we claim that the progress of the Pudovik and HWE reactions is significantly influenced by the acidic protons present in the molecules as assessed by pKa values of the reagent.
Collapse
Affiliation(s)
- Monika Bilska-Markowska
- Faculty of Chemistry, Adam Mickiewicz University, Uniwersytetu Poznańskiego 8, 61-614 Poznań, Poland
| | - Wojciech Jankowski
- Faculty of Chemistry, Adam Mickiewicz University, Uniwersytetu Poznańskiego 8, 61-614 Poznań, Poland
| | - Marcin Hoffmann
- Faculty of Chemistry, Adam Mickiewicz University, Uniwersytetu Poznańskiego 8, 61-614 Poznań, Poland
| | - Marcin Kaźmierczak
- Faculty of Chemistry, Adam Mickiewicz University, Uniwersytetu Poznańskiego 8, 61-614 Poznań, Poland
- Centre for Advanced Technologies, Adam Mickiewicz University, Uniwersytetu Poznańskiego 10, 61-614 Poznań, Poland
- Correspondence:
| |
Collapse
|
9
|
Mayr F, Wieder M, Wieder O, Langer T. Improving Small Molecule pKa Prediction Using Transfer Learning With Graph Neural Networks. Front Chem 2022; 10:866585. [PMID: 35721000 PMCID: PMC9204323 DOI: 10.3389/fchem.2022.866585] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 04/04/2022] [Indexed: 11/13/2022] Open
Abstract
Enumerating protonation states and calculating microstate pKa values of small molecules is an important yet challenging task for lead optimization and molecular modeling. Commercial and non-commercial solutions have notable limitations such as restrictive and expensive licenses, high CPU/GPU hour requirements, or the need for expert knowledge to set up and use. We present a graph neural network model that is trained on 714,906 calculated microstate pKa predictions from molecules obtained from the ChEMBL database. The model is fine-tuned on a set of 5,994 experimental pKa values significantly improving its performance on two challenging test sets. Combining the graph neural network model with Dimorphite-DL, an open-source program for enumerating ionization states, we have developed the open-source Python package pkasolver, which is able to generate and enumerate protonation states and calculate pKa values with high accuracy.
Collapse
|
10
|
|
11
|
Pan X, Wang H, Li C, Zhang JZH, Ji C. MolGpka: A Web Server for Small Molecule p Ka Prediction Using a Graph-Convolutional Neural Network. J Chem Inf Model 2021; 61:3159-3165. [PMID: 34251213 DOI: 10.1021/acs.jcim.1c00075] [Citation(s) in RCA: 60] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
pKa is an important property in the lead optimization process since the charge state of a molecule in physiologic pH plays a critical role in its biological activity, solubility, membrane permeability, metabolism, and toxicity. Accurate and fast estimation of small molecule pKa is vital during the drug discovery process. We present MolGpKa, a web server for pKa prediction using a graph-convolutional neural network model. The model works by learning pKa related chemical patterns automatically and building reliable predictors with learned features. ACD/pKa data for 1.6 million compounds from the ChEMBL database was used for model training. We found that the performance of the model is better than machine learning models built with human-engineered fingerprints. Detailed analysis shows that the substitution effect on pKa is well learned by the model. MolGpKa is a handy tool for the rapid estimation of pKa during the ligand design process. The MolGpKa server is freely available to researchers and can be accessed at https://xundrug.cn/molgpka.
Collapse
Affiliation(s)
- Xiaolin Pan
- Shanghai Engineering Research Center for Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China.,NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| | - Hao Wang
- Shanghai Engineering Research Center for Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China.,NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| | - Cuiyu Li
- Advanced Computing East China Sub-center, Suma Technology Co., Ltd., Kunshan 215300, China
| | - John Z H Zhang
- Shanghai Engineering Research Center for Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China.,NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China.,Department of Chemistry, New York University, New York, New York 10003, United States.,Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan, Shanxi 030006, China
| | - Changge Ji
- Shanghai Engineering Research Center for Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China.,NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| |
Collapse
|
12
|
Bergazin TD, Tielker N, Zhang Y, Mao J, Gunner MR, Francisco K, Ballatore C, Kast SM, Mobley DL. Evaluation of log P, pK a, and log D predictions from the SAMPL7 blind challenge. J Comput Aided Mol Des 2021; 35:771-802. [PMID: 34169394 PMCID: PMC8224998 DOI: 10.1007/s10822-021-00397-3] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Accepted: 06/05/2021] [Indexed: 12/16/2022]
Abstract
The Statistical Assessment of Modeling of Proteins and Ligands (SAMPL) challenges focuses the computational modeling community on areas in need of improvement for rational drug design. The SAMPL7 physical property challenge dealt with prediction of octanol-water partition coefficients and pKa for 22 compounds. The dataset was composed of a series of N-acylsulfonamides and related bioisosteres. 17 research groups participated in the log P challenge, submitting 33 blind submissions total. For the pKa challenge, 7 different groups participated, submitting 9 blind submissions in total. Overall, the accuracy of octanol-water log P predictions in the SAMPL7 challenge was lower than octanol-water log P predictions in SAMPL6, likely due to a more diverse dataset. Compared to the SAMPL6 pKa challenge, accuracy remains unchanged in SAMPL7. Interestingly, here, though macroscopic pKa values were often predicted with reasonable accuracy, there was dramatically more disagreement among participants as to which microscopic transitions produced these values (with methods often disagreeing even as to the sign of the free energy change associated with certain transitions), indicating far more work needs to be done on pKa prediction methods.
Collapse
Affiliation(s)
| | - Nicolas Tielker
- Physikalische Chemie III, Technische Universität Dortmund, Otto-Hahn-Str. 4a, 44227, Dortmund, Germany
| | - Yingying Zhang
- Department of Physics, The Graduate Center, City University of New York, New York, 10016, USA
| | - Junjun Mao
- Department of Physics, City College of New York, New York, 10031, USA
| | - M R Gunner
- Department of Physics, The Graduate Center, City University of New York, New York, 10016, USA.,Department of Physics, City College of New York, New York, 10031, USA
| | - Karol Francisco
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, Ja Jolla, CA, 92093-0756, USA
| | - Carlo Ballatore
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, Ja Jolla, CA, 92093-0756, USA
| | - Stefan M Kast
- Physikalische Chemie III, Technische Universität Dortmund, Otto-Hahn-Str. 4a, 44227, Dortmund, Germany
| | - David L Mobley
- Department of Pharmaceutical Sciences, University of California, Irvine, Irvine, CA, 92697, USA. .,Department of Chemistry, University of California, Irvine, Irvine, CA, 92697, USA.
| |
Collapse
|
13
|
Natho P, Yang Z, Allen LAT, Rey J, White AJP, Parsons PJ. An entry to 2-(cyclobut-1-en-1-yl)-1 H-indoles through a cyclobutenylation/deprotection cascade. Org Biomol Chem 2021; 19:4048-4053. [PMID: 33885127 DOI: 10.1039/d1ob00430a] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
A transition-metal-free strategy for the synthesis of 2-(cyclobut-1-en-1-yl)-1H-indoles under mild conditions is described herein. A series of substituted 2-(cyclobut-1-en-1-yl)-1H-indoles are accessed by a one-pot cyclobutenylation/deprotection cascade from N-Boc protected indoles. Preliminary experimental and density functional theory calculations suggest that a Boc-group transfer is involved in the underlying mechanism.
Collapse
Affiliation(s)
- Philipp Natho
- Department of Chemistry, Imperial College London, Molecular Sciences Research Hub, W12 0BZ London, UK.
| | - Zeyu Yang
- Department of Chemistry, Imperial College London, Molecular Sciences Research Hub, W12 0BZ London, UK.
| | - Lewis A T Allen
- Department of Chemistry, Imperial College London, Molecular Sciences Research Hub, W12 0BZ London, UK.
| | - Juliette Rey
- Department of Chemistry, Imperial College London, Molecular Sciences Research Hub, W12 0BZ London, UK.
| | - Andrew J P White
- Department of Chemistry, Imperial College London, Molecular Sciences Research Hub, W12 0BZ London, UK.
| | - Philip J Parsons
- Department of Chemistry, Imperial College London, Molecular Sciences Research Hub, W12 0BZ London, UK.
| |
Collapse
|
14
|
Işık M, Rustenburg AS, Rizzi A, Gunner MR, Mobley DL, Chodera JD. Overview of the SAMPL6 pK a challenge: evaluating small molecule microscopic and macroscopic pK a predictions. J Comput Aided Mol Des 2021; 35:131-166. [PMID: 33394238 PMCID: PMC7904668 DOI: 10.1007/s10822-020-00362-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 11/17/2020] [Indexed: 01/01/2023]
Abstract
The prediction of acid dissociation constants (pKa) is a prerequisite for predicting many other properties of a small molecule, such as its protein-ligand binding affinity, distribution coefficient (log D), membrane permeability, and solubility. The prediction of each of these properties requires knowledge of the relevant protonation states and solution free energy penalties of each state. The SAMPL6 pKa Challenge was the first time that a separate challenge was conducted for evaluating pKa predictions as part of the Statistical Assessment of Modeling of Proteins and Ligands (SAMPL) exercises. This challenge was motivated by significant inaccuracies observed in prior physical property prediction challenges, such as the SAMPL5 log D Challenge, caused by protonation state and pKa prediction issues. The goal of the pKa challenge was to assess the performance of contemporary pKa prediction methods for drug-like molecules. The challenge set was composed of 24 small molecules that resembled fragments of kinase inhibitors, a number of which were multiprotic. Eleven research groups contributed blind predictions for a total of 37 pKa distinct prediction methods. In addition to blinded submissions, four widely used pKa prediction methods were included in the analysis as reference methods. Collecting both microscopic and macroscopic pKa predictions allowed in-depth evaluation of pKa prediction performance. This article highlights deficiencies of typical pKa prediction evaluation approaches when the distinction between microscopic and macroscopic pKas is ignored; in particular, we suggest more stringent evaluation criteria for microscopic and macroscopic pKa predictions guided by the available experimental data. Top-performing submissions for macroscopic pKa predictions achieved RMSE of 0.7-1.0 pKa units and included both quantum chemical and empirical approaches, where the total number of extra or missing macroscopic pKas predicted by these submissions were fewer than 8 for 24 molecules. A large number of submissions had RMSE spanning 1-3 pKa units. Molecules with sulfur-containing heterocycles or iodo and bromo groups were less accurately predicted on average considering all methods evaluated. For a subset of molecules, we utilized experimentally-determined microstates based on NMR to evaluate the dominant tautomer predictions for each macroscopic state. Prediction of dominant tautomers was a major source of error for microscopic pKa predictions, especially errors in charged tautomers. The degree of inaccuracy in pKa predictions observed in this challenge is detrimental to the protein-ligand binding affinity predictions due to errors in dominant protonation state predictions and the calculation of free energy corrections for multiple protonation states. Underestimation of ligand pKa by 1 unit can lead to errors in binding free energy errors up to 1.2 kcal/mol. The SAMPL6 pKa Challenge demonstrated the need for improving pKa prediction methods for drug-like molecules, especially for challenging moieties and multiprotic molecules.
Collapse
Affiliation(s)
- Mehtap Işık
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA.
- Tri-Institutional PhD Program in Chemical Biology, Weill Cornell Graduate School of Medical Sciences, Cornell University, New York, NY, 10065, USA.
| | - Ariën S Rustenburg
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
- Graduate Program in Physiology, Biophysics, and Systems Biology, Weill Cornell Medical College, New York, NY, 10065, USA
| | - Andrea Rizzi
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
- Tri-Institutional PhD Program in Computational Biology and Medicine, Weill Cornell Graduate School of Medical Sciences, Cornell University, New York, NY, 10065, USA
| | - M R Gunner
- Department of Physics, City College of New York, New York, NY, 10031, USA
| | - David L Mobley
- Department of Pharmaceutical Sciences and Department of Chemistry, University of California, Irvine, Irvine, CA, 92697, USA
| | - John D Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| |
Collapse
|
15
|
Sinha V, Laan JJ, Pidko EA. Accurate and rapid prediction of pKa of transition metal complexes: semiempirical quantum chemistry with a data-augmented approach. Phys Chem Chem Phys 2021; 23:2557-2567. [DOI: 10.1039/d0cp05281g] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Data-augmented high-throughput QM approach to compute pKa of transition metal hydride complexes with hDFT accuracy and low cost.
Collapse
Affiliation(s)
- Vivek Sinha
- Inorganic Systems Engineering
- Department of Chemical Engineering
- Faculty of Applied Sciences
- Delft University of Technology
- Delft
| | - Jochem J. Laan
- Inorganic Systems Engineering
- Department of Chemical Engineering
- Faculty of Applied Sciences
- Delft University of Technology
- Delft
| | - Evgeny A. Pidko
- Inorganic Systems Engineering
- Department of Chemical Engineering
- Faculty of Applied Sciences
- Delft University of Technology
- Delft
| |
Collapse
|
16
|
Malloum A, Fifen JJ, Conradie J. Determination of the absolute solvation free energy and enthalpy of the proton in solutions. J Mol Liq 2021. [DOI: 10.1016/j.molliq.2020.114919] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
|
17
|
Yang Q, Li Y, Yang J, Liu Y, Zhang L, Luo S, Cheng J. Holistic Prediction of the p
K
a
in Diverse Solvents Based on a Machine‐Learning Approach. Angew Chem Int Ed Engl 2020; 59:19282-19291. [DOI: 10.1002/anie.202008528] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Revised: 07/13/2020] [Indexed: 12/12/2022]
Affiliation(s)
- Qi Yang
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| | - Yao Li
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| | - Jin‐Dong Yang
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| | - Yidi Liu
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| | - Long Zhang
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| | - Sanzhong Luo
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| | - Jin‐Pei Cheng
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| |
Collapse
|
18
|
Yang Q, Li Y, Yang J, Liu Y, Zhang L, Luo S, Cheng J. Holistic Prediction of the p
K
a
in Diverse Solvents Based on a Machine‐Learning Approach. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.202008528] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Qi Yang
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| | - Yao Li
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| | - Jin‐Dong Yang
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| | - Yidi Liu
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| | - Long Zhang
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| | - Sanzhong Luo
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| | - Jin‐Pei Cheng
- Center of Basic Molecular Science Department of Chemistry Tsinghua University 100084 Beijing China
| |
Collapse
|
19
|
Hunt P, Hosseini-Gerami L, Chrien T, Plante J, Ponting DJ, Segall M. Predicting p Ka Using a Combination of Semi-Empirical Quantum Mechanics and Radial Basis Function Methods. J Chem Inf Model 2020; 60:2989-2997. [PMID: 32357002 DOI: 10.1021/acs.jcim.0c00105] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The acid dissociation constant (pKa) has an important influence on molecular properties crucial to compound development in synthesis, formulation, and optimization of absorption, distribution, metabolism, and excretion properties. We will present a method that combines quantum mechanical calculations, at a semi-empirical level of theory, with machine learning to accurately predict pKa for a diverse range of mono- and polyprotic compounds. The resulting model has been tested on two external data sets, one specifically used to test pKa prediction methods (SAMPL6) and the second covering known drugs containing basic functionalities. Both sets were predicted with excellent accuracy (root-mean-square errors of 0.7-1.0 log units), comparable to other methodologies using a much higher level of theory and computational cost.
Collapse
Affiliation(s)
- Peter Hunt
- Optibrium Ltd., F5-6 Blenheim House, Cambridge Innovation Park, Denny End Road, Cambridge CB25 9PB, U.K
| | - Layla Hosseini-Gerami
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K
| | - Tomas Chrien
- Optibrium Ltd., F5-6 Blenheim House, Cambridge Innovation Park, Denny End Road, Cambridge CB25 9PB, U.K
| | - Jeffrey Plante
- Lhasa Limited, Granary Wharf House, 2 Canal Wharf, Leeds LS11 5PS, U.K
| | - David J Ponting
- Lhasa Limited, Granary Wharf House, 2 Canal Wharf, Leeds LS11 5PS, U.K
| | - Matthew Segall
- Optibrium Ltd., F5-6 Blenheim House, Cambridge Innovation Park, Denny End Road, Cambridge CB25 9PB, U.K
| |
Collapse
|
20
|
Patel P, Kuntz DM, Jones MR, Brooks BR, Wilson AK. SAMPL6 logP challenge: machine learning and quantum mechanical approaches. J Comput Aided Mol Des 2020; 34:495-510. [PMID: 32002780 PMCID: PMC10817701 DOI: 10.1007/s10822-020-00287-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Accepted: 01/08/2020] [Indexed: 10/25/2022]
Abstract
Two different types of approaches: (a) approaches that combine quantitative structure activity relationships, quantum mechanical electronic structure methods, and machine-learning and, (b) electronic structure vertical solvation approaches, were used to predict the logP coefficients of 11 molecules as part of the SAMPL6 logP blind prediction challenge. Using electronic structures optimized with density functional theory (DFT), several molecular descriptors were calculated for each molecule, including van der Waals areas and volumes, HOMO/LUMO energies, dipole moments, polarizabilities, and electrophilic and nucleophilic superdelocalizabilities. A multilinear regression model and a partial least squares model were used to train a set of 97 molecules. As well, descriptors were generated using the molecular operating environment and used to create additional machine learning models. Electronic structure vertical solvation approaches considered include DFT and the domain-based local pair natural orbital methods combined with the solvated variant of the correlation consistent composite approach.
Collapse
Affiliation(s)
- Prajay Patel
- Department of Chemistry, Michigan State University, East Lansing, MI, 48824-1322, USA
| | - David M Kuntz
- Department of Chemistry and Center for Advanced Scientific Computing and Modeling (CASCaM), University of North Texas, Denton, TX, 76203-5070, USA
| | - Michael R Jones
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20852-5690, USA
| | - Bernard R Brooks
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20852-5690, USA
| | - Angela K Wilson
- Department of Chemistry, Michigan State University, East Lansing, MI, 48824-1322, USA.
- Department of Chemistry and Center for Advanced Scientific Computing and Modeling (CASCaM), University of North Texas, Denton, TX, 76203-5070, USA.
| |
Collapse
|
21
|
Standard state free energies, not pK as, are ideal for describing small molecule protonation and tautomeric states. J Comput Aided Mol Des 2020; 34:561-573. [PMID: 32052350 DOI: 10.1007/s10822-020-00280-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2019] [Accepted: 01/08/2020] [Indexed: 12/14/2022]
Abstract
The pKa is the standard measure used to describe the aqueous proton affinity of a compound, indicating the proton concentration (pH) at which two protonation states (e.g. A- and AH) have equal free energy. However, compounds can have additional protonation states (e.g. AH2+), and may assume multiple tautomeric forms, with the protons in different positions (microstates). Macroscopic pKas give the pH where the molecule changes its total number of protons, while microscopic pKas identify the tautomeric states involved. As tautomers have the same number of protons, the free energy difference between them and their relative probability is pH independent so there is no pKa connecting them. The question arises: What is the best way to describe protonation equilibria of a complex molecule in any pH range? Knowing the number of protons and the relative free energy of all microstates at a single pH, ∆G°, provides all the information needed to determine the free energy, and thus the probability of each microstate at each pH. Microstate probabilities as a function of pH generate titration curves that highlight the low energy, observable microstates, which can then be compared with experiment. A network description connecting microstates as nodes makes it straightforward to test thermodynamic consistency of microstate free energies. The utility of this analysis is illustrated by a description of one molecule from the SAMPL6 Blind pKa Prediction Challenge. Analysis of microstate ∆G°s also makes a more compact way to archive and compare the pH dependent behavior of compounds with multiple protonatable sites.
Collapse
|
22
|
Jones MR, Brooks BR. Quantum chemical predictions of water-octanol partition coefficients applied to the SAMPL6 logP blind challenge. J Comput Aided Mol Des 2020; 34:485-493. [PMID: 32002778 DOI: 10.1007/s10822-020-00286-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 01/08/2020] [Indexed: 11/30/2022]
Abstract
Theoretical approaches for predicting physicochemical properties are valuable tools for accelerating the drug discovery process. In this work, quantum chemical methods are used to predict water-octanol partition coefficients as a part of the SAMPL6 blind challenge. The SMD continuum solvent model was employed with MP2 and eight DFT functionals in conjunction with correlation consistent basis sets to determine the water-octanol transfer free energy. Several tactics towards improving the predictions of the partition coefficient were examined, including increasing the quality of basis sets, considering tautomerization, and accounting for inhomogeneities in the water and n-octanol phases. Evaluation of these various schemes highlights the impact of modeling approaches across different methods. With the inclusion of tautomers and adjustments to the permittivity constants, the best predictions were obtained with smaller basis sets and the O3LYP functional, which yielded an RMSE of 0.79 logP units. The results presented correspond to the SAMPL6 logP submission IDs: DYXBT, O7DJK, and AHMTF.
Collapse
Affiliation(s)
- Michael R Jones
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892-5690, USA.
| | - Bernard R Brooks
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892-5690, USA
| |
Collapse
|