1
|
Wu Y, Liu F, Glenn I, Fonseca-Valencia K, Paris L, Xiong Y, Jerome SV, Brooks CL, Shoichet BK. Identifying Artifacts from Large Library Docking. J Med Chem 2024; 67:16796-16806. [PMID: 39255340 DOI: 10.1021/acs.jmedchem.4c01632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/12/2024]
Abstract
While large library docking has discovered potent ligands for multiple targets, as the libraries have grown the hit lists can become dominated by rare artifacts that cheat our scoring functions. Here, we investigate rescoring top-ranked docked molecules with orthogonal methods to identify these artifacts, exploring implicit solvent models and absolute binding free energy perturbation as cross-filters. In retrospective studies, this approach deprioritized high-ranking nonbinders for nine targets while leaving true ligands relatively unaffected. We tested the method prospectively against hits from docking against AmpC β-lactamase. We prioritized 128 high-ranking molecules for synthesis and testing, a mixture of 39 molecules flagged as likely cheaters and 89 that were plausible inhibitors. None of the predicted cheating compounds inhibited AmpC detectably, while 57% of the 89 plausible compounds did so. As our libraries continue to grow, deprioritizing docking artifacts by rescoring with orthogonal methods may find wide use.
Collapse
Affiliation(s)
- Yujin Wu
- Department of Pharmaceutical Chemistry, University of California, San Francisco, California 94158, United States
| | - Fangyu Liu
- Department of Pharmaceutical Chemistry, University of California, San Francisco, California 94158, United States
| | - Isabella Glenn
- Department of Pharmaceutical Chemistry, University of California, San Francisco, California 94158, United States
| | - Karla Fonseca-Valencia
- Department of Pharmaceutical Chemistry, University of California, San Francisco, California 94158, United States
| | - Lu Paris
- Department of Pharmaceutical Chemistry, University of California, San Francisco, California 94158, United States
| | - Yuyue Xiong
- Schrödinger, Inc., 9868 Scranton Road, San Diego, California 92121, United States
| | - Steven V Jerome
- Schrödinger, Inc., 1540 Broadway, New York, New York 10036, United States
| | - Charles L Brooks
- Biophysics Program, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Brian K Shoichet
- Department of Pharmaceutical Chemistry, University of California, San Francisco, California 94158, United States
| |
Collapse
|
2
|
Wu Y, Liu F, Glenn I, Fonseca-Valencia K, Paris L, Xiong Y, Jerome SV, Brooks CL, Shoichet BK. Identifying Artifacts from Large Library Docking. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.17.603966. [PMID: 39071262 PMCID: PMC11275789 DOI: 10.1101/2024.07.17.603966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
While large library docking has discovered potent ligands for multiple targets, as the libraries have grown, the very top of the hit-lists can become populated with artifacts that cheat our scoring functions. Though these cheating molecules are rare, they become ever-more dominant with library growth. Here, we investigate rescoring top-ranked molecules from docking screens with orthogonal methods to identify these artifacts, exploring implicit solvent models and absolute binding free energy perturbation (AB-FEP) as cross-filters. In retrospective studies, this approach deprioritized high-ranking non-binders for nine targets while leaving true ligands relatively unaffected. We tested the method prospectively against results from large library docking AmpC β-lactamase. From the very top of the docking hit lists, we prioritized 128 molecules for synthesis and experimental testing, a mixture of 39 molecules that rescoring flagged as likely cheaters and another 89 that were plausible true actives. None of the 39 predicted cheating compounds inhibited AmpC up to 200 μ M in enzyme assays, while 57% of the 89 plausible true actives did do so, with 19 of them inhibiting the enzyme with apparentK i values better than 50 μ M . As our libraries continue to grow, a strategy of catching docking artifacts by rescoring with orthogonal methods may find wide use in the field.
Collapse
Affiliation(s)
- Yujin Wu
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 94158, United States
| | - Fangyu Liu
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 94158, United States
| | - Isabella Glenn
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 94158, United States
| | - Karla Fonseca-Valencia
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 94158, United States
| | - Lu Paris
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 94158, United States
| | - Yuyue Xiong
- Schrödinger, Inc., 9868 Scranton Road, San Diego, California 92121, United States
| | - Steven V. Jerome
- Schrödinger, Inc., 1540 Broadway, New York, New York, 10036, United States
| | - Charles L. Brooks
- Biophysics Program, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Brian K. Shoichet
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 94158, United States
| |
Collapse
|
3
|
Luginina AP, Khnykin AN, Khorn PA, Moiseeva OV, Safronova NA, Pospelov VA, Dashevskii DE, Belousov AS, Borschevskiy VI, Mishin AV. Rational Design of Drugs Targeting G-Protein-Coupled Receptors: Ligand Search and Screening. BIOCHEMISTRY. BIOKHIMIIA 2024; 89:958-972. [PMID: 38880655 DOI: 10.1134/s0006297924050158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Revised: 02/22/2024] [Accepted: 02/23/2024] [Indexed: 06/18/2024]
Abstract
G protein-coupled receptors (GPCRs) are transmembrane proteins that participate in many physiological processes and represent major pharmacological targets. Recent advances in structural biology of GPCRs have enabled the development of drugs based on the receptor structure (structure-based drug design, SBDD). SBDD utilizes information about the receptor-ligand complex to search for suitable compounds, thus expanding the chemical space of possible receptor ligands without the need for experimental screening. The review describes the use of structure-based virtual screening (SBVS) for GPCR ligands and approaches for the functional testing of potential drug compounds, as well as discusses recent advances and successful examples in the application of SBDD for the identification of GPCR ligands.
Collapse
Affiliation(s)
- Aleksandra P Luginina
- Research Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
| | - Andrey N Khnykin
- Research Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
| | - Polina A Khorn
- Research Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
| | - Olga V Moiseeva
- Research Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
- Skryabin Institute of Biochemistry and Physiology of Microorganisms, Russian Academy of Sciences, Pushchino, Moscow Region, 142290, Russia
| | - Nadezhda A Safronova
- Research Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
| | - Vladimir A Pospelov
- Research Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
| | - Dmitrii E Dashevskii
- Research Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
| | - Anatolii S Belousov
- Research Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
| | - Valentin I Borschevskiy
- Research Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia.
- Frank Laboratory of Neutron Physics, Joint Institute for Nuclear Research, Dubna, Moscow Region, 141980, Russia
| | - Alexey V Mishin
- Research Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia.
| |
Collapse
|
4
|
Marin E, Kovaleva M, Kadukova M, Mustafin K, Khorn P, Rogachev A, Mishin A, Guskov A, Borshchevskiy V. Regression-Based Active Learning for Accessible Acceleration of Ultra-Large Library Docking. J Chem Inf Model 2024; 64:2612-2623. [PMID: 38157481 PMCID: PMC11005039 DOI: 10.1021/acs.jcim.3c01661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 11/28/2023] [Accepted: 12/04/2023] [Indexed: 01/03/2024]
Abstract
Structure-based drug discovery is a process for both hit finding and optimization that relies on a validated three-dimensional model of a target biomolecule, used to rationalize the structure-function relationship for this particular target. An ultralarge virtual screening approach has emerged recently for rapid discovery of high-affinity hit compounds, but it requires substantial computational resources. This study shows that active learning with simple linear regression models can accelerate virtual screening, retrieving up to 90% of the top-1% of the docking hit list after docking just 10% of the ligands. The results demonstrate that it is unnecessary to use complex models, such as deep learning approaches, to predict the imprecise results of ligand docking with a low sampling depth. Furthermore, we explore active learning meta-parameters and find that constant batch size models with a simple ensembling method provide the best ligand retrieval rate. Finally, our approach is validated on the ultralarge size virtual screening data set, retrieving 70% of the top-0.05% of ligands after screening only 2% of the library. Altogether, this work provides a computationally accessible approach for accelerated virtual screening that can serve as a blueprint for the future design of low-compute agents for exploration of the chemical space via large-scale accelerated docking. With recent breakthroughs in protein structure prediction, this method can significantly increase accessibility for the academic community and aid in the rapid discovery of high-affinity hit compounds for various targets.
Collapse
Affiliation(s)
- Egor Marin
- Research
Center for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| | - Margarita Kovaleva
- Research
Center for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| | - Maria Kadukova
- Research
Center for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
- University
Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK, 38000 Grenoble, France
| | - Khalid Mustafin
- Research
Center for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| | - Polina Khorn
- Research
Center for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| | - Andrey Rogachev
- Research
Center for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
- Joint
Institute for Nuclear Research, Dubna 141980, Russian
Federation
| | - Alexey Mishin
- Research
Center for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| | - Albert Guskov
- Groningen
Biomolecular Sciences and Biotechnology Institute, University of Groningen, Nijenborgh 4, 9747 AG Groningen, The Netherlands
| | - Valentin Borshchevskiy
- Research
Center for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
- Joint
Institute for Nuclear Research, Dubna 141980, Russian
Federation
| |
Collapse
|
5
|
Sindt F, Seyller A, Eguida M, Rognan D. Protein Structure-Based Organic Chemistry-Driven Ligand Design from Ultralarge Chemical Spaces. ACS CENTRAL SCIENCE 2024; 10:615-627. [PMID: 38559302 PMCID: PMC10979501 DOI: 10.1021/acscentsci.3c01521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 01/25/2024] [Accepted: 01/29/2024] [Indexed: 04/04/2024]
Abstract
Ultralarge chemical spaces describing several billion compounds are revolutionizing hit identification in early drug discovery. Because of their size, such chemical spaces cannot be fully enumerated and require ad-hoc computational tools to navigate them and pick potentially interesting hits. We here propose a structure-based approach to ultralarge chemical space screening in which commercial chemical reagents are first docked to the target of interest and then directly connected according to organic chemistry and topological rules, to enumerate drug-like compounds under three-dimensional constraints of the target. When applied to bespoke chemical spaces of different sizes and chemical complexity targeting two receptors of pharmaceutical interest (estrogen β receptor, dopamine D3 receptor), the computational method was able to quickly enumerate hits that were either known ligands (or very close analogs) of targeted receptors as well as chemically novel candidates that could be experimentally confirmed by in vitro binding assays. The proposed approach is generic, can be applied to any docking algorithm, and requires few computational resources to prioritize easily synthesizable hits from billion-sized chemical spaces.
Collapse
Affiliation(s)
- François Sindt
- Laboratoire d’innovation
thérapeutique, UMR7200 CNRS-Université de Strasbourg, Illkirch 67400, France
| | - Anthony Seyller
- Laboratoire d’innovation
thérapeutique, UMR7200 CNRS-Université de Strasbourg, Illkirch 67400, France
| | | | - Didier Rognan
- Laboratoire d’innovation
thérapeutique, UMR7200 CNRS-Université de Strasbourg, Illkirch 67400, France
| |
Collapse
|
6
|
Knight IS, Mailhot O, Tang KG, Irwin JJ. DockOpt: A Tool for Automatic Optimization of Docking Models. J Chem Inf Model 2024; 64:1004-1016. [PMID: 38206771 PMCID: PMC10865354 DOI: 10.1021/acs.jcim.3c01406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2023] [Revised: 12/17/2023] [Accepted: 12/26/2023] [Indexed: 01/13/2024]
Abstract
Molecular docking is a widely used technique for leveraging protein structure for ligand discovery, but it remains difficult to utilize due to limitations that have not been adequately addressed. Despite some progress toward automation, docking still requires expert guidance, hindering its adoption by a broader range of investigators. To make docking more accessible, we developed a new utility called DockOpt, which automates the creation, evaluation, and optimization of docking models prior to their deployment in large-scale prospective screens. DockOpt outperforms our previous automated pipeline across all 43 targets in the DUDE-Z benchmark data set, and the generated models for 84% of targets demonstrate sufficient enrichment to warrant their use in prospective screens, with normalized LogAUC values of at least 15%. DockOpt is available as part of the Python package Pydock3 included in the UCSF DOCK 3.8 distribution, which is available for free to academic researchers at https://dock.compbio.ucsf.edu and free for everyone upon registration at https://tldr.docking.org.
Collapse
Affiliation(s)
- Ian S. Knight
- Department of Pharmaceutical Chemistry, UCSF, 1700 Fourth Street, San Francisco, California 94158-2330, United States
| | - Olivier Mailhot
- Department of Pharmaceutical Chemistry, UCSF, 1700 Fourth Street, San Francisco, California 94158-2330, United States
| | - Khanh G. Tang
- Department of Pharmaceutical Chemistry, UCSF, 1700 Fourth Street, San Francisco, California 94158-2330, United States
| | - John J. Irwin
- Department of Pharmaceutical Chemistry, UCSF, 1700 Fourth Street, San Francisco, California 94158-2330, United States
| |
Collapse
|
7
|
Gahbauer S, DeLeon C, Braz JM, Craik V, Kang HJ, Wan X, Huang XP, Billesbølle CB, Liu Y, Che T, Deshpande I, Jewell M, Fink EA, Kondratov IS, Moroz YS, Irwin JJ, Basbaum AI, Roth BL, Shoichet BK. Docking for EP4R antagonists active against inflammatory pain. Nat Commun 2023; 14:8067. [PMID: 38057319 PMCID: PMC10700596 DOI: 10.1038/s41467-023-43506-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 11/12/2023] [Indexed: 12/08/2023] Open
Abstract
The lipid prostaglandin E2 (PGE2) mediates inflammatory pain by activating G protein-coupled receptors, including the prostaglandin E2 receptor 4 (EP4R). Nonsteroidal anti-inflammatory drugs (NSAIDs) reduce nociception by inhibiting prostaglandin synthesis, however, the disruption of upstream prostanoid biosynthesis can lead to pleiotropic effects including gastrointestinal bleeding and cardiac complications. In contrast, by acting downstream, EP4R antagonists may act specifically as anti-inflammatory agents and, to date, no selective EP4R antagonists have been approved for human use. In this work, seeking to diversify EP4R antagonist scaffolds, we computationally dock over 400 million compounds against an EP4R crystal structure and experimentally validate 71 highly ranked, de novo synthesized molecules. Further, we show how structure-based optimization of initial docking hits identifies a potent and selective antagonist with 16 nanomolar potency. Finally, we demonstrate favorable pharmacokinetics for the discovered compound as well as anti-allodynic and anti-inflammatory activity in several preclinical pain models in mice.
Collapse
Affiliation(s)
- Stefan Gahbauer
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Chelsea DeLeon
- Department of Pharmacology, University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, NC, 27514, USA
| | - Joao M Braz
- Department of Anatomy, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Veronica Craik
- Department of Anatomy, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Hye Jin Kang
- Department of Pharmacology, University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, NC, 27514, USA
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul, South Korea
| | - Xiaobo Wan
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Xi-Ping Huang
- Department of Pharmacology, University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, NC, 27514, USA
| | - Christian B Billesbølle
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Yongfeng Liu
- Department of Pharmacology, University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, NC, 27514, USA
| | - Tao Che
- Department of Pharmacology, University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, NC, 27514, USA
- Center of Clinical Pharmacology, Department of Anesthesiology, Washington University School of Medicine, St. Louis, MO, 63110, USA
| | - Ishan Deshpande
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Madison Jewell
- Department of Anatomy, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Elissa A Fink
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Ivan S Kondratov
- Enamine Ltd., Kyiv, Ukraine
- V.P. Kukhar Institute of Bioorganic Chemistry and Petrochemistry, National Academy of Sciences of Ukraine, Kyiv, Ukraine
| | - Yurii S Moroz
- Chemspace LLC, Kyiv, Ukraine
- National Taras Shevchenko University of Kyiv, Kyiv, Ukraine
| | - John J Irwin
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Allan I Basbaum
- Department of Anatomy, University of California San Francisco, San Francisco, CA, 94158, USA.
| | - Bryan L Roth
- Department of Pharmacology, University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, NC, 27514, USA.
- National Institute of Mental Health Psychoactive Drug Screening Program, University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, NC, 27514, USA.
- Division of Chemical Biology and Medicinal Chemistry, University of North Carolina at Chapel Hill Eshelman School of Pharmacy, Chapel Hill, NC, 27514, USA.
| | - Brian K Shoichet
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94158, USA.
| |
Collapse
|
8
|
Luginina A, Gusach A, Lyapina E, Khorn P, Safronova N, Shevtsov M, Dmitirieva D, Dashevskii D, Kotova T, Smirnova E, Borshchevskiy V, Cherezov V, Mishin A. Structural diversity of leukotriene G-protein coupled receptors. J Biol Chem 2023; 299:105247. [PMID: 37703990 PMCID: PMC10570957 DOI: 10.1016/j.jbc.2023.105247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 09/01/2023] [Accepted: 09/06/2023] [Indexed: 09/15/2023] Open
Abstract
Dihydroxy acid leukotriene (LTB4) and cysteinyl leukotrienes (LTC4, LTD4, and LTE4) are inflammatory mediators derived from arachidonic acid via the 5-lipoxygenase pathway. While structurally similar, these two types of leukotrienes (LTs) exert their functions through interactions with two distinct G protein-coupled receptor (GPCR) families, BLT and CysLT receptors, which share low sequence similarity and belong to phylogenetically divergent GPCR groups. Selective antagonism of LT receptors has been proposed as a promising strategy for the treatment of many inflammation-related diseases including asthma and chronic obstructive pulmonary disease, rheumatoid arthritis, cystic fibrosis, diabetes, and several types of cancer. Selective CysLT1R antagonists are currently used as antiasthmatic drugs, however, there are no approved drugs targeting CysLT2 and BLT receptors. In this review, we highlight recently published structures of BLT1R and CysLTRs revealing unique structural features of the two receptor families. X-ray and cryo-EM data shed light on their overall conformations, differences in functional motifs involved in receptor activation, and details of the ligand-binding pockets. An unexpected binding mode of the selective antagonist BIIL260 in the BLT1R structure makes it the first example of a compound targeting the sodium-binding site of GPCRs and suggests a novel strategy for the receptor activity modulation. Taken together, these recent structural data reveal dramatic differences in the molecular architecture of the two LT receptor families and pave the way to new therapeutic strategies of selective targeting individual receptors with novel tool compounds obtained by the structure-based drug design approach.
Collapse
Affiliation(s)
- Aleksandra Luginina
- Research Сenter for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| | - Anastasiia Gusach
- Research Сenter for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| | - Elizaveta Lyapina
- Research Сenter for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| | - Polina Khorn
- Research Сenter for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| | - Nadezda Safronova
- Research Сenter for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| | - Mikhail Shevtsov
- Research Сenter for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| | - Daria Dmitirieva
- Research Сenter for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| | - Dmitrii Dashevskii
- Research Сenter for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| | - Tatiana Kotova
- Research Сenter for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| | - Ekaterina Smirnova
- Research Сenter for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| | - Valentin Borshchevskiy
- Research Сenter for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia; Joint Institute for Nuclear Research, Dubna, Russia
| | - Vadim Cherezov
- Bridge Institute, Department of Chemistry, University of Southern California, Los Angeles, California, USA.
| | - Alexey Mishin
- Research Сenter for Molecular Mechanisms of Aging and Age-related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia.
| |
Collapse
|
9
|
Alnammi M, Liu S, Ericksen SS, Ananiev GE, Voter AF, Guo S, Keck JL, Hoffmann FM, Wildman SA, Gitter A. Evaluating Scalable Supervised Learning for Synthesize-on-Demand Chemical Libraries. J Chem Inf Model 2023; 63:5513-5528. [PMID: 37625010 PMCID: PMC10538940 DOI: 10.1021/acs.jcim.3c00912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Indexed: 08/27/2023]
Abstract
Traditional small-molecule drug discovery is a time-consuming and costly endeavor. High-throughput chemical screening can only assess a tiny fraction of drug-like chemical space. The strong predictive power of modern machine-learning methods for virtual chemical screening enables training models on known active and inactive compounds and extrapolating to much larger chemical libraries. However, there has been limited experimental validation of these methods in practical applications on large commercially available or synthesize-on-demand chemical libraries. Through a prospective evaluation with the bacterial protein-protein interaction PriA-SSB, we demonstrate that ligand-based virtual screening can identify many active compounds in large commercial libraries. We use cross-validation to compare different types of supervised learning models and select a random forest (RF) classifier as the best model for this target. When predicting the activity of more than 8 million compounds from Aldrich Market Select, the RF substantially outperforms a naïve baseline based on chemical structure similarity. 48% of the RF's 701 selected compounds are active. The RF model easily scales to score one billion compounds from the synthesize-on-demand Enamine REAL database. We tested 68 chemically diverse top predictions from Enamine REAL and observed 31 hits (46%), including one with an IC50 value of 1.3 μM.
Collapse
Affiliation(s)
- Moayad Alnammi
- Department
of Computer Sciences, University of Wisconsin−Madison, Madison, Wisconsin 53706, United States
- Morgridge
Institute for Research, Madison, Wisconsin 53715, United States
- Department
of Information and Computer Science, King
Fahd University of Petroleum & Minerals, Dhahran 31261, Saudi Arabia
| | - Shengchao Liu
- Department
of Computer Sciences, University of Wisconsin−Madison, Madison, Wisconsin 53706, United States
- Morgridge
Institute for Research, Madison, Wisconsin 53715, United States
| | - Spencer S. Ericksen
- Small
Molecule Screening Facility, University
of Wisconsin−Madison, Madison, Wisconsin 53792, United States
| | - Gene E. Ananiev
- Small
Molecule Screening Facility, University
of Wisconsin−Madison, Madison, Wisconsin 53792, United States
| | - Andrew F. Voter
- Department
of Biomolecular Chemistry, University of
Wisconsin−Madison, Madison, Wisconsin 53706, United States
| | - Song Guo
- Small
Molecule Screening Facility, University
of Wisconsin−Madison, Madison, Wisconsin 53792, United States
| | - James L. Keck
- Department
of Biomolecular Chemistry, University of
Wisconsin−Madison, Madison, Wisconsin 53706, United States
| | - F. Michael Hoffmann
- Small
Molecule Screening Facility, University
of Wisconsin−Madison, Madison, Wisconsin 53792, United States
- McArdle Laboratory
for Cancer Research, University of Wisconsin−Madison, Madison, Wisconsin 53705, United States
| | - Scott A. Wildman
- Small
Molecule Screening Facility, University
of Wisconsin−Madison, Madison, Wisconsin 53792, United States
| | - Anthony Gitter
- Department
of Computer Sciences, University of Wisconsin−Madison, Madison, Wisconsin 53706, United States
- Morgridge
Institute for Research, Madison, Wisconsin 53715, United States
- Department
of Biostatistics and Medical Informatics, University of Wisconsin−Madison, Madison, Wisconsin 53792, United States
| |
Collapse
|
10
|
Singh I, Seth A, Billesbølle CB, Braz J, Rodriguiz RM, Roy K, Bekele B, Craik V, Huang XP, Boytsov D, Pogorelov VM, Lak P, O'Donnell H, Sandtner W, Irwin JJ, Roth BL, Basbaum AI, Wetsel WC, Manglik A, Shoichet BK, Rudnick G. Structure-based discovery of conformationally selective inhibitors of the serotonin transporter. Cell 2023; 186:2160-2175.e17. [PMID: 37137306 PMCID: PMC10306110 DOI: 10.1016/j.cell.2023.04.010] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 02/05/2023] [Accepted: 04/06/2023] [Indexed: 05/05/2023]
Abstract
The serotonin transporter (SERT) removes synaptic serotonin and is the target of anti-depressant drugs. SERT adopts three conformations: outward-open, occluded, and inward-open. All known inhibitors target the outward-open state except ibogaine, which has unusual anti-depressant and substance-withdrawal effects, and stabilizes the inward-open conformation. Unfortunately, ibogaine's promiscuity and cardiotoxicity limit the understanding of inward-open state ligands. We docked over 200 million small molecules against the inward-open state of the SERT. Thirty-six top-ranking compounds were synthesized, and thirteen inhibited; further structure-based optimization led to the selection of two potent (low nanomolar) inhibitors. These stabilized an outward-closed state of the SERT with little activity against common off-targets. A cryo-EM structure of one of these bound to the SERT confirmed the predicted geometry. In mouse behavioral assays, both compounds had anxiolytic- and anti-depressant-like activity, with potencies up to 200-fold better than fluoxetine (Prozac), and one substantially reversed morphine withdrawal effects.
Collapse
Affiliation(s)
- Isha Singh
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 1700 4th St., Byers Hall Suite 508D, San Francisco, CA 94143, USA
| | - Anubha Seth
- Department of Pharmacology, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 06520-8066, USA
| | - Christian B Billesbølle
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 1700 4th St., Byers Hall Suite 508D, San Francisco, CA 94143, USA
| | - Joao Braz
- Department of Anatomy, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Ramona M Rodriguiz
- Department of Psychiatry and Behavioral Sciences, Duke University Medical Center, Durham, NC 27710, USA; Mouse Behavioral and Neuroendocrine Analysis Core Facility, Duke University Medical Center, Durham, NC 27710, USA
| | - Kasturi Roy
- Department of Pharmacology, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 06520-8066, USA
| | - Bethlehem Bekele
- Department of Pharmacology, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 06520-8066, USA
| | - Veronica Craik
- Department of Anatomy, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Xi-Ping Huang
- Department of Pharmacology, NIMH Psychoactive Drug Screening Program, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Danila Boytsov
- Institute of Pharmacology, Center for Physiology and Pharmacology, Medical University of Vienna, 1090 Vienna, Austria
| | - Vladimir M Pogorelov
- Department of Psychiatry and Behavioral Sciences, Duke University Medical Center, Durham, NC 27710, USA
| | - Parnian Lak
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 1700 4th St., Byers Hall Suite 508D, San Francisco, CA 94143, USA
| | - Henry O'Donnell
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 1700 4th St., Byers Hall Suite 508D, San Francisco, CA 94143, USA
| | - Walter Sandtner
- Institute of Pharmacology, Center for Physiology and Pharmacology, Medical University of Vienna, 1090 Vienna, Austria
| | - John J Irwin
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 1700 4th St., Byers Hall Suite 508D, San Francisco, CA 94143, USA
| | - Bryan L Roth
- Department of Pharmacology, NIMH Psychoactive Drug Screening Program, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Allan I Basbaum
- Department of Anatomy, University of California, San Francisco, San Francisco, CA 94143, USA.
| | - William C Wetsel
- Department of Psychiatry and Behavioral Sciences, Duke University Medical Center, Durham, NC 27710, USA; Mouse Behavioral and Neuroendocrine Analysis Core Facility, Duke University Medical Center, Durham, NC 27710, USA; Departments of Cell Biology and Neurobiology, Duke University Medical Center, Durham, NC 27710, USA.
| | - Aashish Manglik
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 1700 4th St., Byers Hall Suite 508D, San Francisco, CA 94143, USA; Department of Anesthesia and Perioperative Care, University of California, San Francisco, San Francisco, CA 94115, USA.
| | - Brian K Shoichet
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 1700 4th St., Byers Hall Suite 508D, San Francisco, CA 94143, USA.
| | - Gary Rudnick
- Department of Pharmacology, Yale University School of Medicine, 333 Cedar Street, New Haven, CT 06520-8066, USA.
| |
Collapse
|
11
|
Cavasotto CN, Di Filippo JI. The Impact of Supervised Learning Methods in Ultralarge High-Throughput Docking. J Chem Inf Model 2023; 63:2267-2280. [PMID: 37036491 DOI: 10.1021/acs.jcim.2c01471] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/11/2023]
Abstract
Structure-based virtual screening methods are, nowadays, one of the key pillars of computational drug discovery. In recent years, a series of studies have reported docking-based virtual screening campaigns of large databases ranging from hundreds to thousands of millions compounds, further identifying novel hits after experimental validation. As these larg-scale efforts are not generally accessible, machine learning-based protocols have emerged to accelerate the identification of virtual hits within an ultralarge chemical space, reaching impressive reductions in computational time. Herein, we illustrate the motivation and the problem behind the screening of large databases, providing an overview of key concepts and essential applications of machine learning-accelerated protocols, specifically concerning supervised learning methods. We also discuss where the field stands with these novel developments, highlighting possible insights for future studies.
Collapse
Affiliation(s)
- Claudio N Cavasotto
- Computational Drug Design and Biomedical Informatics Laboratory, Instituto de Investigaciones en Medicina Traslacional (IIMT), CONICET-Universidad Austral, Av. Juan Domingo Perón 1500, B1629AHJ Pilar, Argentina
- Facultad de Ciencias Biomédicas, and Facultad de Ingeniería, Universidad Austral, Av. Juan Domingo Perón 1500, B1629AHJ Pilar, Argentina
- Austral Institute for Applied Artificial Intelligence, Universidad Austral, Av. Juan Domingo Perón 1500, B1629AHJ Pilar, Argentina
| | - Juan I Di Filippo
- Computational Drug Design and Biomedical Informatics Laboratory, Instituto de Investigaciones en Medicina Traslacional (IIMT), CONICET-Universidad Austral, Av. Juan Domingo Perón 1500, B1629AHJ Pilar, Argentina
- Facultad de Ciencias Biomédicas, and Facultad de Ingeniería, Universidad Austral, Av. Juan Domingo Perón 1500, B1629AHJ Pilar, Argentina
- Austral Institute for Applied Artificial Intelligence, Universidad Austral, Av. Juan Domingo Perón 1500, B1629AHJ Pilar, Argentina
| |
Collapse
|
12
|
Sala D, Batebi H, Ledwitch K, Hildebrand PW, Meiler J. Targeting in silico GPCR conformations with ultra-large library screening for hit discovery. Trends Pharmacol Sci 2023; 44:150-161. [PMID: 36669974 PMCID: PMC9974811 DOI: 10.1016/j.tips.2022.12.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 12/23/2022] [Accepted: 12/27/2022] [Indexed: 01/20/2023]
Abstract
The use of deep machine learning (ML) in protein structure prediction has made it possible to easily access a large number of annotated conformations that can potentially compensate for missing experimental structures in structure-based drug discovery (SBDD). However, it is still unclear whether the accuracy of these predicted conformations is sufficient for screening chemical compounds that will effectively interact with a protein target for pharmacological purposes. In this opinion article, we examine the potential benefits and limitations of using state-annotated conformations for ultra-large library screening (ULLS) in light of the growing size of ultra-large libraries (ULLs). We believe that targeting different conformational states of common drug targets like G-protein-coupled receptors (GPCRs), which can regulate human physiology by switching between different conformations, can offer multiple advantages.
Collapse
Affiliation(s)
- D Sala
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany
| | - H Batebi
- Institute of Medical Physics and Biophysics, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany
| | - K Ledwitch
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37240, USA; Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA
| | - P W Hildebrand
- Institute of Medical Physics and Biophysics, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany
| | - J Meiler
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany; Center for Structural Biology, Vanderbilt University, Nashville, TN 37240, USA; Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA.
| |
Collapse
|
13
|
Perebyinis M, Rognan D. Overlap of On-demand Ultra-large Combinatorial Spaces with On-the-shelf Drug-like Libraries. Mol Inform 2023; 42:e2200163. [PMID: 36072995 DOI: 10.1002/minf.202200163] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 09/07/2022] [Indexed: 01/12/2023]
Abstract
On-demand combinatorial spaces are shifting paradigms in early drug discovery, by considerably increasing the searchable chemical space to several billions of compounds while securing their synthetic accessibility. We here systematically compared the on-the-shelf available drug-like chemical space (9 million compounds) to three on-demand ultra-large (ODUL) combinatorial fragment spaces (REAL, CHEMriya, GalaXi) covering 32 billion of readily accessible molecules. Surprisingly, only one space (REAL) intersects almost entirely the currently available drug-like space, suggesting that it is the only ODUL widely suitable for in-stock hit expansion. Of course, expanding a preliminary ODUL hit in the same chemical space is the best possible strategy to rapidly generate structure-activity relationships. All three spaces remain well suited to early hit finding initiatives since they all provide numerous unique scaffolds that are not described by on-the shelf collections.
Collapse
Affiliation(s)
- Mariana Perebyinis
- Laboratoire d'Innovation Thérapeutique, UMR7200 CNRS-Université de Strasbourg, 74 route du Rhin, F-67400, Illkirch, France
| | - Didier Rognan
- Laboratoire d'Innovation Thérapeutique, UMR7200 CNRS-Université de Strasbourg, 74 route du Rhin, F-67400, Illkirch, France
| |
Collapse
|
14
|
He J, Li C, Hu W, Li C, Liu S, Sui J, Zhang T, Sun Q, Luo Y. Identification of selective mtbDHFR inhibitors by virtual screening and experimental approaches. Chem Biol Drug Des 2022; 100:1005-1016. [PMID: 34981654 DOI: 10.1111/cbdd.14018] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 11/26/2021] [Accepted: 12/18/2021] [Indexed: 02/05/2023]
Abstract
mtbDHFR-targeting inhibition has become a promising approach for tuberculosis treatment. In the current research, a multi-step virtual screening effort toward ZINC and MCE databases was devoted to discover novel mtbDHFR inhibitors. Based on binding affinity of small molecules through molecular docking study in AutoDock Vina, the number of compounds was reduced to 952,688. Further, these compounds were employed by a step-by-step multiple docking programs of Schrödinger suite and filtered by pharmacokinetics and PAINS parameters. Finally, nine ZINC compounds and 400 MCE compounds were obtained. These compounds of binding ability were tested with mtbDHFR by FluoPol-ABPP approach established in this work. Finally, AF-353 compound was found to have strong binding effect to mtbDHFR. AF-353 was further tested for mtb and hDHFR enzymatic activities, and it was proved to possess 50-fold selectivity toward mtbDHFR over hDHFR. In silico MD simulation results supported this selectivity.
Collapse
Affiliation(s)
- Juan He
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, West China Medical School, Sichuan University, Chengdu, China
| | - Cong Li
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, West China Medical School, Sichuan University, Chengdu, China
| | - Wei Hu
- Chengdu FenDi pharmaceutical Co., Ltd., Chengdu, China
| | - Chungen Li
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, West China Medical School, Sichuan University, Chengdu, China
| | - Song Liu
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, West China Medical School, Sichuan University, Chengdu, China
| | - Jing Sui
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, West China Medical School, Sichuan University, Chengdu, China
| | - Tianyu Zhang
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health (GIBH), Chinese Academy of Sciences (CAS), Guangzhou, China
| | - Qingxiang Sun
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, West China Medical School, Sichuan University, Chengdu, China
| | - Youfu Luo
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, West China Medical School, Sichuan University, Chengdu, China
| |
Collapse
|
15
|
Ballante F, Kooistra AJ, Kampen S, de Graaf C, Carlsson J. Structure-Based Virtual Screening for Ligands of G Protein-Coupled Receptors: What Can Molecular Docking Do for You? Pharmacol Rev 2021; 73:527-565. [PMID: 34907092 DOI: 10.1124/pharmrev.120.000246] [Citation(s) in RCA: 61] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
G protein-coupled receptors (GPCRs) constitute the largest family of membrane proteins in the human genome and are important therapeutic targets. During the last decade, the number of atomic-resolution structures of GPCRs has increased rapidly, providing insights into drug binding at the molecular level. These breakthroughs have created excitement regarding the potential of using structural information in ligand design and initiated a new era of rational drug discovery for GPCRs. The molecular docking method is now widely applied to model the three-dimensional structures of GPCR-ligand complexes and screen for chemical probes in large compound libraries. In this review article, we first summarize the current structural coverage of the GPCR superfamily and the understanding of receptor-ligand interactions at atomic resolution. We then present the general workflow of structure-based virtual screening and strategies to discover GPCR ligands in chemical libraries. We assess the state of the art of this research field by summarizing prospective applications of virtual screening based on experimental structures. Strategies to identify compounds with specific efficacy and selectivity profiles are discussed, illustrating the opportunities and limitations of the molecular docking method. Our overview shows that structure-based virtual screening can discover novel leads and will be essential in pursuing the next generation of GPCR drugs. SIGNIFICANCE STATEMENT: Extraordinary advances in the structural biology of G protein-coupled receptors have revealed the molecular details of ligand recognition by this large family of therapeutic targets, providing novel avenues for rational drug design. Structure-based docking is an efficient computational approach to identify novel chemical probes from large compound libraries, which has the potential to accelerate the development of drug candidates.
Collapse
Affiliation(s)
- Flavio Ballante
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden (F.B., S.K., J.C.); Department of Drug Design and Pharmacology, University of Copenhagen, Copenhagen, Denmark (A.J.K.); and Sosei Heptares, Steinmetz Building, Granta Park, Great Abington, Cambridge, United Kingdom (C.d.G.)
| | - Albert J Kooistra
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden (F.B., S.K., J.C.); Department of Drug Design and Pharmacology, University of Copenhagen, Copenhagen, Denmark (A.J.K.); and Sosei Heptares, Steinmetz Building, Granta Park, Great Abington, Cambridge, United Kingdom (C.d.G.)
| | - Stefanie Kampen
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden (F.B., S.K., J.C.); Department of Drug Design and Pharmacology, University of Copenhagen, Copenhagen, Denmark (A.J.K.); and Sosei Heptares, Steinmetz Building, Granta Park, Great Abington, Cambridge, United Kingdom (C.d.G.)
| | - Chris de Graaf
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden (F.B., S.K., J.C.); Department of Drug Design and Pharmacology, University of Copenhagen, Copenhagen, Denmark (A.J.K.); and Sosei Heptares, Steinmetz Building, Granta Park, Great Abington, Cambridge, United Kingdom (C.d.G.)
| | - Jens Carlsson
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden (F.B., S.K., J.C.); Department of Drug Design and Pharmacology, University of Copenhagen, Copenhagen, Denmark (A.J.K.); and Sosei Heptares, Steinmetz Building, Granta Park, Great Abington, Cambridge, United Kingdom (C.d.G.)
| |
Collapse
|
16
|
Yang Y, Yao K, Repasky MP, Leswing K, Abel R, Shoichet BK, Jerome SV. Efficient Exploration of Chemical Space with Docking and Deep Learning. J Chem Theory Comput 2021; 17:7106-7119. [PMID: 34592101 DOI: 10.1021/acs.jctc.1c00810] [Citation(s) in RCA: 77] [Impact Index Per Article: 25.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
With the advent of make-on-demand commercial libraries, the number of purchasable compounds available for virtual screening and assay has grown explosively in recent years, with several libraries eclipsing one billion compounds. Today's screening libraries are larger and more diverse, enabling the discovery of more-potent hit compounds and unlocking new areas of chemical space, represented by new core scaffolds. Applying physics-based in silico screening methods in an exhaustive manner, where every molecule in the library must be enumerated and evaluated independently, is increasingly cost-prohibitive. Here, we introduce a protocol for machine learning-enhanced molecular docking based on active learning to dramatically increase throughput over traditional docking. We leverage a novel selection protocol that strikes a balance between two objectives: (1) identifying the best scoring compounds and (2) exploring a large region of chemical space, demonstrating superior performance compared to a purely greedy approach. Together with automated redocking of the top compounds, this method captures almost all the high scoring scaffolds in the library found by exhaustive docking. This protocol is applied to our recent virtual screening campaigns against the D4 and AMPC targets that produced dozens of highly potent, novel inhibitors, and a blind test against the MT1 target. Our protocol recovers more than 80% of the experimentally confirmed hits with a 14-fold reduction in compute cost, and more than 90% of the hit scaffolds in the top 5% of model predictions, preserving the diversity of the experimentally confirmed hit compounds.
Collapse
Affiliation(s)
- Ying Yang
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California 94158, United States
| | - Kun Yao
- Schrödinger, Inc., 120 West 45th Street, 17th Floor, New York, New York 10036, United States
| | - Matthew P Repasky
- Schrödinger, Inc., 101 SW Main Street, #1300, Portland, Oregon 97239, United States
| | - Karl Leswing
- Schrödinger, Inc., 120 West 45th Street, 17th Floor, New York, New York 10036, United States
| | - Robert Abel
- Schrödinger, Inc., 120 West 45th Street, 17th Floor, New York, New York 10036, United States
| | - Brian K Shoichet
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California 94158, United States
| | - Steven V Jerome
- Schrödinger, Inc., 10201 Wateridge Cir Suite 220, San Diego, California 92121, United States
| |
Collapse
|