1
|
Tang X, Dai H, Knight E, Wu F, Li Y, Li T, Gerstein M. A survey of generative AI for de novo drug design: new frontiers in molecule and protein generation. Brief Bioinform 2024; 25:bbae338. [PMID: 39007594 DOI: 10.1093/bib/bbae338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Revised: 05/21/2024] [Accepted: 06/27/2024] [Indexed: 07/16/2024] Open
Abstract
Artificial intelligence (AI)-driven methods can vastly improve the historically costly drug design process, with various generative models already in widespread use. Generative models for de novo drug design, in particular, focus on the creation of novel biological compounds entirely from scratch, representing a promising future direction. Rapid development in the field, combined with the inherent complexity of the drug design process, creates a difficult landscape for new researchers to enter. In this survey, we organize de novo drug design into two overarching themes: small molecule and protein generation. Within each theme, we identify a variety of subtasks and applications, highlighting important datasets, benchmarks, and model architectures and comparing the performance of top models. We take a broad approach to AI-driven drug design, allowing for both micro-level comparisons of various methods within each subtask and macro-level observations across different fields. We discuss parallel challenges and approaches between the two applications and highlight future directions for AI-driven de novo drug design as a whole. An organized repository of all covered sources is available at https://github.com/gersteinlab/GenAI4Drug.
Collapse
Affiliation(s)
- Xiangru Tang
- Department of Computer Science, Yale University, New Haven, CT 06520, United States
| | - Howard Dai
- Department of Computer Science, Yale University, New Haven, CT 06520, United States
| | - Elizabeth Knight
- School of Medicine, Yale University, New Haven, CT 06520, United States
| | - Fang Wu
- Computer Science Department, Stanford University, CA 94305, United States
| | - Yunyang Li
- Department of Computer Science, Yale University, New Haven, CT 06520, United States
| | - Tianxiao Li
- Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT 06520, United States
| | - Mark Gerstein
- Department of Computer Science, Yale University, New Haven, CT 06520, United States
- Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT 06520, United States
- Department of Statistics & Data Science, Yale University, New Haven, CT 06520, United States
- Department of Biomedical Informatics & Data Science, Yale University, New Haven, CT 06520, United States
- Department of Molecular Biophysics & Biochemistry, Yale University, New Haven, CT 06520, United States
| |
Collapse
|
2
|
Atz K, Cotos L, Isert C, Håkansson M, Focht D, Hilleke M, Nippa DF, Iff M, Ledergerber J, Schiebroek CCG, Romeo V, Hiss JA, Merk D, Schneider P, Kuhn B, Grether U, Schneider G. Prospective de novo drug design with deep interactome learning. Nat Commun 2024; 15:3408. [PMID: 38649351 PMCID: PMC11035696 DOI: 10.1038/s41467-024-47613-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 04/02/2024] [Indexed: 04/25/2024] Open
Abstract
De novo drug design aims to generate molecules from scratch that possess specific chemical and pharmacological properties. We present a computational approach utilizing interactome-based deep learning for ligand- and structure-based generation of drug-like molecules. This method capitalizes on the unique strengths of both graph neural networks and chemical language models, offering an alternative to the need for application-specific reinforcement, transfer, or few-shot learning. It enables the "zero-shot" construction of compound libraries tailored to possess specific bioactivity, synthesizability, and structural novelty. In order to proactively evaluate the deep interactome learning framework for protein structure-based drug design, potential new ligands targeting the binding site of the human peroxisome proliferator-activated receptor (PPAR) subtype gamma are generated. The top-ranking designs are chemically synthesized and computationally, biophysically, and biochemically characterized. Potent PPAR partial agonists are identified, demonstrating favorable activity and the desired selectivity profiles for both nuclear receptors and off-target interactions. Crystal structure determination of the ligand-receptor complex confirms the anticipated binding mode. This successful outcome positively advocates interactome-based de novo design for application in bioorganic and medicinal chemistry, enabling the creation of innovative bioactive molecules.
Collapse
Affiliation(s)
- Kenneth Atz
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
| | - Leandro Cotos
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
| | - Clemens Isert
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
| | - Maria Håkansson
- SARomics Biostructures AB, Medicon Village, SE-223 81, Lund, Sweden
| | - Dorota Focht
- SARomics Biostructures AB, Medicon Village, SE-223 81, Lund, Sweden
| | - Mattis Hilleke
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
| | - David F Nippa
- Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Grenzacherstrasse 124, CH-4070, Basel, Switzerland
- Department of Pharmacy, Ludwig-Maximilians-Universität München, Butenandtstrasse 5, 81377, Munich, Germany
| | - Michael Iff
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
| | - Jann Ledergerber
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
| | - Carl C G Schiebroek
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
| | - Valentina Romeo
- Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Grenzacherstrasse 124, CH-4070, Basel, Switzerland
| | - Jan A Hiss
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
| | - Daniel Merk
- Department of Pharmacy, Ludwig-Maximilians-Universität München, Butenandtstrasse 5, 81377, Munich, Germany
| | - Petra Schneider
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
| | - Bernd Kuhn
- Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Grenzacherstrasse 124, CH-4070, Basel, Switzerland
| | - Uwe Grether
- Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Grenzacherstrasse 124, CH-4070, Basel, Switzerland
| | - Gisbert Schneider
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland.
| |
Collapse
|
3
|
Wang M, Wu Z, Wang J, Weng G, Kang Y, Pan P, Li D, Deng Y, Yao X, Bing Z, Hsieh CY, Hou T. Genetic Algorithm-Based Receptor Ligand: A Genetic Algorithm-Guided Generative Model to Boost the Novelty and Drug-Likeness of Molecules in a Sampling Chemical Space. J Chem Inf Model 2024; 64:1213-1228. [PMID: 38302422 DOI: 10.1021/acs.jcim.3c01964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2024]
Abstract
Deep learning-based de novo molecular design has recently gained significant attention. While numerous DL-based generative models have been successfully developed for designing novel compounds, the majority of the generated molecules lack sufficiently novel scaffolds or high drug-like profiles. The aforementioned issues may not be fully captured by commonly used metrics for the assessment of molecular generative models, such as novelty, diversity, and quantitative estimation of the drug-likeness score. To address these limitations, we proposed a genetic algorithm-guided generative model called GARel (genetic algorithm-based receptor-ligand interaction generator), a novel framework for training a DL-based generative model to produce drug-like molecules with novel scaffolds. To efficiently train the GARel model, we utilized dense net to update the parameters based on molecules with novel scaffolds and drug-like features. To demonstrate the capability of the GARel model, we used it to design inhibitors for three targets: AA2AR, EGFR, and SARS-Cov2. The results indicate that GARel-generated molecules feature more diverse and novel scaffolds and possess more desirable physicochemical properties and favorable docking scores. Compared with other generative models, GARel makes significant progress in balancing novelty and drug-likeness, providing a promising direction for the further development of DL-based de novo design methodology with potential impacts on drug discovery.
Collapse
Affiliation(s)
- Mingyang Wang
- College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang ,China
- CarbonSilicon AI Technology Co., Ltd., Hangzhou 310018, Zhejiang ,China
| | - Zhengjian Wu
- College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang ,China
- School of Computer Science, Wuhan University, Wuhan 430072, Hubei ,China
| | - Jike Wang
- College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang ,China
- CarbonSilicon AI Technology Co., Ltd., Hangzhou 310018, Zhejiang ,China
| | - Gaoqi Weng
- College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang ,China
| | - Yu Kang
- College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang ,China
| | - Peichen Pan
- College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang ,China
| | - Dan Li
- College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang ,China
| | - Yafeng Deng
- CarbonSilicon AI Technology Co., Ltd., Hangzhou 310018, Zhejiang ,China
| | - Xiaojun Yao
- Dr. Neher's Biophysics Laboratory for Innovative Drug Discovery Macau Institute for Applied Research in Medicine and Health State Key Laboratory of Quality Research in Chinese Medicine, Macau University of Science and Technology, Taipa, Macau 999078, China
| | - Zhitong Bing
- Institute of Modern Physics, Chinese Academy of Sciences, Lanzhou, Gansu 730000, China
| | - Chang-Yu Hsieh
- College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang ,China
| | - Tingjun Hou
- College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang ,China
| |
Collapse
|
4
|
Huang CH, Lin ST. MARS Plus: An Improved Molecular Design Tool for Complex Compounds Involving Ionic, Stereo, and Cis-Trans Isomeric Structures. J Chem Inf Model 2023; 63:7711-7728. [PMID: 38100117 DOI: 10.1021/acs.jcim.3c01745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2023]
Abstract
MARS (Molecular Assembling and Representation Suite) (Hsu et al. J. Chem. Inf. Model. 2019, 59, 3703-3713) is a toolbox for the molecular design of organic molecules. MARS uses integer arrays to represent the elements and connectivity between elements of a molecule. It provides a collection of operations to manipulate the elemental composition and connectivity of a molecule (or a pair of molecules), enabling the creation of novel chemical compounds. In this work, the original MARS is extended to handle complex molecular structures, including geometric (cis-trans) isomers, stereo isomers, cyclic compounds, and ionic species. The extended version of MARS, referred to as MARS+, has a more comprehensive coverage of the chemical space and therefore can explore molecules with a greater chemical and physical diversity. Compared to other molecular design tools, MARS+ is designed to perform all possible manipulations on a given molecule or a pair of molecules. Molecular structure manipulation can be conducted in either a controlled or a random fashion. Furthermore, every structure manipulation has a counterpart so that the operation can be reversed. Nearly any possible chemical structure can be generated with MARS+ via a combination of molecular operations. The capabilities of MARS+ are examined by the design of new ionic liquids (ILs). The results show that MARS+ is a useful tool for computer-aided molecular design (CAMD) and molecular structure enumeration.
Collapse
Affiliation(s)
- Chen-Hsuan Huang
- Department of Chemical Engineering, National Taiwan University, Taipei 10617, Taiwan
| | - Shiang-Tai Lin
- Department of Chemical Engineering, National Taiwan University, Taipei 10617, Taiwan
| |
Collapse
|
5
|
Wang M, Wang J, Weng G, Kang Y, Pan P, Li D, Deng Y, Li H, Hsieh CY, Hou T. ReMODE: a deep learning-based web server for target-specific drug design. J Cheminform 2022; 14:84. [PMID: 36510307 PMCID: PMC9743675 DOI: 10.1186/s13321-022-00665-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 12/01/2022] [Indexed: 12/14/2022] Open
Abstract
Deep learning (DL) and machine learning contribute significantly to basic biology research and drug discovery in the past few decades. Recent advances in DL-based generative models have led to superior developments in de novo drug design. However, data availability, deep data processing, and the lack of user-friendly DL tools and interfaces make it difficult to apply these DL techniques to drug design. We hereby present ReMODE (Receptor-based MOlecular DEsign), a new web server based on DL algorithm for target-specific ligand design, which integrates different functional modules to enable users to develop customizable drug design tasks. As designed, the ReMODE sever can construct the target-specific tasks toward the protein targets selected by users. Meanwhile, the server also provides some extensions: users can optimize the drug-likeness or synthetic accessibility of the generated molecules, and control other physicochemical properties; users can also choose a sub-structure/scaffold as a starting point for fragment-based drug design. The ReMODE server also enables users to optimize the pharmacophore matching and docking conformations of the generated molecules. We believe that the ReMODE server will benefit researchers for drug discovery. ReMODE is publicly available at http://cadd.zju.edu.cn/relation/remode/ .
Collapse
Affiliation(s)
- Mingyang Wang
- grid.13402.340000 0004 1759 700XInnovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou, 310058 Zhejiang People’s Republic of China ,CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018 Zhejiang People’s Republic of China
| | - Jike Wang
- grid.13402.340000 0004 1759 700XInnovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou, 310058 Zhejiang People’s Republic of China ,CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018 Zhejiang People’s Republic of China
| | - Gaoqi Weng
- grid.13402.340000 0004 1759 700XInnovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou, 310058 Zhejiang People’s Republic of China ,CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018 Zhejiang People’s Republic of China
| | - Yu Kang
- grid.13402.340000 0004 1759 700XInnovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou, 310058 Zhejiang People’s Republic of China
| | - Peichen Pan
- grid.13402.340000 0004 1759 700XInnovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou, 310058 Zhejiang People’s Republic of China
| | - Dan Li
- grid.13402.340000 0004 1759 700XInnovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou, 310058 Zhejiang People’s Republic of China
| | - Yafeng Deng
- CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018 Zhejiang People’s Republic of China
| | - Honglin Li
- grid.28056.390000 0001 2163 4895Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science & Technology, Shanghai, 200237 People’s Republic of China
| | - Chang-Yu Hsieh
- grid.13402.340000 0004 1759 700XInnovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou, 310058 Zhejiang People’s Republic of China
| | - Tingjun Hou
- grid.13402.340000 0004 1759 700XInnovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou, 310058 Zhejiang People’s Republic of China
| |
Collapse
|
6
|
Kulczyk S, Koszytkowska-Stawińska M. Novel drug design framework as a response to neglected and emerging diseases. J Biomol Struct Dyn 2022:1-12. [DOI: 10.1080/07391102.2022.2110519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
Affiliation(s)
- Stanisław Kulczyk
- Faculty of Chemistry, Warsaw University of Technology, Warsaw, Poland
| | | |
Collapse
|
7
|
Menon D, Ranganathan R. A Generative Approach to Materials Discovery, Design, and Optimization. ACS OMEGA 2022; 7:25958-25973. [PMID: 35936396 PMCID: PMC9352221 DOI: 10.1021/acsomega.2c03264] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 07/11/2022] [Indexed: 05/25/2023]
Abstract
Despite its potential to transform society, materials research suffers from a major drawback: its long research timeline. Recently, machine-learning techniques have emerged as a viable solution to this drawback and have shown accuracies comparable to other computational techniques like density functional theory (DFT) at a fraction of the computational time. One particular class of machine-learning models, known as "generative models", is of particular interest owing to its ability to approximate high-dimensional probability distribution functions, which in turn can be used to generate novel data such as molecular structures by sampling these approximated probability distribution functions. This review article aims to provide an in-depth understanding of the underlying mathematical principles of popular generative models such as recurrent neural networks, variational autoencoders, and generative adversarial networks and discuss their state-of-the-art applications in the domains of biomaterials and organic drug-like materials, energy materials, and structural materials. Here, we discuss a broad range of applications of these models spanning from the discovery of drugs that treat cancer to finding the first room-temperature superconductor and from the discovery and optimization of battery and photovoltaic materials to the optimization of high-entropy alloys. We conclude by presenting a brief outlook of the major challenges that lie ahead for the mainstream usage of these models for materials research.
Collapse
|
8
|
Wang M, Hsieh CY, Wang J, Wang D, Weng G, Shen C, Yao X, Bing Z, Li H, Cao D, Hou T. RELATION: A Deep Generative Model for Structure-Based De Novo Drug Design. J Med Chem 2022; 65:9478-9492. [PMID: 35713420 DOI: 10.1021/acs.jmedchem.2c00732] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Deep learning (DL)-based de novo molecular design has recently gained considerable traction. Many DL-based generative models have been successfully developed to design novel molecules, but most of them are ligand-centric and the role of the 3D geometries of target binding pockets in molecular generation has not been well-exploited. Here, we proposed a new 3D-based generative model called RELATION. In the RELATION model, the BiTL algorithm was specifically designed to extract and transfer the desired geometric features of the protein-ligand complexes to a latent space for generation. The pharmacophore conditioning and docking-based Bayesian sampling were applied to efficiently navigate the vast chemical space for the design of molecules with desired geometric properties and pharmacophore features. As a proof of concept, the RELATION model was used to design inhibitors for two targets, AKT1 and CDK2. The calculation results demonstrated that the RELATION model could efficiently generate novel molecules with favorable binding affinity and pharmacophore features.
Collapse
Affiliation(s)
- Mingyang Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Chang-Yu Hsieh
- Tencent, Tencent Quantum Lab, Shenzhen 518057, Guangdong, P. R. China
| | - Jike Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Dong Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Gaoqi Weng
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Chao Shen
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Xiaojun Yao
- Dr. Neher's Biophysics Laboratory for Innovative Drug Discovery Macau Institute for Applied Research in Medicine and Health State Key Laboratory of Quality Research in Chinese Medicine, Macau University of Science and Technology, Taipa 999078, Macau, P. R. China
| | - Zhitong Bing
- Institute of Modern Physics, Chinese Academy of Sciences, Lanzhou 730000, P. R. China
| | - Honglin Li
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science & Technology, Shanghai 200237, P. R. China
| | - Dongsheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410013, Hunan, P. R. China
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| |
Collapse
|
9
|
Li B, Chen H. Prediction of Compound Synthesis Accessibility Based on Reaction Knowledge Graph. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27031039. [PMID: 35164303 PMCID: PMC8838603 DOI: 10.3390/molecules27031039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Revised: 01/19/2022] [Accepted: 02/01/2022] [Indexed: 11/16/2022]
Abstract
With the increasing application of deep-learning-based generative models for de novo molecule design, the quantitative estimation of molecular synthetic accessibility (SA) has become a crucial factor for prioritizing the structures generated from generative models. It is also useful for helping in the prioritization of hit/lead compounds and guiding retrosynthesis analysis. In this study, based on the USPTO and Pistachio reaction datasets, a chemical reaction network was constructed for the identification of the shortest reaction paths (SRP) needed to synthesize compounds, and different SRP cut-offs were then used as the threshold to distinguish a organic compound as either an easy-to-synthesize (ES) or hard-to-synthesize (HS) class. Two synthesis accessibility models (DNN-ECFP model and graph-based CMPNN model) were built using deep learning/machine learning algorithms. Compared to other existing synthesis accessibility scoring schemes, such as SYBA, SCScore, and SAScore, our results show that CMPNN (ROC AUC: 0.791) performs better than SYBA (ROC AUC: 0.76), albeit marginally, and outperforms SAScore and SCScore. Our prediction models based on historical reaction knowledge could be a potential tool for estimating molecule SA.
Collapse
Affiliation(s)
- Baiqing Li
- Guangdong Provincial Key Laboratory of Laboratory Animals, Guangdong Laboratory Animals Monitoring Institute, Guangzhou 510663, China;
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou 510530, China
- Bioland Laboratory (Guangzhou Regenerative Medicine and Health-Guangdong Laboratory), Guangzhou 510530, China
| | - Hongming Chen
- Guangdong Provincial Key Laboratory of Laboratory Animals, Guangdong Laboratory Animals Monitoring Institute, Guangzhou 510663, China;
- Bioland Laboratory (Guangzhou Regenerative Medicine and Health-Guangdong Laboratory), Guangzhou 510530, China
- Guangzhou Laboratory, Guangzhou International Bio Island, No. 9 XinDaoHuanBei Road, Guangzhou 510005, China
- Correspondence:
| |
Collapse
|
10
|
Wang M, Sun H, Wang J, Pang J, Chai X, Xu L, Li H, Cao D, Hou T. Comprehensive assessment of deep generative architectures for de novo drug design. Brief Bioinform 2021; 23:6470970. [PMID: 34929743 DOI: 10.1093/bib/bbab544] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 11/24/2021] [Accepted: 11/25/2021] [Indexed: 01/20/2023] Open
Abstract
Recently, deep learning (DL)-based de novo drug design represents a new trend in pharmaceutical research, and numerous DL-based methods have been developed for the generation of novel compounds with desired properties. However, a comprehensive understanding of the advantages and disadvantages of these methods is still lacking. In this study, the performances of different generative models were evaluated by analyzing the properties of the generated molecules in different scenarios, such as goal-directed (rediscovery, optimization and scaffold hopping of active compounds) and target-specific (generation of novel compounds for a given target) tasks. In overall, the DL-based models have significant advantages over the baseline models built by the traditional methods in learning the physicochemical property distributions of the training sets and may be more suitable for target-specific tasks. However, both the baselines and DL-based generative models cannot fully exploit the scaffolds of the training sets, and the molecules generated by the DL-based methods even have lower scaffold diversity than those generated by the traditional models. Moreover, our assessment illustrates that the DL-based methods do not exhibit obvious advantages over the genetic algorithm-based baselines in goal-directed tasks. We believe that our study provides valuable guidance for the effective use of generative models in de novo drug design.
Collapse
Affiliation(s)
- Mingyang Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Huiyong Sun
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| | - Jike Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Jinping Pang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Xin Chai
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Lei Xu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou 213001, Jiangsu, China
| | - Honglin Li
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science & Technology, Shanghai 200237, P. R. China
| | - Dongsheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410013, Hunan, P. R. China
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| |
Collapse
|
11
|
Fragment-to-lead tailored in silico design. DRUG DISCOVERY TODAY. TECHNOLOGIES 2021; 40:44-57. [PMID: 34916022 DOI: 10.1016/j.ddtec.2021.08.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Revised: 06/25/2021] [Accepted: 08/11/2021] [Indexed: 02/07/2023]
Abstract
Fragment-based drug discovery (FBDD) emerged as a disruptive technology and became established during the last two decades. Its rationality and low entry costs make it appealing, and the numerous examples of approved drugs discovered through FBDD validate the approach. However, FBDD still faces numerous challenges. Perhaps the most important one is the transformation of the initial fragment hits into viable leads. Fragment-to-lead (F2L) optimization is resource-intensive and is therefore limited in the possibilities that can be actively pursued. In silico strategies play an important role in F2L, as they can perform a deeper exploration of chemical space, prioritize molecules with high probabilities of being active and generate non-obvious ideas. Here we provide a critical overview of current in silico strategies in F2L optimization and highlight their remarkable impact. While very effective, most solutions are target- or fragment- specific. We propose that fully integrated in silico strategies, capable of automatically and systematically exploring the fast-growing available chemical space can have a significant impact on accelerating the release of fragment originated drugs.
Collapse
|
12
|
Wang M, Wang Z, Sun H, Wang J, Shen C, Weng G, Chai X, Li H, Cao D, Hou T. Deep learning approaches for de novo drug design: An overview. Curr Opin Struct Biol 2021; 72:135-144. [PMID: 34823138 DOI: 10.1016/j.sbi.2021.10.001] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2021] [Revised: 08/28/2021] [Accepted: 10/10/2021] [Indexed: 01/01/2023]
Abstract
De novo drug design is the process of generating novel lead compounds with desirable pharmacological and physiochemical properties. The application of deep learning (DL) in de novo drug design has become a hot topic, and many DL-based approaches have been developed for molecular generation tasks. Generally, these approaches were developed as per four frameworks: recurrent neural networks; encoder-decoder; reinforcement learning; and generative adversarial networks. In this review, we first introduced the molecular representation and assessment metrics used in DL-based de novo drug design. Then, we summarized the features of each architecture. Finally, the potential challenges and future directions of DL-based molecular generation were prospected.
Collapse
Affiliation(s)
- Mingyang Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, PR China
| | - Zhe Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, PR China
| | - Huiyong Sun
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, Jiangsu, PR China
| | - Jike Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, PR China
| | - Chao Shen
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, PR China
| | - Gaoqi Weng
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, PR China
| | - Xin Chai
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, PR China
| | - Honglin Li
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, PR China; Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science & Technology, Shanghai 200237, PR China.
| | - Dongsheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410013, Hunan, PR China.
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, PR China.
| |
Collapse
|
13
|
Nichols PL. Automated and enabling technologies for medicinal chemistry. PROGRESS IN MEDICINAL CHEMISTRY 2021; 60:191-272. [PMID: 34147203 DOI: 10.1016/bs.pmch.2021.01.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Having always been driven by the need to get new treatments to patients as quickly as possible, drug discovery is a constantly evolving process. This chapter will review how medicinal chemistry was established, how it has changed over the years due to the emergence of new enabling technologies, and how early advances in synthesis, purification and analysis, have provided the foundations upon which the current automated and enabling technologies are built. Looking beyond the established technologies, this chapter will also consider technologies that are now emerging, and their impact on the future of drug discovery and the role of medicinal chemists.
Collapse
Affiliation(s)
- Paula L Nichols
- Synple Chem AG, Kemptthal, Switzerland; ETH, Zurich, Switzerland.
| |
Collapse
|
14
|
Berenger F, Kumar A, Zhang KYJ, Yamanishi Y. Lean-Docking: Exploiting Ligands' Predicted Docking Scores to Accelerate Molecular Docking. J Chem Inf Model 2021; 61:2341-2352. [PMID: 33861591 DOI: 10.1021/acs.jcim.0c01452] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
In structure-based virtual screening (SBVS), a binding site on a protein structure is used to search for ligands with favorable nonbonded interactions. Because it is computationally difficult, docking is time-consuming and any docking user will eventually encounter a chemical library that is too big to dock. This problem might arise because there is not enough computing power or because preparing and storing so many three-dimensional (3D) ligands requires too much space. In this study, however, we show that quality regressors can be trained to predict docking scores from molecular fingerprints. Although typical docking has a screening rate of less than one ligand per second on one CPU core, our regressors can predict about 5800 docking scores per second. This approach allows us to focus docking on the portion of a database that is predicted to have docking scores below a user-chosen threshold. Herein, usage examples are shown, where only 25% of a ligand database is docked, without any significant virtual screening performance loss. We call this method "lean-docking". To validate lean-docking, a massive docking campaign using several state-of-the-art docking software packages was undertaken on an unbiased data set, with only wet-lab tested active and inactive molecules. Although regressors allow the screening of a larger chemical space, even at a constant docking power, it is also clear that significant progress in the virtual screening power of docking scores is desirable.
Collapse
Affiliation(s)
- Francois Berenger
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka 820-8502, Japan
| | - Ashutosh Kumar
- Laboratory for Structural Bioinformatics, Center for Biosystems Dynamics Research, RIKEN, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Kam Y J Zhang
- Laboratory for Structural Bioinformatics, Center for Biosystems Dynamics Research, RIKEN, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Yoshihiro Yamanishi
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka 820-8502, Japan
| |
Collapse
|
15
|
Computer-Aided Drug Designing. Adv Bioinformatics 2021. [DOI: 10.1007/978-981-33-6191-1_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
|
16
|
Ghanakota P, Bos PH, Konze KD, Staker J, Marques G, Marshall K, Leswing K, Abel R, Bhat S. Combining Cloud-Based Free-Energy Calculations, Synthetically Aware Enumerations, and Goal-Directed Generative Machine Learning for Rapid Large-Scale Chemical Exploration and Optimization. J Chem Inf Model 2020; 60:4311-4325. [PMID: 32484669 DOI: 10.1021/acs.jcim.0c00120] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Affiliation(s)
- Phani Ghanakota
- Schrödinger, Inc., 120 West 45th Street, 17th floor, New York, New York 10036, United States
| | - Pieter H. Bos
- Schrödinger, Inc., 120 West 45th Street, 17th floor, New York, New York 10036, United States
| | - Kyle D. Konze
- Schrödinger, Inc., 120 West 45th Street, 17th floor, New York, New York 10036, United States
| | - Joshua Staker
- Schrödinger, Inc., 120 West 45th Street, 17th floor, New York, New York 10036, United States
| | - Gabriel Marques
- Schrödinger, Inc., 120 West 45th Street, 17th floor, New York, New York 10036, United States
| | - Kyle Marshall
- Schrödinger, Inc., 120 West 45th Street, 17th floor, New York, New York 10036, United States
| | - Karl Leswing
- Schrödinger, Inc., 120 West 45th Street, 17th floor, New York, New York 10036, United States
| | - Robert Abel
- Schrödinger, Inc., 120 West 45th Street, 17th floor, New York, New York 10036, United States
| | - Sathesh Bhat
- Schrödinger, Inc., 120 West 45th Street, 17th floor, New York, New York 10036, United States
| |
Collapse
|
17
|
SYBA: Bayesian estimation of synthetic accessibility of organic compounds. J Cheminform 2020; 12:35. [PMID: 33431015 PMCID: PMC7238540 DOI: 10.1186/s13321-020-00439-2] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Accepted: 05/09/2020] [Indexed: 12/11/2022] Open
Abstract
SYBA (SYnthetic Bayesian Accessibility) is a fragment-based method for the rapid classification of organic compounds as easy- (ES) or hard-to-synthesize (HS). It is based on a Bernoulli naïve Bayes classifier that is used to assign SYBA score contributions to individual fragments based on their frequencies in the database of ES and HS molecules. SYBA was trained on ES molecules available in the ZINC15 database and on HS molecules generated by the Nonpher methodology. SYBA was compared with a random forest, that was utilized as a baseline method, as well as with other two methods for synthetic accessibility assessment: SAScore and SCScore. When used with their suggested thresholds, SYBA improves over random forest classification, albeit marginally, and outperforms SAScore and SCScore. However, upon the optimization of SAScore threshold (that changes from 6.0 to – 4.5), SAScore yields similar results as SYBA. Because SYBA is based merely on fragment contributions, it can be used for the analysis of the contribution of individual molecular parts to compound synthetic accessibility. SYBA is publicly available at https://github.com/lich-uct/syba under the GNU General Public License.
Collapse
|
18
|
Banjare L, Verma SK, Jain AK, Thareja S. Lead Molecules as Novel Aromatase Inhibitors: In Silico De Novo Designing and Binding Affinity Studies. LETT DRUG DES DISCOV 2020. [DOI: 10.2174/1570180816666190703152659] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:Aromatase inhibitors emerged as a pivotal moiety to selectively block estrogen production, prevention and treatment of tumour growth in breast cancer. De novo drug design is an alternative approach to blind virtual screening for successful designing of the novel molecule against various therapeutic targets.Objective:In the present study, we have explored the de novo approach to design novel aromatase inhibitors.Method:The e-LEA3D, a computational-aided drug design web server was used to design novel drug-like candidates against the target aromatase. For drug-likeness ADME parameters (molecular weight, H-bond acceptors, H-bond donors, LogP and number of rotatable bonds) of designed molecules were calculated in TSAR software package, geometry optimization and energy minimization was accomplished using Chem Office. Further, molecular docking study was performed in Molegro Virtual Docker (MVD).Results:Among 17 generated molecules using the de novo pathway, 13 molecules passed the Lipinski filter pertaining to their bioavailability characteristics. De novo designed molecules with drug-likeness were further docked into the mapped active site of aromatase to scale up their affinity and binding fitness with the target. Among de novo fabricated drug like candidates (1-13), two molecules (5, 6) exhibited higher affinity with aromatase in terms of MolDock score (-150.650, -172.680 Kcal/mol, respectively) while molecule 8 showed lowest target affinity (-85.588 Kcal/mol).Conclusion:The binding patterns of lead molecules (5, 6) could be used as a pharmacophore for medicinal chemists to explore these molecules for their aromatase inhibitory potential.
Collapse
Affiliation(s)
- Laxmi Banjare
- School of Pharmaceutical Sciences, Guru Ghasidas Central University, Bilaspur- 495009 (C.G.), India
| | - Sant Kumar Verma
- School of Pharmaceutical Sciences, Guru Ghasidas Central University, Bilaspur- 495009 (C.G.), India
| | - Akhlesh Kumar Jain
- School of Pharmaceutical Sciences, Guru Ghasidas Central University, Bilaspur- 495009 (C.G.), India
| | - Suresh Thareja
- School of Pharmaceutical Sciences, Guru Ghasidas Central University, Bilaspur- 495009 (C.G.), India
| |
Collapse
|
19
|
Matos MJB, Pina AS, Roque ACA. Rational design of affinity ligands for bioseparation. J Chromatogr A 2020; 1619:460871. [PMID: 32044126 DOI: 10.1016/j.chroma.2020.460871] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Revised: 01/05/2020] [Accepted: 01/08/2020] [Indexed: 11/25/2022]
Abstract
Affinity adsorbents have been the cornerstone in protein purification. The selective nature of the molecular recognition interactions established between an affinity ligands and its target provide the basis for efficient capture and isolation of proteins. The plethora of affinity adsorbents available in the market reflects the importance of affinity chromatography in the bioseparation industry. Ligand discovery relies on the implementation of rational design techniques, which provides the foundation for the engineering of novel affinity ligands. The main goal for the design of affinity ligands is to discover or improve functionality, such as increased stability or selectivity. However, the methodologies must adapt to the current needs, namely to the number and diversity of biologicals being developed, and the availability of new tools for big data analysis and artificial intelligence. In this review, we offer an overview on the development of affinity ligands for bioseparation, including the evolution of rational design techniques, dating back to the years of early discovery up to the current and future trends in the field.
Collapse
Affiliation(s)
- Manuel J B Matos
- UCIBIO, Chemistry Department, School of Sciences and Technology, NOVA University of Lisbon, 2829-516 Caparica, Portugal
| | - Ana S Pina
- UCIBIO, Chemistry Department, School of Sciences and Technology, NOVA University of Lisbon, 2829-516 Caparica, Portugal
| | - A C A Roque
- UCIBIO, Chemistry Department, School of Sciences and Technology, NOVA University of Lisbon, 2829-516 Caparica, Portugal.
| |
Collapse
|
20
|
Schneider P, Walters WP, Plowright AT, Sieroka N, Listgarten J, Goodnow RA, Fisher J, Jansen JM, Duca JS, Rush TS, Zentgraf M, Hill JE, Krutoholow E, Kohler M, Blaney J, Funatsu K, Luebkemann C, Schneider G. Rethinking drug design in the artificial intelligence era. Nat Rev Drug Discov 2019. [DOI: 78495111110.1038/s41573-019-0050-3' target='_blank'>'"<>78495111110.1038/s41573-019-0050-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [78495111110.1038/s41573-019-0050-3','', '10.1517/17460441.2010.497534')">Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/29/2022]
78495111110.1038/s41573-019-0050-3" />
|
21
|
Rethinking drug design in the artificial intelligence era. Nat Rev Drug Discov 2019; 19:353-364. [DOI: 10.1038/s41573-019-0050-3] [Citation(s) in RCA: 222] [Impact Index Per Article: 44.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/28/2019] [Indexed: 12/17/2022]
|
22
|
Hsu HH, Huang CH, Lin ST. New Data Structure for Computational Molecular Design with Atomic or Fragment Resolution. J Chem Inf Model 2019; 59:3703-3713. [PMID: 31393721 DOI: 10.1021/acs.jcim.9b00478] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A new molecular data structure and molecular structure operation algorithms are proposed for general purpose molecular design. The data structure allows for a variety of molecular operations for creating new molecules. Two types of molecular operations were developed, unimolecular and bimolecular operations. In unimolecular operations, a child molecule can be created from a parent via addition of a functional group, deletion of a fragment, mutation of an atom, etc. In bimolecular operations, children molecules are generated from two parent molecules through combination or crossover (hybridization). These molecular operations are essential for the creation and modification of molecules for the purpose of molecular design. The data structure is capable of representing linear, branched, multifunctional, and multivalent compounds. Algorithms are developed for deriving the molecular data structure of a molecule from its atomic coordinates and vice versa. We show that this new molecular data structure and the developed algorithms, referred to as Molecular Assembling and Representation Suite, allow one to generate a comprehensive library of new molecules via performing every possible molecular structure modification.
Collapse
Affiliation(s)
- Hsuan-Hao Hsu
- Department of Chemical Engineering , National Taiwan University , Taipei 10617 , Taiwan
| | - Chen-Hsuan Huang
- Department of Chemical Engineering , National Taiwan University , Taipei 10617 , Taiwan
| | - Shiang-Tai Lin
- Department of Chemical Engineering , National Taiwan University , Taipei 10617 , Taiwan
| |
Collapse
|
23
|
Foscato M, Venkatraman V, Jensen VR. DENOPTIM: Software for Computational de Novo Design of Organic and Inorganic Molecules. J Chem Inf Model 2019; 59:4077-4082. [DOI: 10.1021/acs.jcim.9b00516] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- Marco Foscato
- Department of Chemistry, University of Bergen, Allégaten 41, N-5007 Bergen, Norway
| | - Vishwesh Venkatraman
- Department of Chemistry, Norwegian University of Science and Technology, N-7491 Trondheim, Norway
| | - Vidar R. Jensen
- Department of Chemistry, University of Bergen, Allégaten 41, N-5007 Bergen, Norway
| |
Collapse
|
24
|
Konze KD, Bos PH, Dahlgren MK, Leswing K, Tubert-Brohman I, Bortolato A, Robbason B, Abel R, Bhat S. Reaction-Based Enumeration, Active Learning, and Free Energy Calculations To Rapidly Explore Synthetically Tractable Chemical Space and Optimize Potency of Cyclin-Dependent Kinase 2 Inhibitors. J Chem Inf Model 2019; 59:3782-3793. [PMID: 31404495 DOI: 10.1021/acs.jcim.9b00367] [Citation(s) in RCA: 59] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The hit-to-lead and lead optimization processes usually involve the design, synthesis, and profiling of thousands of analogs prior to clinical candidate nomination. A hit finding campaign may begin with a virtual screen that explores millions of compounds, if not more. However, this scale of computational profiling is not frequently performed in the hit-to-lead or lead optimization phases of drug discovery. This is likely due to the lack of appropriate computational tools to generate synthetically tractable lead-like compounds in silico, and a lack of computational methods to accurately profile compounds prospectively on a large scale. Recent advances in computational power and methods provide the ability to profile much larger libraries of ligands than previously possible. Herein, we report a new computational technique, referred to as "PathFinder", that uses retrosynthetic analysis followed by combinatorial synthesis to generate novel compounds in synthetically accessible chemical space. In this work, the integration of PathFinder-driven compound generation, cloud-based FEP simulations, and active learning are used to rapidly optimize R-groups, and generate new cores for inhibitors of cyclin-dependent kinase 2 (CDK2). Using this approach, we explored >300 000 ideas, performed >5000 FEP simulations, and identified >100 ligands with a predicted IC50 < 100 nM, including four unique cores. To our knowledge, this is the largest set of FEP calculations disclosed in the literature to date. The rapid turnaround time, and scale of chemical exploration, suggests that this is a useful approach to accelerate the discovery of novel chemical matter in drug discovery campaigns.
Collapse
Affiliation(s)
- Kyle D Konze
- Schrödinger Inc. , 120 West 45th Street, 17th floor , New York , New York 10036 , United States
| | - Pieter H Bos
- Schrödinger Inc. , 120 West 45th Street, 17th floor , New York , New York 10036 , United States
| | - Markus K Dahlgren
- Schrödinger Inc. , 120 West 45th Street, 17th floor , New York , New York 10036 , United States
| | - Karl Leswing
- Schrödinger Inc. , 120 West 45th Street, 17th floor , New York , New York 10036 , United States
| | - Ivan Tubert-Brohman
- Schrödinger Inc. , 120 West 45th Street, 17th floor , New York , New York 10036 , United States
| | - Andrea Bortolato
- Schrödinger Inc. , 120 West 45th Street, 17th floor , New York , New York 10036 , United States
| | - Braxton Robbason
- Schrödinger Inc. , 120 West 45th Street, 17th floor , New York , New York 10036 , United States
| | - Robert Abel
- Schrödinger Inc. , 120 West 45th Street, 17th floor , New York , New York 10036 , United States
| | - Sathesh Bhat
- Schrödinger Inc. , 120 West 45th Street, 17th floor , New York , New York 10036 , United States
| |
Collapse
|
25
|
Abstract
De novo drug design aims to generate novel chemical compounds with desirable chemical and pharmacological properties from scratch using computer-based methods. Recently, deep generative neural networks have become a very active research frontier in de novo drug discovery, both in theoretical and in experimental evidence, shedding light on a promising new direction of automatic molecular generation and optimization. In this review, we discussed recent development of deep learning models for molecular generation and summarized them as four different generative architectures with four different optimization strategies. We also discussed future directions of deep generative models for de novo drug design.
Collapse
|
26
|
Sommer K, Flachsenberg F, Rarey M. NAOMInext – Synthetically feasible fragment growing in a structure-based design context. Eur J Med Chem 2019; 163:747-762. [DOI: 10.1016/j.ejmech.2018.11.075] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 11/27/2018] [Accepted: 11/30/2018] [Indexed: 12/31/2022]
|
27
|
Abstract
Although significant advances in experimental high throughput screening (HTS) have been made for drug lead identification, in silico virtual screening (VS) is indispensable owing to its unique advantage over experimental HTS, target-focused, cheap, and efficient, albeit its disadvantage of producing false positive hits. For both experimental HTS and VS, the quality of screening libraries is crucial and determines the outcome of those studies. In this paper, we first reviewed the recent progress on screening library construction. We realized the urgent need for compiling high-quality screening libraries in drug discovery. Then we compiled a set of screening libraries from about 20 million druglike ZINC molecules by running fingerprint-based similarity searches against known drug molecules. Lastly, the screening libraries were objectively evaluated using 5847 external actives covering more than 2000 drug targets. The result of the assessment is very encouraging. For example, with the Tanimoto coefficient being set to 0.75, 36% of external actives were retrieved and the enrichment factor was 13. Additionally, drug target family specific screening libraries were also constructed and evaluated. The druglike screening libraries are available for download from https://mulan.pharmacy.pitt.edu .
Collapse
Affiliation(s)
- Junmei Wang
- Department of Pharmaceutical Sciences , The University of Pittsburgh , 3501 Terrace Street , Pittsburgh , Pennsylvania 15261 , United States
| | - Yubin Ge
- Department of Pharmaceutical Sciences , The University of Pittsburgh , 3501 Terrace Street , Pittsburgh , Pennsylvania 15261 , United States
| | - Xiang-Qun Xie
- Department of Pharmaceutical Sciences , The University of Pittsburgh , 3501 Terrace Street , Pittsburgh , Pennsylvania 15261 , United States
| |
Collapse
|
28
|
Su M, Yang Q, Du Y, Feng G, Liu Z, Li Y, Wang R. Comparative Assessment of Scoring Functions: The CASF-2016 Update. J Chem Inf Model 2018; 59:895-913. [DOI: 10.1021/acs.jcim.8b00545] [Citation(s) in RCA: 208] [Impact Index Per Article: 34.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Affiliation(s)
- Minyi Su
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Center for Excellence in Molecular Synthesis, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
- University of Chinese Academy of Sciences, Beijing 100049, People’s Republic of China
| | - Qifan Yang
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Center for Excellence in Molecular Synthesis, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
- University of Chinese Academy of Sciences, Beijing 100049, People’s Republic of China
| | - Yu Du
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Center for Excellence in Molecular Synthesis, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
- University of Chinese Academy of Sciences, Beijing 100049, People’s Republic of China
| | - Guoqin Feng
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Center for Excellence in Molecular Synthesis, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
- University of Chinese Academy of Sciences, Beijing 100049, People’s Republic of China
| | - Zhihai Liu
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Center for Excellence in Molecular Synthesis, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
| | - Yan Li
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Center for Excellence in Molecular Synthesis, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
| | - Renxiao Wang
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Center for Excellence in Molecular Synthesis, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
- University of Chinese Academy of Sciences, Beijing 100049, People’s Republic of China
- Shanxi Key Laboratory of Innovative Drugs for the Treatment of Serious Diseases Basing on Chronic Inflammation, College of Traditional Chinese Medicines, Shanxi University of Chinese Medicine, Taiyuan, Shanxi 030619, People’s Republic of China
| |
Collapse
|
29
|
Panteleev J, Gao H, Jia L. Recent applications of machine learning in medicinal chemistry. Bioorg Med Chem Lett 2018; 28:2807-2815. [PMID: 30122222 DOI: 10.1016/j.bmcl.2018.06.046] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Revised: 06/24/2018] [Accepted: 06/26/2018] [Indexed: 12/20/2022]
Abstract
In recent decades, artificial intelligence and machine learning have played a significant role in increasing the efficiency of processes across a wide spectrum of industries. When it comes to the pharmaceutical and biotechnology sectors, numerous tools enabled by advancement of computer science have been developed and are now routinely utilized. However, there are many aspects of the drug discovery process, which can further benefit from refinement of computational methods and tools, as well as improvement of accessibility of these new technologies. In this review, examples of recent developments in machine learning application are described, which have the potential to impact different parts of the drug discovery and development flow scheme. Notably, new deep learning-based approaches across compound design and synthesis, prediction of binding, activity and ADMET properties, as well as applications of genetic algorithms are highlighted.
Collapse
Affiliation(s)
- Jane Panteleev
- Amgen Discovery Research, 360 Binney St., Cambridge, MA 02141, USA
| | - Hua Gao
- Amgen Discovery Research, 360 Binney St., Cambridge, MA 02141, USA
| | - Lei Jia
- Amgen Discovery Research, One Amgen Center Dr., Thousand Oaks, CA 91320, USA.
| |
Collapse
|
30
|
|
31
|
Kumar A, Zhang KYJ. A cross docking pipeline for improving pose prediction and virtual screening performance. J Comput Aided Mol Des 2017; 32:163-173. [PMID: 28836076 DOI: 10.1007/s10822-017-0048-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Accepted: 08/18/2017] [Indexed: 02/02/2023]
Abstract
Pose prediction and virtual screening performance of a molecular docking method depend on the choice of protein structures used for docking. Multiple structures for a target protein are often used to take into account the receptor flexibility and problems associated with a single receptor structure. However, the use of multiple receptor structures is computationally expensive when docking a large library of small molecules. Here, we propose a new cross-docking pipeline suitable to dock a large library of molecules while taking advantage of multiple target protein structures. Our method involves the selection of a suitable receptor for each ligand in a screening library utilizing ligand 3D shape similarity with crystallographic ligands. We have prospectively evaluated our method in D3R Grand Challenge 2 and demonstrated that our cross-docking pipeline can achieve similar or better performance than using either single or multiple-receptor structures. Moreover, our method displayed not only decent pose prediction performance but also better virtual screening performance over several other methods.
Collapse
Affiliation(s)
- Ashutosh Kumar
- Structural Bioinformatics Team, Center for Life Science Technologies, RIKEN, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa, 230-0045, Japan
| | - Kam Y J Zhang
- Structural Bioinformatics Team, Center for Life Science Technologies, RIKEN, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa, 230-0045, Japan.
| |
Collapse
|
32
|
A pose prediction approach based on ligand 3D shape similarity. J Comput Aided Mol Des 2016; 30:457-69. [DOI: 10.1007/s10822-016-9923-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2016] [Accepted: 07/01/2016] [Indexed: 11/27/2022]
|
33
|
Zhou QT, Liang HJ, Shakhnovich E. Virtual Screening of Human O-GlcNAc Transferase Inhibitors. CHINESE J CHEM PHYS 2016. [DOI: 10.1063/1674-0068/29/cjcp1510211] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
|
34
|
Ramírez D. Computational Methods Applied to Rational Drug Design. THE OPEN MEDICINAL CHEMISTRY JOURNAL 2016; 10:7-20. [PMID: 27708723 PMCID: PMC5039900 DOI: 10.2174/1874104501610010007] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Revised: 01/27/2016] [Accepted: 01/28/2016] [Indexed: 11/22/2022]
Abstract
Due
to the synergic relationship between medical chemistry, bioinformatics and
molecular simulation, the development of new accurate computational tools for
small molecules drug design has been rising over the last years. The main result
is the increased number of publications where computational techniques such as
molecular docking, de novo design as well as virtual screening have been
used to estimate the binding mode, site and energy of novel small molecules. In
this work I review some tools, which enable the study of biological systems at
the atomistic level, providing relevant information and thereby, enhancing the
process of rational drug design.
Collapse
Affiliation(s)
- David Ramírez
- Centro de Bioinformática y Simulación Molecular, Universidad de Talca, 2 Norte 685, Casilla, Talca, Chile
| |
Collapse
|
35
|
Li Y, Zhao Z, Liu Z, Su M, Wang R. AutoT&T v.2: An Efficient and Versatile Tool for Lead Structure Generation and Optimization. J Chem Inf Model 2016; 56:435-53. [DOI: 10.1021/acs.jcim.5b00691] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Affiliation(s)
- Yan Li
- State
Key Laboratory of Bioorganic and Natural Products Chemistry, Collaborative
Innovation Center of Chemistry for Life Sciences, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
| | - Zhixiong Zhao
- State
Key Laboratory of Bioorganic and Natural Products Chemistry, Collaborative
Innovation Center of Chemistry for Life Sciences, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
| | - Zhihai Liu
- State
Key Laboratory of Bioorganic and Natural Products Chemistry, Collaborative
Innovation Center of Chemistry for Life Sciences, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
| | - Minyi Su
- State
Key Laboratory of Bioorganic and Natural Products Chemistry, Collaborative
Innovation Center of Chemistry for Life Sciences, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
| | - Renxiao Wang
- State
Key Laboratory of Bioorganic and Natural Products Chemistry, Collaborative
Innovation Center of Chemistry for Life Sciences, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
- State
Key Laboratory of Quality Research in Chinese Medicine, Macau Institute
for Applied Research in Medicine and Health, Macau University of Science and Technology, Macau, People’s Republic of China
| |
Collapse
|
36
|
Chéron N, Jasty N, Shakhnovich EI. OpenGrowth: An Automated and Rational Algorithm for Finding New Protein Ligands. J Med Chem 2015; 59:4171-88. [DOI: 10.1021/acs.jmedchem.5b00886] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Affiliation(s)
- Nicolas Chéron
- Department of Chemistry and
Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Naveen Jasty
- Department of Chemistry and
Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Eugene I. Shakhnovich
- Department of Chemistry and
Chemical Biology, Harvard University, Cambridge, Massachusetts 02138, United States
| |
Collapse
|
37
|
Foscato M, Houghton BJ, Occhipinti G, Deeth RJ, Jensen VR. Ring Closure To Form Metal Chelates in 3D Fragment-Based de Novo Design. J Chem Inf Model 2015; 55:1844-56. [DOI: 10.1021/acs.jcim.5b00424] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Marco Foscato
- Department
of Chemistry, University of Bergen, Allégaten 41, N-5007 Bergen, Norway
| | - Benjamin J. Houghton
- Inorganic
Computational Chemistry Group, Department of Chemistry, University of Warwick, Gibbet Hill Road, Coventry CV4 7AL, Great Britain
| | - Giovanni Occhipinti
- Department
of Chemistry, University of Bergen, Allégaten 41, N-5007 Bergen, Norway
| | - Robert J. Deeth
- Inorganic
Computational Chemistry Group, Department of Chemistry, University of Warwick, Gibbet Hill Road, Coventry CV4 7AL, Great Britain
| | - Vidar R. Jensen
- Department
of Chemistry, University of Bergen, Allégaten 41, N-5007 Bergen, Norway
| |
Collapse
|
38
|
Kumar A, Zhang KYJ. Application of Shape Similarity in Pose Selection and Virtual Screening in CSARdock2014 Exercise. J Chem Inf Model 2015; 56:965-73. [PMID: 26247231 DOI: 10.1021/acs.jcim.5b00279] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
To evaluate the applicability of shape similarity in docking-based pose selection and virtual screening, we participated in the CSARdock2014 benchmark exercise for identifying the correct docking pose of inhibitors targeting factor XA, spleen tyrosine kinase, and tRNA methyltransferase. This exercise provides a valuable opportunity for researchers to test their docking programs, methods, and protocols in a blind testing environment. In the CSARdock2014 benchmark exercise, we have implemented an approach that uses ligand 3D shape similarity to facilitate docking-based pose selection and virtual screening. We showed here that ligand 3D shape similarity between bound poses could be used to identify the native-like pose from an ensemble of docking-generated poses. Our method correctly identified the native pose as the top-ranking pose for 73% of test cases in a blind testing environment. Moreover, the pose selection results also revealed an excellent correlation between ligand 3D shape similarity scores and RMSD to X-ray crystal structure ligand. In the virtual screening exercise, the average RMSD for our pose prediction was found to be 1.02 Å, and it was one of the top performances achieved in CSARdock2014 benchmark exercise. Furthermore, the inclusion of shape similarity improved virtual screening performance of docking-based scoring and ranking. The coefficient of determination (r(2)) between experimental activities and docking scores for 276 spleen tyrosine kinase inhibitors was found to be 0.365 but reached 0.614 when the ligand 3D shape similarity was included.
Collapse
Affiliation(s)
- Ashutosh Kumar
- Structural Bioinformatics Team, Center for Life Science Technologies, RIKEN , 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan
| | - Kam Y J Zhang
- Structural Bioinformatics Team, Center for Life Science Technologies, RIKEN , 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan
| |
Collapse
|
39
|
Lolli M, Narramore S, Fishwick CW, Pors K. Refining the chemical toolbox to be fit for educational and practical purpose for drug discovery in the 21st Century. Drug Discov Today 2015; 20:1018-26. [DOI: 10.1016/j.drudis.2015.04.010] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2014] [Revised: 04/08/2015] [Accepted: 04/29/2015] [Indexed: 12/16/2022]
|
40
|
Abstract
Drug discovery utilizes chemical biology and computational drug design approaches for the efficient identification and optimization of lead compounds. Chemical biology is mostly involved in the elucidation of the biological function of a target and the mechanism of action of a chemical modulator. On the other hand, computer-aided drug design makes use of the structural knowledge of either the target (structure-based) or known ligands with bioactivity (ligand-based) to facilitate the determination of promising candidate drugs. Various virtual screening techniques are now being used by both pharmaceutical companies and academic research groups to reduce the cost and time required for the discovery of a potent drug. Despite the rapid advances in these methods, continuous improvements are critical for future drug discovery tools. Advantages presented by structure-based and ligand-based drug design suggest that their complementary use, as well as their integration with experimental routines, has a powerful impact on rational drug design. In this article, we give an overview of the current computational drug design and their application in integrated rational drug development to aid in the progress of drug discovery research.
Collapse
Affiliation(s)
- Stephani Joy Y Macalino
- National Leading Research Laboratory of Molecular Modeling and Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences, and Global Top 5 Research Program, Ewha Womans University, Seoul, 120-750, Korea
| | - Vijayakumar Gosu
- National Leading Research Laboratory of Molecular Modeling and Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences, and Global Top 5 Research Program, Ewha Womans University, Seoul, 120-750, Korea
| | - Sunhye Hong
- National Leading Research Laboratory of Molecular Modeling and Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences, and Global Top 5 Research Program, Ewha Womans University, Seoul, 120-750, Korea
| | - Sun Choi
- National Leading Research Laboratory of Molecular Modeling and Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences, and Global Top 5 Research Program, Ewha Womans University, Seoul, 120-750, Korea.
| |
Collapse
|
41
|
Ramakrishnan R, Dral PO, Rupp M, von Lilienfeld OA. Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach. J Chem Theory Comput 2015; 11:2087-96. [DOI: 10.1021/acs.jctc.5b00099] [Citation(s) in RCA: 419] [Impact Index Per Article: 46.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Affiliation(s)
- Raghunathan Ramakrishnan
- Institute
of Physical Chemistry and National Center for Computational Design
and Discovery of Novel Materials, Department of Chemistry, University of Basel, Klingelbergstraße 80, CH-4056 Basel, Switzerland
| | - Pavlo O. Dral
- Max-Planck-Institut
für Kohlenforschung, Kaiser-Wilhelm-Platz
1, 45470 Mülheim
an der Ruhr, Germany
- Computer-Chemie-Centrum
and Interdisciplinary Center for Molecular Materials, Department Chemie
und Pharmazie, Friedrich-Alexander-Universität Erlangen-Nürnberg, Nägelsbachstraße 25, 91052 Erlangen, Germany
| | - Matthias Rupp
- Institute
of Physical Chemistry and National Center for Computational Design
and Discovery of Novel Materials, Department of Chemistry, University of Basel, Klingelbergstraße 80, CH-4056 Basel, Switzerland
| | - O. Anatole von Lilienfeld
- Institute
of Physical Chemistry and National Center for Computational Design
and Discovery of Novel Materials, Department of Chemistry, University of Basel, Klingelbergstraße 80, CH-4056 Basel, Switzerland
- Argonne
Leadership Computing Facility, Argonne National Laboratory, 9700 S. Cass
Avenue, Lemont, Illinois 60439, United States
| |
Collapse
|
42
|
Kumar A, Zhang KYJ. Hierarchical virtual screening approaches in small molecule drug discovery. Methods 2015; 71:26-37. [PMID: 25072167 PMCID: PMC7129923 DOI: 10.1016/j.ymeth.2014.07.007] [Citation(s) in RCA: 95] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Revised: 07/16/2014] [Accepted: 07/17/2014] [Indexed: 02/06/2023] Open
Abstract
Virtual screening has played a significant role in the discovery of small molecule inhibitors of therapeutic targets in last two decades. Various ligand and structure-based virtual screening approaches are employed to identify small molecule ligands for proteins of interest. These approaches are often combined in either hierarchical or parallel manner to take advantage of the strength and avoid the limitations associated with individual methods. Hierarchical combination of ligand and structure-based virtual screening approaches has received noteworthy success in numerous drug discovery campaigns. In hierarchical virtual screening, several filters using ligand and structure-based approaches are sequentially applied to reduce a large screening library to a number small enough for experimental testing. In this review, we focus on different hierarchical virtual screening strategies and their application in the discovery of small molecule modulators of important drug targets. Several virtual screening studies are discussed to demonstrate the successful application of hierarchical virtual screening in small molecule drug discovery.
Collapse
Affiliation(s)
- Ashutosh Kumar
- Structural Bioinformatics Team, Center for Life Science Technologies, RIKEN, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan
| | - Kam Y J Zhang
- Structural Bioinformatics Team, Center for Life Science Technologies, RIKEN, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan.
| |
Collapse
|
43
|
Pei J, Yin N, Ma X, Lai L. Systems Biology Brings New Dimensions for Structure-Based Drug Design. J Am Chem Soc 2014; 136:11556-65. [DOI: 10.1021/ja504810z] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Jianfeng Pei
- Center
for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Ning Yin
- Center
for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Xiaomin Ma
- Center
for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Luhua Lai
- Center
for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Beijing
National Laboratory for Molecular Science, State Key Laboratory for
Structural Chemistry of Unstable and Stable Species, College of Chemistry
and Molecular Engineering, Peking University, Beijing 100871, China
- Peking-Tsinghua
Center for Life Sciences, Peking University, Beijing 100871, China
| |
Collapse
|
44
|
Yadahalli S, Hemanth Giri Rao VV, Gosavi S. Modeling Non-Native Interactions in Designed Proteins. Isr J Chem 2014. [DOI: 10.1002/ijch.201400035] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
45
|
Applications of structure-based design to antibacterial drug discovery. Bioorg Chem 2014; 55:69-76. [PMID: 24962384 DOI: 10.1016/j.bioorg.2014.05.008] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Revised: 05/15/2014] [Accepted: 05/15/2014] [Indexed: 11/21/2022]
Abstract
In recent years bacterial resistance has been observed against many of our current antibiotics, for instance most worryingly against the cephalosporins which are typically the last line of defence against many bacterial infections. Additionally the failure of high throughput screening in the discovery of new antibacterial drug leads has led to a decline in the number of antibacterial agents reaching the market. Alternative methods of drug discovery including structure based drug design are needed to meet the threats caused by the emergence of resistance. In this review we explore the latest advancements in the identification of new antibacterial agents through the use of a number of structure based drug design programs.
Collapse
|
46
|
Foscato M, Occhipinti G, Venkatraman V, Alsberg BK, Jensen VR. Automated Design of Realistic Organometallic Molecules from Fragments. J Chem Inf Model 2014; 54:767-80. [DOI: 10.1021/ci4007497] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Affiliation(s)
- Marco Foscato
- Department
of Chemistry, University of Bergen, Allégaten 41, N-5007 Bergen, Norway
| | - Giovanni Occhipinti
- Department
of Chemistry, University of Bergen, Allégaten 41, N-5007 Bergen, Norway
| | - Vishwesh Venkatraman
- Department
of Chemistry, Norwegian University of Science and Technology, N-7491 Trondheim, Norway
| | - Bjørn K. Alsberg
- Department
of Chemistry, Norwegian University of Science and Technology, N-7491 Trondheim, Norway
| | - Vidar R. Jensen
- Department
of Chemistry, University of Bergen, Allégaten 41, N-5007 Bergen, Norway
| |
Collapse
|
47
|
Cumming JG, Davis AM, Muresan S, Haeberlein M, Chen H. Chemical predictive modelling to improve compound quality. Nat Rev Drug Discov 2014; 12:948-62. [PMID: 24287782 DOI: 10.1038/nrd4128] [Citation(s) in RCA: 156] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The 'quality' of small-molecule drug candidates, encompassing aspects including their potency, selectivity and ADMET (absorption, distribution, metabolism, excretion and toxicity) characteristics, is a key factor influencing the chances of success in clinical trials. Importantly, such characteristics are under the control of chemists during the identification and optimization of lead compounds. Here, we discuss the application of computational methods, particularly quantitative structure-activity relationships (QSARs), in guiding the selection of higher-quality drug candidates, as well as cultural factors that may have affected their use and impact.
Collapse
Affiliation(s)
- John G Cumming
- Chemistry Innovation Centre, Discovery Sciences, AstraZeneca R&D, Alderley Park, Macclesfield SK10 4TG, UK
| | | | | | | | | |
Collapse
|
48
|
Wassermann AM, Kutchukian PS, Lounkine E, Luethi T, Hamon J, Bocker MT, Malik HA, Cowan-Jacob SW, Glick M. Efficient search of chemical space: navigating from fragments to structurally diverse chemotypes. J Med Chem 2013; 56:8879-91. [PMID: 24117015 DOI: 10.1021/jm401309q] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
We introduce a novel strategy to sample bioactive chemical space, which follows-up on hits from fragment campaigns without the need for a crystal structure. Our results strongly suggest that screening a few hundred or thousand fragments can substantially improve the selection of small-molecule screening subsets. By combining fragment-based screening with virtual fragment linking and HTS fingerprints, we have developed an effective strategy not only to expand from low-affinity hits to potent compounds but also to hop in chemical space to substantially novel chemotypes. In benchmark calculations, our approach accessed subsets of compounds that were substantially enriched in chemically diverse hit compounds for various activity classes. Overall, half of the hits in the screening collection were found by screening only 10% of the library. Furthermore, a prospective application led to the discovery of two structurally novel histone deacetylase 4 inhibitors.
Collapse
Affiliation(s)
- Anne Mai Wassermann
- Novartis Institutes for Biomedical Research Inc. , 250 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | | | | | | | | | | | | | | | | |
Collapse
|
49
|
Beccari AR, Cavazzoni C, Beato C, Costantino G. LiGen: a high performance workflow for chemistry driven de novo design. J Chem Inf Model 2013; 53:1518-27. [PMID: 23617275 DOI: 10.1021/ci400078g] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Tools for molecular de novo design are actively sought incorporating sets of chemical rules for fast and efficient identification of structurally new chemotypes endowed with a desired set of biological properties. In this paper, we present LiGen, a suite of programs which can be used sequentially or as stand-alone tools for specific purposes. In its standard application, LiGen modules are used to define input constraints, either structure-based, through active site identification, or ligand-based, through pharmacophore definition, to docking and to de novo generation. Alternatively, individual modules can be combined in a user-defined manner to generate project-centric workflows. Specific features of LiGen are the use of a pharmacophore-based docking procedure which allows flexible docking without conformer enumeration and accurate and flexible reactant mapping coupled with reactant tagging through substructure searching. The full description of LiGen functionalities is presented.
Collapse
Affiliation(s)
- Andrea R Beccari
- Dompé R&D Centre, Dompé SpA, Via Campo di Pile, 67100 L'Aquila, Italy.
| | | | | | | |
Collapse
|
50
|
García-Sosa AT. Hydration Properties of Ligands and Drugs in Protein Binding Sites: Tightly-Bound, Bridging Water Molecules and Their Effects and Consequences on Molecular Design Strategies. J Chem Inf Model 2013; 53:1388-405. [DOI: 10.1021/ci3005786] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|