1
|
Cheng C, Beroza P. Shape-Aware Synthon Search (SASS) for Virtual Screening of Synthon-Based Chemical Spaces. J Chem Inf Model 2024; 64:1251-1260. [PMID: 38335044 DOI: 10.1021/acs.jcim.3c01865] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2024]
Abstract
Virtual screening of large-scale chemical libraries has become increasingly useful for identifying high-quality candidates for drug discovery. While it is possible to exhaustively screen chemical spaces that number on the order of billions, indirect combinatorial approaches are needed to efficiently navigate larger, synthon-based virtual spaces. We describe Shape-Aware Synthon Search (SASS), a synthon-based virtual screening method that carries out shape similarity searches in the synthon space instead of the enumerated product space. SASS can replicate results from exhaustive searches in ultralarge, combinatorial spaces with high recall on a variety of query molecules while only scoring a small subspace of possible enumerated products, thereby significantly accelerating large-scale, shape-based virtual screening.
Collapse
Affiliation(s)
- Chen Cheng
- Discovery Chemistry, Genentech, South San Francisco, California 94080, United States
| | - Paul Beroza
- Discovery Chemistry, Genentech, South San Francisco, California 94080, United States
| |
Collapse
|
2
|
Abstract
We present an efficient algorithm for substructure search in combinatorial libraries defined by synthons, i.e., substructures with connection points. Our method improves on existing approaches by introducing powerful heuristics and fast fingerprint screening to quickly eliminate branches of nonmatching combinations of synthons. With this, we achieve typical response times of a few seconds on a standard desktop computer for searches in large combinatorial libraries like the Enamine REAL Space. We published the Java source as part of the OpenChemLib under the BSD license, and we implemented tools to enable substructure search in custom combinatorial libraries.
Collapse
Affiliation(s)
- Thomas Liphardt
- Scientific Computing Drug Discovery, Idorsia Pharmaceuticals, Ltd., CH-4123 Allschwil, Switzerland
| | - Thomas Sander
- Scientific Computing Drug Discovery, Idorsia Pharmaceuticals, Ltd., CH-4123 Allschwil, Switzerland
| |
Collapse
|
3
|
Meyenburg C, Dolfus U, Briem H, Rarey M. Galileo: Three-dimensional searching in large combinatorial fragment spaces on the example of pharmacophores. J Comput Aided Mol Des 2023; 37:1-16. [PMID: 36418668 PMCID: PMC10032335 DOI: 10.1007/s10822-022-00485-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 10/17/2022] [Indexed: 11/25/2022]
Abstract
Fragment spaces are an efficient way to model large chemical spaces using a handful of small fragments and a few connection rules. The development of Enamine's REAL Space has shown that large spaces of readily available compounds may be created this way. These are several orders of magnitude larger than previous libraries. So far, searching and navigating these spaces is mostly limited to topological approaches. A way to overcome this limitation is optimization via metaheuristics which can be combined with arbitrary scoring functions. Here we present Galileo, a novel Genetic Algorithm to sample fragment spaces. We showcase Galileo in combination with a novel pharmacophore mapping approach, called Phariety, enabling 3D searches in fragment spaces. We estimate the effectiveness of the approach with a small fragment space. Furthermore, we apply Galileo to two pharmacophore searches in the REAL Space, detecting hundreds of compounds fulfilling a HSP90 and a FXIa pharmacophore.
Collapse
Affiliation(s)
- Christian Meyenburg
- Universität Hamburg, ZBH - Center for Bioinformatics, Universität Hamburg, Bundesstraße 43, 20146, Hamburg, Germany
| | - Uschi Dolfus
- Universität Hamburg, ZBH - Center for Bioinformatics, Universität Hamburg, Bundesstraße 43, 20146, Hamburg, Germany
| | - Hans Briem
- Research & Development, Pharmaceuticals, Computational Molecular Design Berlin, Bayer AG, Building S110, 711, 13342, Berlin, Germany
| | - Matthias Rarey
- Universität Hamburg, ZBH - Center for Bioinformatics, Universität Hamburg, Bundesstraße 43, 20146, Hamburg, Germany.
| |
Collapse
|
4
|
Wang PH, Chen JH, Tseng YJ. Intelligent pharmaceutical patent search on a near-term gate-based quantum computer. Sci Rep 2022; 12:175. [PMID: 34997034 PMCID: PMC8742058 DOI: 10.1038/s41598-021-04031-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Accepted: 12/14/2021] [Indexed: 12/03/2022] Open
Abstract
Pharmaceutical patent analysis is the key to product protection for pharmaceutical companies. In patent claims, a Markush structure is a standard chemical structure drawing with variable substituents. Overlaps between apparently dissimilar Markush structures are nearly unrecognizable when the structures span a broad chemical space. We propose a quantum search-based method which performs an exact comparison between two non-enumerated Markush structures with a constraint satisfaction oracle. The quantum circuit is verified with a quantum simulator and the real effect of noise is estimated using a five-qubit superconductivity-based IBM quantum computer. The possibilities of measuring the correct states can be increased by improving the connectivity of the most computation intensive qubits. Depolarizing error is the most influential error. The quantum method to exactly compares two patents is hard to simulate classically and thus creates a quantum advantage in patent analysis.
Collapse
Affiliation(s)
- Pei-Hua Wang
- Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, No. 1 Sec. 4, Roosevelt Road, Taipei, 106, Taiwan
| | - Jen-Hao Chen
- Department of Computer Science and Information Engineering, National Taiwan University, No. 1 Sec. 4, Roosevelt Road, Taipei, 106, Taiwan.,Chunghwa Telecom Co., Ltd, Taipei, 106, Taiwan
| | - Yufeng Jane Tseng
- Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, No. 1 Sec. 4, Roosevelt Road, Taipei, 106, Taiwan. .,Department of Computer Science and Information Engineering, National Taiwan University, No. 1 Sec. 4, Roosevelt Road, Taipei, 106, Taiwan.
| |
Collapse
|
5
|
Friedrich NO, Flachsenberg F, Meyder A, Sommer K, Kirchmair J, Rarey M. Conformator: A Novel Method for the Generation of Conformer Ensembles. J Chem Inf Model 2019; 59:731-742. [PMID: 30747530 DOI: 10.1021/acs.jcim.8b00704] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Computer-aided drug design methods such as docking, pharmacophore searching, 3D database searching, and the creation of 3D-QSAR models need conformational ensembles to handle the flexibility of small molecules. Here, we present Conformator, an accurate and effective knowledge-based algorithm for generating conformer ensembles. With 99.9% of all test molecules processed, Conformator stands out by its robustness with respect to input formats, molecular geometries, and the handling of macrocycles. With an extended set of rules for sampling torsion angles, a novel algorithm for macrocycle conformer generation, and a new clustering algorithm for the assembly of conformer ensembles, Conformator reaches a median minimum root-mean-square deviation (measured between protein-bound ligand conformations and ensembles of a maximum of 250 conformers) of 0.47 Å with no significant difference to the highest-ranked commercial algorithm OMEGA and significantly higher accuracy than seven free algorithms, including the RDKit DG algorithm. Conformator is freely available for noncommercial use and academic research.
Collapse
Affiliation(s)
- Nils-Ole Friedrich
- Center for Bioinformatics , Universität Hamburg , Bundesstrasse 43 , 20146 Hamburg , Germany
| | - Florian Flachsenberg
- Center for Bioinformatics , Universität Hamburg , Bundesstrasse 43 , 20146 Hamburg , Germany
| | - Agnes Meyder
- Center for Bioinformatics , Universität Hamburg , Bundesstrasse 43 , 20146 Hamburg , Germany
| | - Kai Sommer
- Center for Bioinformatics , Universität Hamburg , Bundesstrasse 43 , 20146 Hamburg , Germany
| | - Johannes Kirchmair
- Center for Bioinformatics , Universität Hamburg , Bundesstrasse 43 , 20146 Hamburg , Germany.,Department of Chemistry , University of Bergen , N-5020 Bergen , Norway.,Computational Biology Unit (CBU) , University of Bergen , N-5020 Bergen , Norway
| | - Matthias Rarey
- Center for Bioinformatics , Universität Hamburg , Bundesstrasse 43 , 20146 Hamburg , Germany
| |
Collapse
|
6
|
Sommer K, Flachsenberg F, Rarey M. NAOMInext – Synthetically feasible fragment growing in a structure-based design context. Eur J Med Chem 2019; 163:747-762. [DOI: 10.1016/j.ejmech.2018.11.075] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 11/27/2018] [Accepted: 11/30/2018] [Indexed: 12/31/2022]
|
7
|
Abstract
Although significant advances in experimental high throughput screening (HTS) have been made for drug lead identification, in silico virtual screening (VS) is indispensable owing to its unique advantage over experimental HTS, target-focused, cheap, and efficient, albeit its disadvantage of producing false positive hits. For both experimental HTS and VS, the quality of screening libraries is crucial and determines the outcome of those studies. In this paper, we first reviewed the recent progress on screening library construction. We realized the urgent need for compiling high-quality screening libraries in drug discovery. Then we compiled a set of screening libraries from about 20 million druglike ZINC molecules by running fingerprint-based similarity searches against known drug molecules. Lastly, the screening libraries were objectively evaluated using 5847 external actives covering more than 2000 drug targets. The result of the assessment is very encouraging. For example, with the Tanimoto coefficient being set to 0.75, 36% of external actives were retrieved and the enrichment factor was 13. Additionally, drug target family specific screening libraries were also constructed and evaluated. The druglike screening libraries are available for download from https://mulan.pharmacy.pitt.edu .
Collapse
Affiliation(s)
- Junmei Wang
- Department of Pharmaceutical Sciences , The University of Pittsburgh , 3501 Terrace Street , Pittsburgh , Pennsylvania 15261 , United States
| | - Yubin Ge
- Department of Pharmaceutical Sciences , The University of Pittsburgh , 3501 Terrace Street , Pittsburgh , Pennsylvania 15261 , United States
| | - Xiang-Qun Xie
- Department of Pharmaceutical Sciences , The University of Pittsburgh , 3501 Terrace Street , Pittsburgh , Pennsylvania 15261 , United States
| |
Collapse
|
8
|
Lauck F, Rarey M. FSees: Customized Enumeration of Chemical Subspaces with Limited Main Memory Consumption. J Chem Inf Model 2016; 56:1641-53. [DOI: 10.1021/acs.jcim.6b00117] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Affiliation(s)
- Florian Lauck
- ZBH - Center for Bioinformatics, University of Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany
| | - Matthias Rarey
- ZBH - Center for Bioinformatics, University of Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany
| |
Collapse
|
9
|
Kumar A, Zhang KYJ. Prospective evaluation of shape similarity based pose prediction method in D3R Grand Challenge 2015. J Comput Aided Mol Des 2016; 30:685-693. [DOI: 10.1007/s10822-016-9931-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2016] [Accepted: 07/25/2016] [Indexed: 01/23/2023]
|
10
|
Kumar A, Zhang KYJ. Hierarchical virtual screening approaches in small molecule drug discovery. Methods 2015; 71:26-37. [PMID: 25072167 PMCID: PMC7129923 DOI: 10.1016/j.ymeth.2014.07.007] [Citation(s) in RCA: 98] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Revised: 07/16/2014] [Accepted: 07/17/2014] [Indexed: 02/06/2023] Open
Abstract
Virtual screening has played a significant role in the discovery of small molecule inhibitors of therapeutic targets in last two decades. Various ligand and structure-based virtual screening approaches are employed to identify small molecule ligands for proteins of interest. These approaches are often combined in either hierarchical or parallel manner to take advantage of the strength and avoid the limitations associated with individual methods. Hierarchical combination of ligand and structure-based virtual screening approaches has received noteworthy success in numerous drug discovery campaigns. In hierarchical virtual screening, several filters using ligand and structure-based approaches are sequentially applied to reduce a large screening library to a number small enough for experimental testing. In this review, we focus on different hierarchical virtual screening strategies and their application in the discovery of small molecule modulators of important drug targets. Several virtual screening studies are discussed to demonstrate the successful application of hierarchical virtual screening in small molecule drug discovery.
Collapse
Affiliation(s)
- Ashutosh Kumar
- Structural Bioinformatics Team, Center for Life Science Technologies, RIKEN, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan
| | - Kam Y J Zhang
- Structural Bioinformatics Team, Center for Life Science Technologies, RIKEN, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa 230-0045, Japan.
| |
Collapse
|