501
|
Abstract
Recent developments in combinatorial chemistry and high-throughput screening have dramatically increased the scale on which drug discovery programs are carried out. Along with these advances has come a need for automated methods of determining which compounds from a library should be synthesized and screened. These methods range from simple counting schemes to sophisticated machine learning techniques such as neural networks. While many of these methods have performed well in validation studies, the field is still in its formative stage. This paper reviews a number of computational techniques for identifying drug-like molecules and examines challenges facing the field.
Collapse
Affiliation(s)
- W Patrick Walters
- Vertex Pharmaceuticals, 130 Waverly Street, 02139, Cambridge, MA 02139, USA.
| | | |
Collapse
|
502
|
de Jong MR, Knegtel RMA, Grootenhuis PDJ, Huskens J, Reinhoudt DN. A Method To Identify and Screen Libraries of Guests That Complex to a Synthetic Host. Angew Chem Int Ed Engl 2002. [DOI: 10.1002/1521-3757(20020315)114:6<1046::aid-ange1046>3.0.co;2-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
503
|
de Jong MR, Knegtel RMA, Grootenhuis PDJ, Huskens J, Reinhoudt DN. A method to identify and screen libraries of guests that complex to a synthetic host. Angew Chem Int Ed Engl 2002; 41:1004-8. [PMID: 12491294 DOI: 10.1002/1521-3773(20020315)41:6<1004::aid-anie1004>3.0.co;2-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Menno R de Jong
- Laboratory of Supramolecular Chemistry and Technology, MESA(+) Research Institute, University of Twente, PO Box 217, 7500 AE Enschede, Netherlands
| | | | | | | | | |
Collapse
|
504
|
Bräse S, Neuß B. Glossar von Begriffen der Kombinatorischen Chemie. Angew Chem Int Ed Engl 2002. [DOI: 10.1002/1521-3757(20020301)114:5<893::aid-ange893>3.0.co;2-s] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
505
|
Abstract
Chemical databases are becoming a powerful tool in drug discovery. Database searches based on possible requirements for biological activity can identify compounds that might be suitable for further analysis or indicate novel ways to achieve the desired activity. What considerations are involved in the construction and searching of chemical databases?
Collapse
Affiliation(s)
- Mitchell A Miller
- LION Bioscience, 9880 Campus Point Drive, San Diego, California 92121, USA.
| |
Collapse
|
506
|
Abstract
Virtual screening (VS) methods have emerged as an adaptive response to massive throughput synthesis and screening technologies. Based on the structure-permeability paradigm, the Lipinski rule of five has become a standard property filtering protocol for VS. Three possible VS scenarios with respect to optimising binding affinity and pharmacokinetic properties are discussed. The parsimony principle for selecting candidate leads for further optimisation is advocated.
Collapse
|
507
|
Peng JW, Lepre CA, Fejzo J, Abdul-Manan N, Moore JM. Nuclear magnetic resonance-based approaches for lead generation in drug discovery. Methods Enzymol 2002; 338:202-30. [PMID: 11460549 DOI: 10.1016/s0076-6879(02)38221-1] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Affiliation(s)
- J W Peng
- Vertex Pharmaceuticals Incorporated, Cambridge, Massachusetts 02139, USA
| | | | | | | | | |
Collapse
|
508
|
Cheng A, Diller DJ, Dixon SL, Egan WJ, Lauri G, Merz KM. Computation of the physio-chemical properties and data mining of large molecular collections. J Comput Chem 2002; 23:172-83. [PMID: 11913384 DOI: 10.1002/jcc.1164] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Very large data sets of molecules screened against a broad range of targets have become available due to the advent of combinatorial chemistry. This information has led to the realization that ADME (absorption, distribution, metabolism, and excretion) and toxicity issues are important to consider prior to library synthesis. Furthermore, these large data sets provide a unique and important source of information regarding what types of molecular shapes may interact with specific receptor or target classes. Thus, the requirement for rapid and accurate data mining tools became paramount. To address these issues Pharmacopeia, Inc. formed a computational research group, The Center for Informatics and Drug Discovery (CIDD).* In this review we cover the work done by this group to address both in silico ADME modeling and data mining issues faced by Pharmacopeia because of the availability of a large and diverse collection (over 6 million discrete compounds) of drug-like molecules. In particular, in the data mining arena we discuss rapid docking tools and how we employ them, and we describe a novel data mining tool based on a ID representation of a molecule followed by a molecular sequence alignment step. For the ADME area we discuss the development and application of absorption, blood-brain barrier (BBB) and solubility models. Finally, we summarize the impact the tools and approaches might have on the drug discovery process.
Collapse
Affiliation(s)
- Ailan Cheng
- Pharmacopeia, Inc., Princeton, New Jersey 08543-5350, USA
| | | | | | | | | | | |
Collapse
|
509
|
Roche O, Schneider P, Zuegge J, Guba W, Kansy M, Alanine A, Bleicher K, Danel F, Gutknecht EM, Rogers-Evans M, Neidhart W, Stalder H, Dillon M, Sjögren E, Fotouhi N, Gillespie P, Goodnow R, Harris W, Jones P, Taniguchi M, Tsujii S, von der Saal W, Zimmermann G, Schneider G. Development of a virtual screening method for identification of "frequent hitters" in compound libraries. J Med Chem 2002; 45:137-42. [PMID: 11754585 DOI: 10.1021/jm010934d] [Citation(s) in RCA: 208] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A computer-based method was developed for rapid and automatic identification of potential "frequent hitters". These compounds show up as hits in many different biological assays covering a wide range of targets. A scoring scheme was elaborated from substructure analysis, multivariate linear and nonlinear statistical methods applied to several sets of one and two-dimensional molecular descriptors. The final model is based on a three-layered neural network, yielding a predictive Matthews correlation coefficient of 0.81. This system was able to correctly classify 90% of the test set molecules in a 10-times cross-validation study. The method was applied to database filtering, yielding between 8% (compilation of trade drugs) and 35% (Available Chemicals Directory) potential frequent hitters. This filter will be a valuable tool for the prioritization of compounds from large databases, for compound purchase and biological testing, and for building new virtual libraries.
Collapse
|
510
|
|
511
|
Abstract
Recent advances in high-throughput protein structure determination and in computational chemistry have refocused attention on virtual screening and fast automated docking methods. This review provides a brief introduction to the basic ideas and outlines computational tools currently used. We also provide several examples of where virtual screening has proved successful, highlighting the usefulness of the approach.
Collapse
Affiliation(s)
- Gisbert Schneider
- F. Hoffmann-La Roche, Pharmaceuticals Division, CH-4070 Basel, Switzerland.
| | | |
Collapse
|
512
|
Blundell TL, Jhoti H, Abell C. High-throughput crystallography for lead discovery in drug design. Nat Rev Drug Discov 2002; 1:45-54. [PMID: 12119609 DOI: 10.1038/nrd706] [Citation(s) in RCA: 332] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Knowledge of the three-dimensional structures of protein targets now emerging from genomic data has the potential to accelerate drug discovery greatly. X-ray crystallography is the most widely used technique for protein structure determination, but technical challenges and time constraints have traditionally limited its use primarily to lead optimization. Here, we describe how significant advances in process automation and informatics have aided the development of high-throughput X-ray crystallography, and discuss the use of this technique for structure-based lead discovery.
Collapse
Affiliation(s)
- Tom L Blundell
- Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge CB2 1GA, UK.
| | | | | |
Collapse
|
513
|
Abstract
Docking functions are believed to be the essential component of docking algorithms. Both physically and statistically based functions have been proposed, but there is no consensus about their relative performances. Here, we propose an evaluation approach based on exhaustive enumeration of all possible docking solutions obtained with a discretized description of a rigid docking process. We apply the approach to study both molecular mechanics and statistical potentials. It is found that the statistical potential evaluated is less effective than the AMBER molecular mechanics function to provide an accurate description of the docking process when the exact experimental coordinates are used. However, when coordinates of crystal structures obtained with analogous ligands are used, similar performances are obtained in both cases. Possible reasons for the successes and failures of both docking schemes have been uncovered using linear discriminant analysis, on the basis of a set of physicochemical descriptors capturing the main physical effects at play during protein-ligand docking. In both types of potentials steric effects appear critical to obtain a successful docking. Our results also indicate that neglecting desolvation effects and the explicit treatment of hydrogen bonds are the main source of the failures observed with the molecular mechanics potential. On the other hand, detailed consideration of steric interactions, with a careful treatment of dispersive forces, seems to be needed when using statistical potentials derived from a structural database. The possibility of filtering combinatorial libraries in order to maximize the probability of correct docking is discussed.
Collapse
Affiliation(s)
- C Pérez
- Department of Physiology & Biophysics, Mount Sinai School of Medicine, New York University, One Gustave Levy Plaza, Box 1218, New York, New York 10029, USA
| | | |
Collapse
|
514
|
Ravi M, Hopfinger AJ, Hormann RE, Dinan L. 4D-QSAR analysis of a set of ecdysteroids and a comparison to CoMFA modeling. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2001; 41:1587-604. [PMID: 11749586 DOI: 10.1021/ci010076u] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The ecdysteroid-responsive Drosophila melanogaster B(II) cell line is a prototypical homologous inducible gene expression system. A training set of 71 ecdysteroids, for which the -log(EC(50)) potencies in the ecdysteroid-responsive B(II) cell line were measured, was used to construct 4D-QSAR models. Four nearly equivalent optimum 4D-QSAR models, for two modestly different alignments, were identified (Q(2) = 0.76-0.80). These four models, together with two CoMFA models, were used in consensus modeling to arrive at a three-dimensional pharmacophore. The C-2 and C-22 hydroxyls are identified as hydrogen-bond acceptor sites which enhance activity. A hydrophobic site near C-12 is consistent with increasing activity. The side-chain substituents at C-17 are predicted to adopt semiextended "active" conformations which could fit into a cylinder-shaped binding pocket lined largely with nonpolar residues for enhanced activity. A test set of 20 ecdysteroids was used to evaluate the QSAR models. Two 4D-QSAR models for one alignment were identified to be superior to the others based on having the smallest average residuals of prediction for the prediction set (0.69 and 1.13 -log[EC(50)] units). The correlation coefficients of the optimum 4D-QSAR models (R(2) = 0.87 and 0.88) are nearly the same as those of the best CoMFA model (R(2) = 0.92) determined for the same training set. However, the cross-validation correlation coefficient of the CoMFA model is less significant (Q(2) = 0.59) than those of the 4D-QSAR models (Q(2) = 0.80 and 0.80).
Collapse
Affiliation(s)
- M Ravi
- Laboratory of Molecular Modeling and Design (M/C-781), College of Pharmacy, University of Illinois at Chicago, 833 South Wood Street, Chicago, IL 60612-7231, USA
| | | | | | | |
Collapse
|
515
|
Manly CJ, Louise-May S, Hammer JD. The impact of informatics and computational chemistry on synthesis and screening. Drug Discov Today 2001; 6:1101-1110. [PMID: 11677167 DOI: 10.1016/s1359-6446(01)01990-0] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
High-throughput synthesis and screening technologies have enhanced the impact of computational chemistry on the drug discovery process. From the design of targeted, drug-like libraries to 'virtual' optimization of potency, selectivity and ADME/Tox properties, computational chemists are able to efficiently manage costly resources and dramatically shorten drug discovery cycle times. This review will describe some of the successful strategies and applications of state-of-the-art algorithms to enhance drug discovery, as well as key points in the drug discovery process where computational methods can have, and have had, greatest impact.
Collapse
Affiliation(s)
- Charles J. Manly
- Neurogen Corporation, 35 Northeast Industrial Rd, 06405, Branford, CT, USA
| | | | | |
Collapse
|
516
|
Roche O, Kiyama R, Brooks CL. Ligand-protein database: linking protein-ligand complex structures to binding data. J Med Chem 2001; 44:3592-8. [PMID: 11606123 DOI: 10.1021/jm000467k] [Citation(s) in RCA: 100] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
In computational structure-based drug design, the scoring functions are the cornerstones to the success of design/discovery. Many approaches have been explored to improve their reliability and accuracy, leading to three families of scoring functions: force-field-based, knowledge-based, and empirical. The last family is the most widely used in association with docking algorithms because of its speed, even though such empirical scoring functions produce far too many false positives to be fully reliable. In this work, we describe a World Wide Web accessible database that gathers the structural information from known complexes of the PDB with experimental binding data. This database, the Ligand-Protein DataBase (LPDB), is designed to allow the selection of complexes based on various properties of receptors and ligands for the design and parametrization of new scoring functions or to assess and improve existing ones. Moreover, for each complex, a continuum of ligand positions ranging from the crystallographic position to points on the surface of the protein receptor allows an assessment of the energetic behavior of particular scoring functions.
Collapse
Affiliation(s)
- O Roche
- Department of Molecular Biology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, USA
| | | | | |
Collapse
|
517
|
Pearlman DA, Charifson PS. Are free energy calculations useful in practice? A comparison with rapid scoring functions for the p38 MAP kinase protein system. J Med Chem 2001; 44:3417-23. [PMID: 11585447 DOI: 10.1021/jm0100279] [Citation(s) in RCA: 169] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Precise thermodynamic integration free energy simulations have been applied to a congeneric series of 16 inhibitors to the p38 MAP kinase protein for which the experimental binding data (IC(50)) is known. The relative free energy of binding for each compound has been determined. For comparison, the same series of compounds have also scored using the best rapid scoring functions used in database screening. From the results of these calculations, we find (1) that precise free energy simulations allow predictions that are reliable and in good agreement with experiment; (2) that predictions of lower reliability, but still in good qualitative agreement with experiment, can be obtained using the OWFEG free energy grid method, at a much lower computational cost; (3) and that other methods, not based on free energy simulations yield results in much poorer agreement with experiment. A new predictive index, which measures the reliability of a prediction method in the context of normal use, is defined and calculated for each scoring method. Predictive indices of 0.84, 0.56, 0.04, -0.05, and 0.25 are calculated for thermodynamic integration, OWFEG, ChemScore, PLPScore, and Dock Energy Score, respectively, where +1.0 is perfect correct prediction, -1.0 is perfect incorrect prediction, and 0.0 is random.
Collapse
Affiliation(s)
- D A Pearlman
- Vertex Pharmaceuticals Incorporated, 130 Waverly Street, Cambridge, Massachusetts 02139-4242, USA.
| | | |
Collapse
|
518
|
|
519
|
Abstract
Following the recent sequencing of the human genome, the focus has shifted from the DNA level to the protein level, with the goal of elucidating function. Technical developments in x-ray crystallography mean that the crystal structures of these new proteins can now been determined at an unprecedented rate, which assists in functional analysis and rational drug-design programmes.
Collapse
Affiliation(s)
- H Jhoti
- Astex Technology, Cambridge, UK.
| |
Collapse
|
520
|
Wang R, Wang S. How does consensus scoring work for virtual library screening? An idealized computer experiment. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2001; 41:1422-6. [PMID: 11604043 DOI: 10.1021/ci010025x] [Citation(s) in RCA: 203] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
It has been reported recently that consensus scoring, which combines multiple scoring functions in binding affinity estimation, leads to higher hit-rates in virtual library screening studies. This method seems quite independent to the target receptor, the docking program, or even the scoring functions under investigation. Here we present an idealized computer experiment to explore how consensus scoring works. A hypothetical set of 5000 compounds is used to represent a chemical library under screening. The binding affinities of all its member compounds are assigned by mimicking a real situation. Based on the assumption that the error of a scoring function is a random number in a normal distribution, the predicted binding affinities were generated by adding such a random number to the "observed" binding affinities. The relationship between the hit-rates and the number of scoring functions employed in scoring was then investigated. The performance of several typical ranking strategies for a consensus scoring procedure was also explored. Our results demonstrate that consensus scoring outperforms any single scoring for a simple statistical reason: the mean value of repeated samplings tends to be closer to the true value. Our results also suggest that a moderate number of scoring functions, three or four, are sufficient for the purpose of consensus scoring. As for the ranking strategy, both the rank-by-number and the rank-by-rank strategy work more effectively than the rank-by-vote strategy.
Collapse
Affiliation(s)
- R Wang
- Institute of Cognitive and Computational Science, Department of Oncology, Georgetown University Medical Center, 4000 Reservoir Road, Washington, DC 20007
| | | |
Collapse
|
521
|
Aronov AM, Munagala NR, Kuntz ID, Wang CC. Virtual screening of combinatorial libraries across a gene family: in search of inhibitors of Giardia lamblia guanine phosphoribosyltransferase. Antimicrob Agents Chemother 2001; 45:2571-6. [PMID: 11502531 PMCID: PMC90694 DOI: 10.1128/aac.45.9.2571-2576.2001] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2001] [Accepted: 06/18/2001] [Indexed: 11/20/2022] Open
Abstract
Parasitic protozoa lack the ability to synthesize purine nucleotides de novo, relying instead on purine salvage enzymes for their survival. Guanine phosphoribosyltransferase (GPRT) from the protozoan parasite Giardia lamblia is a potential target for rational antiparasitic drug design, based on the experimental evidence, which indicates the lack of interconversion between adenine and guanine nucleotide pools. The present study is a continuation of our efforts to use three-dimensional structures of parasitic phosphoribosyltransferases (PRTs) to design novel antiparasitic agents. Two micromolar phthalimide-based GPRT inhibitors were identified by screening the in-house phthalimide library. A combination of structure-based scaffold selection using virtual library screening across the PRT gene family and solid phase library synthesis led to identification of smaller (molecular weight, <300) ligands with moderate to low specificity for GPRT; the best inhibitors, GP3 and GP5, had K(i) values in the 23 to 25 microM range. These results represent significant progress toward the goal of designing potent inhibitors of purine salvage in Giardia parasites. As a second step in this process, altering the phthalimide moiety to optimize interactions in the guanine-binding pocket of GPRT is expected to lead to compounds with promising activity against G. lamblia PRT.
Collapse
Affiliation(s)
- A M Aronov
- Department of Pharmaceutical Chemistry, University of California, San Francisco, California 94143-0446, USA
| | | | | | | |
Collapse
|
522
|
Abstract
Recent improvements in flexible docking technology may lead to a bigger role for computational methods in lead discovery. Although fast and accurate computational prediction of binding affinities for an arbitrary molecule is still beyond the limits of current methods, the docking and screening procedures can select small sets of likely lead candidates from large libraries of either commercially or synthetically available compounds.
Collapse
Affiliation(s)
- R Abagyan
- Department of Molecular Biology, The Scripps Research Institute, 10550 North Torrey Pines, TCP-28, La Jolla, CA 92037, USA.
| | | |
Collapse
|
523
|
Roberts SA. High-throughput screening approaches for investigating drug metabolism and pharmacokinetics. Xenobiotica 2001; 31:557-89. [PMID: 11569526 DOI: 10.1080/00498250110060978] [Citation(s) in RCA: 81] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
1. High-throughput screening approaches have been adopted throughout the pharmaceutical industry to aid in the rapid discovery of new chemical entities. Because it is now well recognized that the selection of a robust candidate requires a balance of potency, safety and pharmacokinetics, the role of drug metabolism departments has widened from their traditional one of supporting drug development to include the screening of compounds during the discovery process. To put drug metabolism and pharmacokinetic (DMPK) studies in context, the evolving role of DMPK screening in the drug discovery strategy of pharmaceutical companies will be discussed and a generalized approach will be presented. 2. With the increasing numbers of compounds requiring screening, DMPK optimization methods have had to be adapted for high throughput. There have been many developments in this field over the past decade and this review will focus on the high-throughput DMPK screening methodologies used today and in the recent past. 3. In vitro and in silico (computer-based) methods have proven most amenable to high-throughput approaches and these will firm the bulk of the review, but some advances with in vivo methods will also be discussed. As there has been a vast increase in published material on the topic of high-throughput DMPK methodologies in the past 10 years, it would be impossible to cover every method in detail, so this review will concentrate on the key areas and refer the reader to other, more detailed reviews wherever possible. 4. Most high-throughput methods would not be possible without the enabling technologies of computing, automation, new sample preparation technologies, and highly sensitive and selective detection systems, and these will also be reviewed. 5. The advantages and disadvantages of the screening methods will be presented, in particular the issue of handling the false-positives and -negatives that arise. 6. In concluding the review, future developments in this field will be discussed along with key issues that will need to be addressed.
Collapse
Affiliation(s)
- S A Roberts
- Drug Metabolism and Pharmacokinetics, Celltech Research and Development Ltd, Great Abington, Cambridge, UK.
| |
Collapse
|
524
|
Caron PR, Mullican MD, Mashal RD, Wilson KP, Su MS, Murcko MA. Chemogenomic approaches to drug discovery. Curr Opin Chem Biol 2001; 5:464-70. [PMID: 11470611 DOI: 10.1016/s1367-5931(00)00229-5] [Citation(s) in RCA: 126] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- P R Caron
- Vertex Pharmaceuticals Incorporated, 130 Waverly Street, Cambridge, MA 02139, USA.
| | | | | | | | | | | |
Collapse
|
525
|
Abstract
A simple pharmacophore point filter has been developed that discriminates between drug-like and nondrug-like chemical matter. It is based on the observation that nondrugs are often underfunctionalized. Therefore, a minimum count of well-defined pharmacophore points is required to pass the filter. The application of the filter results in 66-69% of subsets of the MDDR database to be classified as drug-like. Furthermore, 61-68% of subsets of the CMC database are classified as drug-like. In contrast, only 36% of the ACD are found to be drug-like. While these results are not quite as good as those obtained with recently described neural net approaches, the method used here has clear advantages. In contrast to a neural net approach and also in contrast to decision tree methods described recently, the pharmacophore filter has been developed by using "chemical wisdom" that is unbiased from fitting the structural content of specific drug databases to prediction models. Similar to decision tree methods, the pharmacophore point filter provides a detailed structural reason for the classification of each molecule as drug or nondrug. The pharmacophore point filter results are compared to neural net filter results. A statistically significant overlap between compounds recognized as drug-like validates both approaches. The pharmacophore point filter complements neural net approaches as well as property profiling approaches used as drug-likeness filters in compound library analysis and design.
Collapse
Affiliation(s)
- I Muegge
- Bayer Research Center, 400 Morgan Lane, West Haven, Connecticut 06516, USA.
| | | | | |
Collapse
|
526
|
Abstract
The transition from slow, manual, low-throughput screening to industrialized robotic ultra-high throughput screening (uHTS) in the past few years has made it possible to screen hundreds of thousands of chemical entities against a biological target in a short time-frame. The need to minimize the cost of screening has been addressed primarily by reducing the volume of sample to be screened. This, in turn, has resulted in the miniaturization of HTS technology as a whole. Miniaturization requires new technologies and strategies for compound handling, assay development, assay adaptation, liquid handling and automation in addition to refinement of the technologies used for detection systems and data management. This review summarizes current trends in the field of uHTS and illustrates the technological developments that are necessary to enable the routine application of miniaturized uHTS systems within an industrial environment.
Collapse
Affiliation(s)
- J Wölcke
- Drug Discovery Services, Screening Operations, Evotec OAI, Schnackenburgallee 114, D-22525, Hamburg, Germany
| | | |
Collapse
|
527
|
Olender R, Rosenfeld R. A fast algorithm for searching for molecules containing a pharmacophore in very large virtual combinatorial libraries. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2001; 41:731-8. [PMID: 11410053 DOI: 10.1021/ci000463o] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We present a new algorithm for identifying molecules that display a pharmacophore, or in general a structural motif, by efficiently constructing and screening huge virtual combinatorial libraries of diverse compounds. The uniqueness of this algorithm is its ability to build and screen libraries of ca. 10(18) 3D molecular conformations within a reasonable time scale, thereby increasing the chemical space that can be virtually screened by many orders of magnitude. The algorithm may be used to design new molecules that display a desired pharmacophore on predefined sets of chemical scaffolds. This is demonstrated herein by screening a library of backbone cyclic peptides to find candidate peptido- and proteinomimetics.
Collapse
Affiliation(s)
- R Olender
- Molecular Modeling Group, Peptor Ltd., Kiryat Weizmann 16, Rehovot 76326, Israel.
| | | |
Collapse
|
528
|
Abt M, Lim Y, Sacks J, Xie M, Young SS. Sequential Approach for Identifying Lead Compounds in Large Chemical Databases. Stat Sci 2001. [DOI: 10.1214/ss/1009213288] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
529
|
Xue L, Stahura FL, Godden JW, Bajorath J. Fingerprint scaling increases the probability of identifying molecules with similar activity in virtual screening calculations. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2001; 41:746-53. [PMID: 11410055 DOI: 10.1021/ci000311t] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Results of systematic virtual screening calculations using a structural key-type fingerprint are reported for compounds belonging to 14 activity classes added to randomly selected synthetic molecules. For each class, a fingerprint profile was calculated to monitor the relative occupancy of fingerprint bit positions. Consensus bit patterns were determined consisting of all bits that were always set on in compounds belonging to a specific activity class. In virtual screening calculations, scale factors were applied to each consensus bit position in fingerprints of query molecules. This technique, called "fingerprint scaling", effectively increases the weight of consensus bit positions in fingerprint comparisons. Although overall prediction accuracy was satisfactory using unscaled calculations, scaling significantly increased the number of correct predictions but only slightly increased the rate of false positives. These observations suggest that fingerprint scaling is an attractive approach to increase the probability of identifying molecules with similar activity by virtual screening. It requires the availability of a series of related compounds and can be easily applied to any keyed fingerprint representation that associates bit positions with specific molecular features.
Collapse
Affiliation(s)
- L Xue
- New Chemical Entities, Inc., 18804 North Creek Parkway, Suite 100, Bothell, Washington 98011, USA
| | | | | | | |
Collapse
|
530
|
Logean A, Sette A, Rognan D. Customized versus universal scoring functions: application to class I MHC-peptide binding free energy predictions. Bioorg Med Chem Lett 2001; 11:675-9. [PMID: 11266167 DOI: 10.1016/s0960-894x(01)00021-x] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
A tailor-made free energy scoring method (Fresno) has been compared to six universal scoring functions (Chemscore, Dock, FlexX, Gold, Pmf, Score) for predicting the binding affinity of 26 peptides to the class I human major histocompatibility protein HLA-B*2705. Fresno clearly outperforms all six universal scoring functions.
Collapse
Affiliation(s)
- A Logean
- Laboratoire de Pharmacochimie de la Communication Cellulaire, UMR 7081, Illkirch, France
| | | | | |
Collapse
|
531
|
Bajorath J. Selected concepts and investigations in compound classification, molecular descriptor analysis, and virtual screening. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2001; 41:233-45. [PMID: 11277704 DOI: 10.1021/ci0001482] [Citation(s) in RCA: 136] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Affiliation(s)
- J Bajorath
- New Chemical Entities, Inc., 18804 North Creek Parkway, Suite 100, Bothell, Washington 98011, USA.
| |
Collapse
|
532
|
Pearlman DA, Charifson PS. Improved scoring of ligand-protein interactions using OWFEG free energy grids. J Med Chem 2001; 44:502-11. [PMID: 11170640 DOI: 10.1021/jm000375v] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A new approach to rapidly score protein-ligand interactions is tested on several protein-ligand systems. Results using this approach - the OWFEG free energy grid - are quite promising and are generally in better agreement with experiment (in some cases much better) than those obtained employing scoring techniques currently in wide use. The OWFEG free energy grid is generated from a one-window free energy perturbation MD simulation (Pearlman, D. A. J. Med. Chem. 1999, 42, 4313-4324). The OWFEG approach is applied to three protein systems: IMPDH, MAP kinase p38, and HIV-1 aspartyl protease. OWFEG scores are compared to experimental K(i) and IC50 data in each case. Empirical scoring functions applied to the same systems for comparison include ChemScore, Piecewise Linear Potential (PLP), and Dock energy score.
Collapse
Affiliation(s)
- D A Pearlman
- Vertex Pharmaceuticals Incorporated, 130 Waverly Street, Cambridge, Massachusetts 02139-4242, USA.
| | | |
Collapse
|
533
|
Lamb ML, Burdick KW, Toba S, Young MM, Skillman AG, Zou X, Arnold JR, Kuntz ID. Design, docking, and evaluation of multiple libraries against multiple targets. Proteins 2001; 42:296-318. [PMID: 11151003 DOI: 10.1002/1097-0134(20010215)42:3<296::aid-prot20>3.0.co;2-f] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
We present a general approach to the design, docking, and virtual screening of multiple combinatorial libraries against a family of proteins. The method consists of three main stages: docking the scaffold, selecting the best substituents at each site of diversity, and comparing the resultant molecules within and between the libraries. The core "divide-and-conquer" algorithm for side-chain selection, developed from an earlier version (Sun et al., J Comp Aided Mol Design 1998;12:597-604), provides a way to explore large lists of substituents with linear rather than combinatorial time dependence. We have applied our method to three combinatorial libraries and three serine proteases: trypsin, chymotrypsin, and elastase. We show that the scaffold docking procedure, in conjunction with a novel vector-based orientation filter, reproduces crystallographic binding modes. In addition, the free-energy-based scoring procedure (Zou et al., J Am Chem Soc 1999;121:8033-8043) is able to reproduce experimental binding data for P1 mutants of macromolecular protease inhibitors. Finally, we show that our method discriminates between a peptide library and virtual libraries built on benzodiazepine and tetrahydroisoquinolinone scaffolds. Implications of the docking results for library design are explored.
Collapse
Affiliation(s)
- M L Lamb
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, California, USA
| | | | | | | | | | | | | | | |
Collapse
|
534
|
Abstract
Recent advances in NMR-based screening methods have made it possible to screen larger libraries of molecules with higher throughput. However, experience shows that intelligent library design is important if NMR screening is to succeed in aiding our discovery of potent and useful lead compounds. This review presents the current state-of-the-art methodologies for designing primary and follow-up libraries for NMR screening. Diversity, drug-likeness and combinatorial libraries are discussed, and the inherent pitfalls of the NMR approach are addressed.
Collapse
Affiliation(s)
- C A. Lepre
- Vertex Pharmaceuticals, 130 Waverly Street, 02139-4242, Cambridge, MA, USA
| |
Collapse
|
535
|
Grüneberg S, Wendt B, Klebe G. Subnanomolar wirksame Inhibitoren aus dem Computerscreening: eine Modellstudie an der humanen Carboanhydrase II. Angew Chem Int Ed Engl 2001. [DOI: 10.1002/1521-3757(20010119)113:2<404::aid-ange404>3.0.co;2-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
536
|
Grüneberg S, Wendt B, Klebe G. Subnanomolar Inhibitors from Computer Screening: A Model Study Using Human Carbonic Anhydrase II. Angew Chem Int Ed Engl 2001; 40:389-393. [PMID: 11180334 DOI: 10.1002/1521-3773(20010119)40:2<389::aid-anie389>3.0.co;2-#] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Sven Grüneberg
- Institut für Pharmazeutische Chemie der Universität Marbacher Weg 6, 35032 Marburg (Germany)
| | | | | |
Collapse
|
537
|
Thormann M, Pons M. Massive docking of flexible ligands using environmental niches in parallelized genetic algorithms. J Comput Chem 2001. [DOI: 10.1002/jcc.1146] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
538
|
Bissantz C, Folkers G, Rognan D. Protein-based virtual screening of chemical databases. 1. Evaluation of different docking/scoring combinations. J Med Chem 2000; 43:4759-67. [PMID: 11123984 DOI: 10.1021/jm001044l] [Citation(s) in RCA: 521] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Three different database docking programs (Dock, FlexX, Gold) have been used in combination with seven scoring functions (Chemscore, Dock, FlexX, Fresno, Gold, Pmf, Score) to assess the accuracy of virtual screening methods against two protein targets (thymidine kinase, estrogen receptor) of known three-dimensional structure. For both targets, it was generally possible to discriminate about 7 out of 10 true hits from a random database of 990 ligands. The use of consensus lists common to two or three scoring functions clearly enhances hit rates among the top 5% scorers from 10% (single scoring) to 25-40% (double scoring) and up to 65-70% (triple scoring). However, in all tested cases, no clear relationships could be found between docking and ranking accuracies. Moreover, predicting the absolute binding free energy of true hits was not possible whatever docking accuracy was achieved and scoring function used. As the best docking/consensus scoring combination varies with the selected target and the physicochemistry of target-ligand interactions, we propose a two-step protocol for screening large databases: (i) screening of a reduced dataset containing a few known ligands for deriving the optimal docking/consensus scoring scheme, (ii) applying the latter parameters to the screening of the entire database.
Collapse
Affiliation(s)
- C Bissantz
- Department of Applied Biosciences, ETH Zürich, Winterthurerstrasse 190, CH-8057 Zürich, Switzerland
| | | | | |
Collapse
|
539
|
Good AC, Krystek SR, Mason JS. High-throughput and virtual screening: core lead discovery technologies move towards integration. Drug Discov Today 2000; 5:61-69. [PMID: 11564568 DOI: 10.1016/s1359-6446(00)00015-5] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
In addition to high-throughput screening (HTS), the main lead discovery technology employed by most pharmaceutical companies today is virtual screening (VS). Although the two techniques have somewhat different philosophical origins, they contain many synergies that can potentially enhance the lead discovery process. Here, we describe many of the latest developments in VS technology with particular emphasis on their potential impact on HTS in, for example, focussed screening and data mining. In addition, we highlight key issues that need to be addressed before the potential of such efforts can be fully realized.
Collapse
Affiliation(s)
- A C. Good
- Bristol-Myers Squibb 5 Research Parkway PO Box 5100, 06492, Wallingford, CT, USA
| | | | | |
Collapse
|
540
|
Chemoinformatics: are we exploiting this new science? 'We need to make chemoinformatics tools more accessible to the bench chemist.'. Drug Discov Today 2000; 5:483-485. [PMID: 11084381 DOI: 10.1016/s1359-6446(00)01560-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
541
|
Abstract
Rational drug discovery requires an early appraisal of all factors impacting on the likely success of a drug candidate in the subsequent preclinical, clinical and commercial phases of drug development. The study of absorption, distribution, metabolism, excretion and pharmacokinetics (ADME/PK) has developed into a relatively mature discipline in drug discovery through the application of well-established in vitro and in vivo methodologies. The availability of improved analytical and automation technologies has dramatically increased our ability to dissect out the fundamentals of ADME/PK through the development of increasingly powerful in silico methods. This is fuelling a shift away from the traditional, empirical nature of ADME/PK towards a more rational, in cerebro approach to drug design.
Collapse
|
542
|
Abstract
Over the last ten years, nmr spectroscopy has evolved into an important discipline in drug discovery. Initially, nmr was most useful as a technique to provide structural information regarding protein drug targets and target-ligand interactions. More recently, it has been shown that nmr may be used as an alternative method for identification of small molecule ligands that bind to protein drug targets. High throughput implementation of these experiments to screen small molecule libraries may lead to identification of potent and novel lead compounds. In this review, we will use examples from our own research to illustrate how nmr experiments to characterize ligand binding may be used to both screen for novel compounds during the process of lead generation, as well as provide structural information useful for lead optimization during the latter stages of a discovery program.
Collapse
Affiliation(s)
- J M Moore
- Vertex Pharmaceuticals Incorporated, 130 Waverly Street, Cambridge, MA 02139-4242, USA.
| |
Collapse
|
543
|
Sheridan RP, SanFeliciano SG, Kearsley SK. Designing targeted libraries with genetic algorithms. J Mol Graph Model 2000; 18:320-34, 525. [PMID: 11143552 DOI: 10.1016/s1093-3263(00)00060-7] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
In combinatorial synthesis, molecules are assembled by linking chemically similar fragments. Because the number of available chemical fragments often greatly exceeds the number that can be used in one synthetic experiment, one needs a rational method for choosing a subset of desirable fragments. If a combinatorial library is to be targeted against a particular biological activity, virtual screening methods can be used to predict which molecules in a virtual library are most likely to be active. When the number of possible molecules in a virtual library is very large, genetic algorithms (GAs) or simulated annealing can be used to quickly find high-scoring molecules by sampling a small subset of the total combinatorial space. We previously demonstrated how a GA can be used to select a subset of fragments for a combinatorial library, and we used topology-based methods of scoring. Here we extend that earlier work in three ways. (1) We demonstrate use of the GA with 3D scoring methods developed in our laboratory. (2) We show that the approach of assembling libraries from fragments in high-scoring molecules is a reasonable one. (3) We compare results from a library-based GA to those from a molecule-based GA.
Collapse
Affiliation(s)
- R P Sheridan
- Department of Molecular Systems, Merck Research Laboratories, P.O.B. 2000, Rahway, NJ 07065, USA.
| | | | | |
Collapse
|
544
|
Abstract
Which strategy is best for hit identification? Making the right choices in the capital-intensive world of modern drug discovery can make the difference between success and expensive failure. Keeping an open mind to all the options is essential. Two well-established strategies are diversity-based and focussed screening. This review will provide contrasting viewpoints highlighting the strengths and deficiencies of each approach, as well as some insights into why both strategies are likely to have a place in the research armoury of a successful drug company.
Collapse
|
545
|
Carlson HA, Masukawa KM, Rubins K, Bushman FD, Jorgensen WL, Lins RD, Briggs JM, McCammon JA. Developing a dynamic pharmacophore model for HIV-1 integrase. J Med Chem 2000; 43:2100-14. [PMID: 10841789 DOI: 10.1021/jm990322h] [Citation(s) in RCA: 212] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We present the first receptor-based pharmacophore model for HIV-1 integrase. The development of "dynamic" pharmacophore models is a new method that accounts for the inherent flexibility of the active site and aims to reduce the entropic penalties associated with binding a ligand. Furthermore, this new drug discovery method overcomes the limitation of an incomplete crystal structure of the target protein. A molecular dynamics (MD) simulation describes the flexibility of the uncomplexed protein. Many conformational models of the protein are saved from the MD simulations and used in a series of multi-unit search for interacting conformers (MUSIC) simulations. MUSIC is a multiple-copy minimization method, available in the BOSS program; it is used to determine binding regions for probe molecules containing functional groups that complement the active site. All protein conformations from the MD are overlaid, and conserved binding regions for the probe molecules are identified. Those conserved binding regions define the dynamic pharmacophore model. Here, the dynamic model is compared to known inhibitors of the integrase as well as a three-point, ligand-based pharmacophore model from the literature. Also, a "static" pharmacophore model was determined in the standard fashion, using a single crystal structure. Inhibitors thought to bind in the active site of HIV-1 integrase fit the dynamic model but not the static model. Finally, we have identified a set of compounds from the Available Chemicals Directory that fit the dynamic pharmacophore model, and experimental testing of the compounds has confirmed several new inhibitors.
Collapse
Affiliation(s)
- H A Carlson
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California 92093-0365, USA.
| | | | | | | | | | | | | | | |
Collapse
|
546
|
Estrada E, Uriarte E, Montero A, Teijeira M, Santana L, De Clercq E. A novel approach for the virtual screening and rational design of anticancer compounds. J Med Chem 2000; 43:1975-85. [PMID: 10821710 DOI: 10.1021/jm991172d] [Citation(s) in RCA: 142] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A topological substructural approach to molecular design (TOSS-MODE) has been introduced for the selection and design of anticancer compounds. A quantitative model that discriminates anticancer compounds from the inactive ones in a training series was obtained. This model permits the correct classification of 91.43% of compounds in an external prediction set with only 1.43% of false actives and 7. 14% of false inactives. The model developed is then used in a simulation of a virtual search for Ras FTase inhibitors; 87% of the Ras FTase inhibitors used in this simulated search were correctly classified, thus indicating the ability of the TOSS-MODE model of finding lead compounds with novel structures and mechanism of action. Finally, a series of carbonucleosides was designed, and the compounds were classified as active/inactive anticancer compounds by using the model developed here. From the compounds so-designed, 20 were synthesized and evaluated experimentally for their antitumor effects on the proliferation of murine leukemia cells (L1210/0) and human T-lymphocyte cells (Molt4/C8 and CEM/0); 80% of these compounds were well-classified, as active or inactive, and only two pairs of isomeric compounds were false actives. The chloropurine derivatives were the most active compounds, especially compounds 6c, d.
Collapse
Affiliation(s)
- E Estrada
- Faculty of Pharmacy, Department of Organic Chemistry, Universidad de Santiago de Compostela, 15706 Santiago de Compostela, Spain.
| | | | | | | | | | | |
Collapse
|
547
|
Combinatorial library design: maximizing model-fitting compounds within matrix synthesis constraints. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2000; 40:701-5. [PMID: 10850774 DOI: 10.1021/ci990183c] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The use of combinatorial chemistry has become commonplace within the pharmaceutical industry. Less widespread but gaining in popularity is the derivation of activity models from the high-throughput assays of these libraries. Such models are then used as filters during the design of refined daughter libraries. The design of these second generation libraries, which efficiently test and conform to the derived activity model from the large space of virtual possibilities, remains an area of considerable research. We present here a computationally efficient method for the design of optimally dense (in model matching compounds) synthetic matrices from in silico virtual libraries.
Collapse
|
548
|
New approach to molecular docking and its application to virtual screening of chemical databases. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2000; 40:254-62. [PMID: 10761126 DOI: 10.1021/ci990440d] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
This paper describes the validation of a molecular docking method and its application to virtual database screening. The code flexibly docks ligand molecules into rigid receptor structures using a tabu search methodology driven by an empirically derived function for estimating the binding affinity of a protein-ligand complex. The docking method has been tested on 70 ligand-receptor complexes for which the experimental binding affinity and binding geometry are known. The lowest energy geometry produced by the docking protocol is within 2.0 A root mean square of the experimental binding mode for 79% of the complexes. The method has been applied to the problem of virtual database screening to identify known ligands for thrombin, factor Xa, and the estrogen receptor. A database of 10,000 randomly chosen "druglike" molecules has been docked into the three receptor structures. In each case known receptor ligands were included in the study. The results showed good separation between the predicted binding affinities of the known ligand set and the database subset.
Collapse
|
549
|
Darvas F, Dormán G, Papp A. Diversity measures for enhancing ADME admissibility of combinatorial libraries. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2000; 40:314-22. [PMID: 10761133 DOI: 10.1021/ci990268d] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
For general screening libraries, structural diversity descriptors and drug-likeness indicators still do not guarantee the in vivo bioavailability for the candidates, which is considered a major bottleneck in drug development. Early prediction of pharmacokinetics (log P, log D), metabolism, and toxicity makes it possible to deal with ADME (adsorption, distribution, metabolism, excretion) related diversity as an extension to the classical diversity concepts. It opens several new possibilities for optimization of a discovery library before doing any experimental screening. This new diversity concept is demonstrated on a subset of MeDiverse, which is a diverse collection of pharmacologically relevant compounds selected from our in-house library. From consideration of the ADME interface in living systems, virtual secondary libraries of metabolites and retrometabolites (prodrugs) can be generated. These additional libraries readily enhance both the structural and ADME related diversity. This new opportunity in library design can substantially improve the success rate for in vivo lead generation from in vitro hits.
Collapse
Affiliation(s)
- F Darvas
- ComGenex, Inc., Budapest, Hungary.
| | | | | |
Collapse
|
550
|
Bit-string methods for selective compound acquisition. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2000; 40:210-4. [PMID: 10761120 DOI: 10.1021/ci990428l] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Selective compound acquisition programs need to ensure that the compounds that are chosen do not contain undesirable functionality. This is easy to achieve if a supplier is prepared to provide unambiguous structure representations for the compounds that they have available: this paper discusses selection techniques that can be used when a supplier is prepared to make available only fragment bit-string representations for the compounds in their catalog. Experiments with three databases and three types of bit-string show that a simple k-nearest-neighbor searching method provides a surprisingly effective, although far from perfect, way of selecting compounds when only bit-string representations are available. A second approach, based on the use of a fragment weighting scheme analogous to those used in substructural analysis studies, proved to be noticeably less effective in operation.
Collapse
|