Reference Citation Analysis

For: Harper G, Bradshaw J, Gittins JC, Green DV, Leach AR. Prediction of biological activity for high-throughput screening using binary kernel discrimination. J Chem Inf Comput Sci 2001;41:1295-300. [PMID: 11604029 DOI: 10.1021/ci000397q] [Citation(s) in RCA: 65] [Impact Index Per Article: 2.8] [Indexed: 11/28/2022]

Cited by Other Article(s)

Minot M, Reddy ST. Meta learning addresses noisy and under-labeled data in machine learning-guided antibody engineering. Cell Syst 2024;15:4-18.e4. [PMID: 38194961 DOI: 10.1016/j.cels.2023.12.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/30/2023] [Revised: 07/21/2023] [Accepted: 12/07/2023] [Indexed: 01/11/2024]

Carneiro J, Magalhães RP, de la Oliva Roque VM, Simões M, Pratas D, Sousa SF. TargIDe: a machine-learning workflow for target identification of molecules with antibiofilm activity against Pseudomonas aeruginosa. J Comput Aided Mol Des 2023;37:265-278. [PMID: 37085636 DOI: 10.1007/s10822-023-00505-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 02/10/2023] [Accepted: 04/12/2023] [Indexed: 04/23/2023]

Deep learning model for classification and bioactivity prediction of essential oil-producing plants from Egypt. Sci Rep 2020;10:21349. [PMID: 33288845 PMCID: PMC7721748 DOI: 10.1038/s41598-020-78449-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 05/18/2020] [Accepted: 11/20/2020] [Indexed: 11/29/2022] Open

Berenger F, Yamanishi Y. Ranking Molecules with Vanishing Kernels and a Single Parameter: Active Applicability Domain Included. J Chem Inf Model 2020;60:4376-4387. [PMID: 32281797 DOI: 10.1021/acs.jcim.9b01075] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/29/2022]

de la Vega de León A, Chen B, Gillet VJ. Effect of missing data on multitask prediction methods. J Cheminform 2018;10:26. [PMID: 29789977 PMCID: PMC5964064 DOI: 10.1186/s13321-018-0281-z] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Received: 03/19/2018] [Accepted: 05/14/2018] [Indexed: 01/05/2023] Open

Afolabi LT, Saeed F, Hashim H, Petinrin OO. Ensemble learning method for the prediction of new bioactive molecules. PLoS One 2018;13:e0189538. [PMID: 29329334 PMCID: PMC5766097 DOI: 10.1371/journal.pone.0189538] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Received: 07/24/2017] [Accepted: 11/27/2017] [Indexed: 12/31/2022] Open

Riniker S, Landrum GA, Montanari F, Villalba SD, Maier J, Jansen JM, Walters WP, Shelat AA. Virtual-screening workflow tutorials and prospective results from the Teach-Discover-Treat competition 2014 against malaria. F1000Res 2017;6:1136. [PMID: 28928948 DOI: 10.12688/f1000research.11905.1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Accepted: 07/11/2017] [Indexed: 12/21/2022] Open

Riniker S, Landrum GA, Montanari F, Villalba SD, Maier J, Jansen JM, Walters WP, Shelat AA. Virtual-screening workflow tutorials and prospective results from the Teach-Discover-Treat competition 2014 against malaria. F1000Res 2017. [PMID: 28928948 PMCID: PMC5580409 DOI: 10.12688/f1000research.11905.2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Indexed: 01/14/2023] Open

Babajide Mustapha I, Saeed F. Bioactive Molecule Prediction Using Extreme Gradient Boosting. Molecules 2016;21:983. [PMID: 27483216 PMCID: PMC6273295 DOI: 10.3390/molecules21080983] [Citation(s) in RCA: 98] [Impact Index Per Article: 12.3] [Received: 05/01/2016] [Revised: 07/19/2016] [Accepted: 07/22/2016] [Indexed: 01/29/2023] Open

Gilson MK, Liu T, Baitaluk M, Nicola G, Hwang L, Chong J. BindingDB in 2015: A public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic Acids Res 2016;44:D1045-53. [PMID: 26481362 PMCID: PMC4702793 DOI: 10.1093/nar/gkv1072] [Citation(s) in RCA: 804] [Impact Index Per Article: 100.5] [Received: 08/20/2015] [Revised: 10/02/2015] [Accepted: 10/05/2015] [Indexed: 12/12/2022] Open

The Parzen Window method: In terms of two vectors and one matrix. Pattern Recognit Lett 2015;63:30-35. [PMID: 26435560 PMCID: PMC4534349 DOI: 10.1016/j.patrec.2015.06.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Indexed: 11/22/2022]

Abstract

• We revisit the Parzen Window approach widely employed in pattern recognition.

• The Parzen Window approach can suffer from a severe computational bottleneck.

• This manuscript introduces a new scheme to ameliorate this computational drawback.

Pattern classification methods assign an object to one of several predefined classes/categories based on features extracted from observed attributes of the object (pattern). When L discriminatory features for the pattern can be accurately determined, the pattern classification problem presents no difficulty. However, precisely identifying the features a classification algorithm (classifier) needs to categorize real-world patterns without error is generally infeasible. In this case, the pattern classification problem is often cast as devising a classifier that minimizes the misclassification rate. One way of doing this is to treat both the pattern attributes and the class label as random variables, estimate the posterior class probabilities for a given pattern, and then assign the pattern to the class/category whose estimated posterior probability is largest. More often than not, the form of the posterior class probabilities is unknown.

The so-called Parzen Window approach is widely employed to estimate class-conditional (class-specific) probability densities for a given pattern. These probability densities can then be used to estimate the appropriate posterior class probabilities for that pattern. However, the Parzen Window scheme can become computationally impractical when the size of the training dataset is in the tens of thousands and L is also large (a few hundred or more). Over the years, various schemes have been suggested to ameliorate the computational drawback of the Parzen Window approach, but the problem remains unresolved.

In this paper, we revisit the Parzen Window technique and introduce a novel approach that may circumvent the aforementioned computational bottleneck. The current paper presents the mathematical aspect of our idea. Practical realizations of the proposed scheme will be given elsewhere.
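
As a rough illustration of the standard Parzen Window classifier that this abstract describes (not the new scheme proposed in the paper, which is not reproduced here), the sketch below estimates class-conditional densities with a Gaussian kernel and assigns a pattern to the class with the largest estimated posterior. The Gaussian kernel, the bandwidth `h`, the function names, and the toy data are all assumptions made for illustration.

```python
import math

def gaussian_kernel(x, xi, h):
    """Isotropic Gaussian kernel of bandwidth h between two L-dimensional points."""
    L = len(x)
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, xi))
    norm = (2.0 * math.pi * h * h) ** (L / 2.0)
    return math.exp(-sq_dist / (2.0 * h * h)) / norm

def parzen_density(x, samples, h):
    """Parzen Window estimate of p(x | class) from one class's training samples."""
    return sum(gaussian_kernel(x, xi, h) for xi in samples) / len(samples)

def classify(x, training, h=0.5):
    """Return (label, posteriors) using Bayes' rule on the Parzen estimates.

    `training` maps each class label to a list of feature vectors; class
    priors are taken from the class frequencies (an assumption).
    """
    n_total = sum(len(s) for s in training.values())
    joint = {
        label: parzen_density(x, samples, h) * len(samples) / n_total
        for label, samples in training.items()
    }
    evidence = sum(joint.values())
    posteriors = {label: value / evidence for label, value in joint.items()}
    best = max(posteriors, key=posteriors.get)
    return best, posteriors

# Made-up toy data with L = 2 features and two classes.
toy_training = {
    "active": [(0.0, 0.0), (0.1, 0.2), (-0.2, 0.1)],
    "inactive": [(3.0, 3.0), (3.2, 2.9), (2.8, 3.1)],
}
label, posteriors = classify((0.1, 0.1), toy_training)
```

Each query evaluates the kernel against every training sample in all L dimensions, so the cost per pattern grows with the product of training-set size and L. That is the computational bottleneck the abstract refers to.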

Pirhadi S, Shiri F, Ghasemi JB. Multivariate statistical analysis methods in QSAR. RSC Adv 2015. [DOI: 10.1039/c5ra10729f] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Indexed: 12/12/2022] Open

Lewis RA, Wood D. Modern 2D QSAR for drug discovery. Wiley Interdiscip Rev Comput Mol Sci 2014. [DOI: 10.1002/wcms.1187] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Indexed: 01/26/2023]

Abdo A, Leclère V, Jacques P, Salim N, Pupin M. Prediction of new bioactive molecules using a Bayesian belief network. J Chem Inf Model 2014;54:30-6. [PMID: 24392938 DOI: 10.1021/ci4004909] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Indexed: 11/30/2022]

Riniker S, Fechner N, Landrum GA. Heterogeneous Classifier Fusion for Ligand-Based Virtual Screening: Or, How Decision Making by Committee Can Be a Good Thing. J Chem Inf Model 2013;53:2829-36. [DOI: 10.1021/ci400466r] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Indexed: 01/16/2023]

Bauman JD, Patel D, Dharia C, Fromer MW, Ahmed S, Frenkel Y, Vijayan RSK, Eck JT, Ho WC, Das K, Shatkin AJ, Arnold E. Detecting allosteric sites of HIV-1 reverse transcriptase by X-ray crystallographic fragment screening. J Med Chem 2013;56:2738-46. [PMID: 23342998 PMCID: PMC3906421 DOI: 10.1021/jm301271j] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.0] [Indexed: 01/23/2023]

Affiliation(s)

Joseph D. Bauman: Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey; Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey
Disha Patel: Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey; Department of Medicinal Chemistry, Rutgers University, Piscataway, New Jersey
Chhaya Dharia: Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey; Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey
Marc W. Fromer: Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey; Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey
Sameer Ahmed: Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey; Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey
Yulia Frenkel: Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey; Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey
R. S. K. Vijayan: Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey; Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey
J. Thomas Eck: Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey; Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey
William C. Ho: Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey; Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey
Kalyan Das: Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey; Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey
Aaron J. Shatkin: Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey
Eddy Arnold: Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey; Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey; Department of Medicinal Chemistry, Rutgers University, Piscataway, New Jersey

Tyzack JD, Mussa HY, Glen RC. Probabilistic classifier: generated using randomised sub-sampling of the feature space. J Cheminform 2012. [PMCID: PMC3341313 DOI: 10.1186/1758-2946-4-s1-p40] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/10/2022] Open

Wang Z, Mussa HY, Lowe R, Glen RC, Yan A. Probability Based hERG Blocker Classifiers. Mol Inform 2012;31:679-85. [DOI: 10.1002/minf.201200011] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Received: 02/02/2012] [Accepted: 07/03/2012] [Indexed: 11/11/2022]

Nicola G, Liu T, Gilson MK. Public domain databases for medicinal chemistry. J Med Chem 2012;55:6987-7002. [PMID: 22731701 DOI: 10.1021/jm300501t] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.3] [Indexed: 12/11/2022]

Zhang S. Application of Machine Learning in Drug Discovery and Development. Mach Learn 2012. [DOI: 10.4018/978-1-60960-818-7.ch517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/09/2022]

He J, Yang G, Rao H, Li Z, Ding X, Chen Y. Prediction of human major histocompatibility complex class II binding peptides by continuous kernel discrimination method. Artif Intell Med 2011;55:107-15. [PMID: 22134095 DOI: 10.1016/j.artmed.2011.10.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 08/22/2011] [Revised: 10/12/2011] [Accepted: 10/21/2011] [Indexed: 11/25/2022]

Abstract

OBJECTIVE

Accurate prediction of major histocompatibility complex (MHC) class II binding peptides helps reduce the experimental cost of identifying helper T cell epitopes. It has been a challenging problem, partly because of the variable length of the binding peptides. This work aims to develop an accurate model for predicting MHC-binding peptides using machine learning methods.

METHODS

In this work, a machine learning method, continuous kernel discrimination (CKD), was used for predicting MHC class II binders of variable lengths. The composition, transition, and distribution features were used for encoding peptide sequences, and the Metropolis Monte Carlo simulated annealing approach was used for feature selection.

RESULTS

Feature selection was found to significantly improve the performance of the model. For benchmark dataset Dataset-1, the number of features is reduced from 147 to 24 and the area under the receiver operating characteristic curve (AUC) is improved from 0.8088 to 0.9034, while for benchmark dataset Dataset-2, the number of features is reduced from 147 to 44 and the AUC is improved from 0.7349 to 0.8499. An optimal CKD model was derived from the feature selection and bandwidth optimization using 10-fold cross-validation. Its AUC values are between 0.831 and 0.980 evaluated on benchmark datasets BM-Set1 and are between 0.806 and 0.949 on benchmark datasets BM-Set2 for MHC class II alleles. These results indicate a significantly better performance for our CKD model over other earlier models based on the training and testing of the same datasets.

CONCLUSIONS

Our study suggests that the CKD method outperforms other machine learning methods proposed earlier for the prediction of MHC class II binding peptides. Moreover, the choice of cut-off for the CKD classifier is crucial to its performance.

Lowe R, Mussa HY, Mitchell JBO, Glen RC. Classifying Molecules Using a Sparse Probabilistic Kernel Binary Classifier. J Chem Inf Model 2011;51:1539-44. [DOI: 10.1021/ci200128w] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Indexed: 11/28/2022]

Willett P. Similarity methods in chemoinformatics. Annu Rev Inf Sci Technol 2011. [DOI: 10.1002/aris.2009.1440430108] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Indexed: 01/04/2023]

Varnek A. Fragment descriptors in structure-property modeling and virtual screening. Methods Mol Biol 2011;672:213-243. [PMID: 20838971 DOI: 10.1007/978-1-60761-839-3_9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Indexed: 05/29/2023]

Mussa HY, Hawizy L, Nigsch F, Glen RC. Classifying large chemical data sets: using a regularized potential function method. J Chem Inf Model 2010;51:4-14. [PMID: 21155612 DOI: 10.1021/ci100022u] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Indexed: 11/30/2022]

Kutchukian PS, Shakhnovich EI. De novo design: balancing novelty and confined chemical space. Expert Opin Drug Discov 2010;5:789-812. [PMID: 22827800 DOI: 10.1517/17460441.2010.497534] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.4] [Indexed: 11/05/2022]

Geppert H, Vogt M, Bajorath J. Current trends in ligand-based virtual screening: molecular representations, data mining methods, new application areas, and performance evaluation. J Chem Inf Model 2010;50:205-16. [PMID: 20088575 DOI: 10.1021/ci900419k] [Citation(s) in RCA: 231] [Impact Index Per Article: 16.5] [Indexed: 12/19/2022]

Ranu S, Singh AK. Mining Statistically Significant Molecular Substructures for Efficient Molecular Classification. J Chem Inf Model 2009;49:2537-50. [DOI: 10.1021/ci900035z] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Indexed: 11/30/2022]

Kutchukian PS, Lou D, Shakhnovich EI. FOG: Fragment Optimized Growth Algorithm for the de Novo Generation of Molecules Occupying Druglike Chemical Space. J Chem Inf Model 2009;49:1630-42. [DOI: 10.1021/ci9000458] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Indexed: 02/06/2023]

Nasr RJ, Swamidass SJ, Baldi PF. Large scale study of multiple-molecule queries. J Cheminform 2009;1:7. [PMID: 20298525 PMCID: PMC3225883 DOI: 10.1186/1758-2946-1-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Received: 06/01/2009] [Accepted: 06/04/2009] [Indexed: 12/04/2022] Open

Abstract

Background

In ligand-based screening, as well as in other chemoinformatics applications, one seeks to effectively search large repositories of molecules in order to retrieve molecules that are similar, typically, to a single lead molecule. However, in some cases, multiple molecules from the same family are available to seed the query and search for other members of the same family.

Multiple-molecule query methods have been less studied than single-molecule query methods. Furthermore, the previous studies have relied on proprietary data and sometimes have not used proper cross-validation methods to assess the results. In contrast, here we develop and compare multiple-molecule query methods using several large publicly available data sets and background. We also create a framework based on a strict cross-validation protocol to allow unbiased benchmarking for direct comparison in future studies across several performance metrics.

Results

Fourteen different multiple-molecule query methods were defined and benchmarked using: (1) 41 publicly available data sets of related molecules with similar biological activity; and (2) publicly available background data sets consisting of up to 175,000 molecules randomly extracted from the ChemDB database and other sources. Eight of the fourteen methods were parameter free, and six of them fit one or two free parameters to the data using a careful cross-validation protocol. All the methods were assessed and compared for their ability to retrieve members of the same family against the background data set by using several performance metrics including the Area Under the Accumulation Curve (AUAC), Area Under the Curve (AUC), F1-measure, and BEDROC metrics.

Consistent with the previous literature, the best parameter-free methods are the MAX-SIM and MIN-RANK methods, which score a molecule to a family by the maximum similarity, or minimum ranking, obtained across the family. One new parameterized method introduced in this study and two previously defined methods, the Exponential Tanimoto Discriminant (ETD), the Tanimoto Power Discriminant (TPD), and the Binary Kernel Discriminant (BKD), outperform most other methods but are more complex, requiring one or two parameters to be fit to the data.

Conclusion

Fourteen methods for multiple-molecule querying of chemical databases, including the novel methods ETD and TPD, are validated using publicly available data sets, standard cross-validation protocols, and established metrics. The best results are obtained with ETD, TPD, BKD, MAX-SIM, and MIN-RANK. These results can be replicated and compared with the results of future studies using data freely downloadable from http://cdb.ics.uci.edu/.
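
The two parameter-free methods named in this abstract can be sketched in a few lines. The example below is a minimal illustration, assuming fingerprints are represented as sets of on-bit indices and compared with Tanimoto similarity; the function names and the toy fingerprints are hypothetical and are not taken from the study.

```python
def tanimoto(a, b):
    """Tanimoto similarity between two fingerprints given as sets of on-bits."""
    a, b = set(a), set(b)
    union = len(a | b)
    return len(a & b) / union if union else 0.0

def max_sim(candidate, family):
    """MAX-SIM: score a candidate by its maximum similarity to any family member."""
    return max(tanimoto(candidate, query) for query in family)

def min_rank(database, family):
    """MIN-RANK: score each molecule by its best (lowest) rank across the family.

    For every family member, the database is ranked by similarity to that
    member (rank 1 = most similar); a molecule keeps the minimum rank it
    obtains over all members.
    """
    best = {name: len(database) for name in database}
    for query in family:
        ranked = sorted(
            database,
            key=lambda name: tanimoto(database[name], query),
            reverse=True,
        )
        for rank, name in enumerate(ranked, start=1):
            best[name] = min(best[name], rank)
    return best

# Made-up toy fingerprints: two query molecules and a three-molecule database.
toy_family = [{1, 2, 3, 4}, {2, 3, 4, 5}]
toy_database = {"m1": {1, 2, 3, 4, 5}, "m2": {2, 3, 9}, "m3": {7, 8, 9}}

max_sim_scores = {name: max_sim(fp, toy_family) for name, fp in toy_database.items()}
min_rank_scores = min_rank(toy_database, toy_family)
```

Higher MAX-SIM scores and lower MIN-RANK scores both indicate a molecule that is more likely to belong to the query family. Neither score needs a parameter to be fit, which is what distinguishes these two methods from ETD, TPD, and BKD in the abstract.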

Green DVS. Virtual screening of chemical libraries for drug discovery. Expert Opin Drug Discov 2008;3:1011-26. [DOI: 10.1517/17460441.3.9.1011] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Indexed: 01/20/2023]

Ma XH, Wang R, Yang SY, Li ZR, Xue Y, Wei YC, Low BC, Chen YZ. Evaluation of virtual screening performance of support vector machines trained by sparsely distributed active compounds. J Chem Inf Model 2008;48:1227-37. [PMID: 18533644 DOI: 10.1021/ci800022e] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.0] [Indexed: 01/04/2023]

Reid D, Sadjad BS, Zsoldos Z, Simon A. LASSO—ligand activity by surface similarity order: a new tool for ligand based virtual screening. J Comput Aided Mol Des 2008;22:479-87. [DOI: 10.1007/s10822-007-9164-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Received: 10/01/2007] [Accepted: 12/18/2007] [Indexed: 10/22/2022]

Han LY, Ma XH, Lin HH, Jia J, Zhu F, Xue Y, Li ZR, Cao ZW, Ji ZL, Chen YZ. A support vector machines approach for virtual screening of active compounds of single and multiple mechanisms from large libraries at an improved hit-rate and enrichment factor. J Mol Graph Model 2007;26:1276-86. [PMID: 18218332 DOI: 10.1016/j.jmgm.2007.12.002] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.8] [Received: 04/29/2007] [Revised: 12/05/2007] [Accepted: 12/05/2007] [Indexed: 01/04/2023]

Li H, Yap CW, Ung CY, Xue Y, Li ZR, Han LY, Lin HH, Chen YZ. Machine learning approaches for predicting compounds that interact with therapeutic and ADMET related proteins. J Pharm Sci 2007;96:2838-60. [PMID: 17786989 DOI: 10.1002/jps.20985] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.7] [Indexed: 02/06/2023]

Willett P, Wilton D, Hartzoulakis B, Tang R, Ford J, Madge D. Prediction of Ion Channel Activity Using Binary Kernel Discrimination. J Chem Inf Model 2007;47:1961-6. [PMID: 17622131 DOI: 10.1021/ci700087v] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Indexed: 01/21/2023]

Pasupa K, Harrison RF, Willett P. Parsimonious Kernel Fisher Discrimination. Pattern Recognition and Image Analysis 2007. [DOI: 10.1007/978-3-540-72847-4_68] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Indexed: 11/29/2022]

Eckert H, Bajorath J. Molecular similarity analysis in virtual screening: foundations, limitations and novel approaches. Drug Discov Today 2007;12:225-33. [PMID: 17331887 DOI: 10.1016/j.drudis.2007.01.011] [Citation(s) in RCA: 312] [Impact Index Per Article: 18.4] [Received: 11/19/2006] [Revised: 12/22/2006] [Accepted: 01/23/2007] [Indexed: 11/27/2022]

Jensen BF, Vind C, Padkjaer SB, Brockhoff PB, Refsgaard HHF. In Silico Prediction of Cytochrome P450 2D6 and 3A4 Inhibition Using Gaussian Kernel Weighted k-Nearest Neighbor and Extended Connectivity Fingerprints, Including Structural Fragment Analysis of Inhibitors versus Noninhibitors. J Med Chem 2007;50:501-11. [PMID: 17266202 DOI: 10.1021/jm060333s] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.1] [Indexed: 02/03/2023]

Chen B, Harrison RF, Papadatos G, Willett P, Wood DJ, Lewell XQ, Greenidge P, Stiefl N. Evaluation of machine-learning methods for ligand-based virtual screening. J Comput Aided Mol Des 2007;21:53-62. [PMID: 17205373 DOI: 10.1007/s10822-006-9096-5] [Citation(s) in RCA: 93] [Impact Index Per Article: 5.5] [Received: 11/02/2006] [Accepted: 12/04/2006] [Indexed: 01/28/2023]

Willett P. Similarity-based virtual screening using 2D fingerprints. Drug Discov Today 2006;11:1046-53. [PMID: 17129822 DOI: 10.1016/j.drudis.2006.10.005] [Citation(s) in RCA: 547] [Impact Index Per Article: 30.4] [Received: 04/18/2006] [Revised: 09/04/2006] [Accepted: 10/09/2006] [Indexed: 11/19/2022]

Liu T, Lin Y, Wen X, Jorissen RN, Gilson MK. BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Res 2006;35:D198-201. [PMID: 17145705 PMCID: PMC1751547 DOI: 10.1093/nar/gkl999] [Citation(s) in RCA: 1216] [Impact Index Per Article: 67.6] [Indexed: 11/21/2022] Open

Willett P. Enhancing the Effectiveness of Ligand-Based Virtual Screening Using Data Fusion. QSAR Comb Sci 2006. [DOI: 10.1002/qsar.200610084] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.3] [Indexed: 11/06/2022]

Ganguly M, Brown N, Schuffenhauer A, Ertl P, Gillet VJ, Greenidge PA. Introducing the consensus modeling concept in genetic algorithms: application to interpretable discriminant analysis. J Chem Inf Model 2006;46:2110-24. [PMID: 16995742 DOI: 10.1021/ci050529l] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Indexed: 11/29/2022]

Auer J, Bajorath J. Emerging Chemical Patterns: A New Methodology for Molecular Classification and Compound Selection. J Chem Inf Model 2006;46:2502-14. [PMID: 17125191 DOI: 10.1021/ci600301t] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.4] [Indexed: 11/28/2022]

Hert J, Willett P, Wilton DJ, Acklin P, Azzaoui K, Jacoby E, Schuffenhauer A. New methods for ligand-based virtual screening: use of data fusion and machine learning to enhance the effectiveness of similarity searching. J Chem Inf Model 2006;46:462-70. [PMID: 16562973 DOI: 10.1021/ci050348j] [Citation(s) in RCA: 165] [Impact Index Per Article: 9.2] [Indexed: 11/28/2022]

Eckert H, Vogt I, Bajorath J. Mapping Algorithms for Molecular Similarity Analysis and Ligand-Based Virtual Screening: Design of DynaMAD and Comparison with MAD and DMC. J Chem Inf Model 2006;46:1623-34. [PMID: 16859294 DOI: 10.1021/ci060083o] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Indexed: 11/28/2022]

Wilton DJ, Harrison RF, Willett P, Delaney J, Lawson K, Mullier G. Virtual Screening Using Binary Kernel Discrimination: Analysis of Pesticide Data. J Chem Inf Model 2006;46:471-7. [PMID: 16562974 DOI: 10.1021/ci050397w] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.1] [Indexed: 11/28/2022]

Chen B, Harrison RF, Pasupa K, Willett P, Wilton DJ, Wood DJ, Lewell XQ. Virtual Screening Using Binary Kernel Discrimination: Effect of Noisy Training Data and the Optimization of Performance. J Chem Inf Model 2006;46:478-86. [PMID: 16562975 DOI: 10.1021/ci0505426] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Indexed: 11/29/2022]

Capelli AM, Feriani A, Tedesco G, Pozzan A. Generation of a Focused Set of GSK Compounds Biased toward Ligand-Gated Ion-Channel Ligands. J Chem Inf Model 2006;46:659-64. [PMID: 16562996 DOI: 10.1021/ci050353n] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Indexed: 11/29/2022]