Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Dhanda SK, Singla D, Mondal AK, Raghava GPS. DrugMint: a webserver for predicting and designing of drug-like molecules. Biol Direct 2013;8:28. [PMID: 24188205 PMCID: PMC3826839 DOI: 10.1186/1745-6150-8-28] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2013] [Accepted: 10/24/2013] [Indexed: 11/10/2022] Open

For:	Dhanda SK, Singla D, Mondal AK, Raghava GPS. DrugMint: a webserver for predicting and designing of drug-like molecules. Biol Direct 2013;8:28. [PMID: 24188205 PMCID: PMC3826839 DOI: 10.1186/1745-6150-8-28] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2013] [Accepted: 10/24/2013] [Indexed: 11/10/2022] Open

Number

Cited by Other Article(s)

Singh S, Singh PK, Sachan K, Kumar M, Bhardwaj P. Automation of Drug Discovery through Cutting-edge In-silico Research in Pharmaceuticals: Challenges and Future Scope. Curr Comput Aided Drug Des 2024;20:723-735. [PMID: 37807412 DOI: 10.2174/0115734099260187230921073932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 08/05/2023] [Accepted: 08/18/2023] [Indexed: 10/10/2023]

Xia S, Chen E, Zhang Y. Integrated Molecular Modeling and Machine Learning for Drug Design. J Chem Theory Comput 2023;19:7478-7495. [PMID: 37883810 PMCID: PMC10653122 DOI: 10.1021/acs.jctc.3c00814] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 10/10/2023] [Accepted: 10/11/2023] [Indexed: 10/28/2023]

Zhu W, Wang Y, Niu Y, Zhang L, Liu Z. Current Trends and Challenges in Drug-Likeness Prediction: Are They Generalizable and Interpretable? HEALTH DATA SCIENCE 2023;3:0098. [PMID: 38487200 PMCID: PMC10880170 DOI: 10.34133/hds.0098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 10/20/2023] [Indexed: 03/17/2024]

Dou Y, Meng W. Comparative analysis of weka-based classification algorithms on medical diagnosis datasets. Technol Health Care 2023;31:397-408. [PMID: 37066939 DOI: 10.3233/thc-236034] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]

Abstract

BACKGROUND

With the advent of 5G and the era of Big Data, the rapid development of medical information technology around the world, the massive application of electronic medical records and cases, and the digitization of medical equipment and instruments, a large amount of data has accumulated in the database system of hospitals, which includes clinical diagnosis data and hospital management data.

OBJECTIVE

This study aimed to examine the classification effects of different machine learning algorithms on medical datasets so as to better explore the value of machine learning methods in aiding medical diagnosis.

METHODS

The classification datasets of four different medical fields in the University of California Irvine machine learning database were used as the research object. Also, six categories of classification models based on the Bayesian theorem idea, integrated learning idea, and rule-based and tree-based idea were constructed using the Weka platform.

RESULTS

The between-group experiments showed that the Random Forest algorithm achieved the best results on the Indian liver disease patient dataset (ILPD), delivery cardiotocography (CADG), and lymphatic tractography (LYMP) datasets, followed by Bagging and partition and regression tree. In the within-group algorithm comparison experiments, the Bagging algorithm achieved better results than other algorithms based on the integration idea for 11 metrics on all datasets, mainly focusing on 2 binary datasets. Logit Boost had only 7 metrics with significant performance, and the best algorithm was Rotation Forest, with 28 metrics achieving optimal values. Among the algorithms based on tree ideas, the logistic model tree algorithm achieved optimal results on all metrics on the mammographic dataset (MAGR). The classification performance of BFTree, J48, and Random Tree was poor on each dataset. The best algorithm was Random Forest on the ILPD, CADG, and LYMP datasets with 27 metrics reaching the optimum.

CONCLUSION

Machine learning algorithms have good application value in disease prediction and can provide a reference basis for disease diagnosis.

Collapse

Pande A, Patiyal S, Lathwal A, Arora C, Kaur D, Dhall A, Mishra G, Kaur H, Sharma N, Jain S, Usmani SS, Agrawal P, Kumar R, Kumar V, Raghava GPS. Pfeature: A Tool for Computing Wide Range of Protein Features and Building Prediction Models. J Comput Biol 2023;30:204-222. [PMID: 36251780 DOI: 10.1089/cmb.2022.0241] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open

Abstract

In the last three decades, a wide range of protein features have been discovered to annotate a protein. Numerous attempts have been made to integrate these features in a software package/platform so that the user may compute a wide range of features from a single source. To complement the existing methods, we developed a method, Pfeature, for computing a wide range of protein features. Pfeature allows to compute more than 200,000 features required for predicting the overall function of a protein, residue-level annotation of a protein, and function of chemically modified peptides. It has six major modules, namely, composition, binary profiles, evolutionary information, structural features, patterns, and model building. Composition module facilitates to compute most of the existing compositional features, plus novel features. The binary profile of amino acid sequences allows to compute the fraction of each type of residue as well as its position. The evolutionary information module allows to compute evolutionary information of a protein in the form of a position-specific scoring matrix profile generated using Position-Specific Iterative Basic Local Alignment Search Tool (PSI-BLAST); fit for annotation of a protein and its residues. A structural module was developed for computing of structural features/descriptors from a tertiary structure of a protein. These features are suitable to predict the therapeutic potential of a protein containing non-natural or chemically modified residues. The model-building module allows to implement various machine learning techniques for developing classification and regression models as well as feature selection. Pfeature also allows the generation of overlapping patterns and features from a protein. A user-friendly Pfeature is available as a web server python library and stand-alone package.

Collapse

Affiliation(s)

Akshara Pande Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
Sumeet Patiyal Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
Anjali Lathwal Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
Chakit Arora Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
Dilraj Kaur Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
Anjali Dhall Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
Gaurav Mishra Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.,Department of Electrical Engineering, Shiv Nadar University, Greater Noida, India
Harpreet Kaur Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.,Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
Neelam Sharma Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
Shipra Jain Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
Salman Sadullah Usmani Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.,Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
Piyush Agrawal Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.,Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
Rajesh Kumar Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.,Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
Vinod Kumar Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.,Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
Gajendra P S Raghava Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India

Collapse

Erlina L, Paramita RI, Kusuma WA, Fadilah F, Tedjo A, Pratomo IP, Ramadhanti NS, Nasution AK, Surado FK, Fitriawan A, Istiadi KA, Yanuar A. Virtual screening of Indonesian herbal compounds as COVID-19 supportive therapy: machine learning and pharmacophore modeling approaches. BMC Complement Med Ther 2022;22:207. [PMID: 35922786 PMCID: PMC9347098 DOI: 10.1186/s12906-022-03686-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 07/21/2022] [Indexed: 11/10/2022] Open

Abstract

Background

The number of COVID-19 cases continues to grow in Indonesia. This phenomenon motivates researchers to find alternative drugs that function for prevention or treatment. Due to the rich biodiversity of Indonesian medicinal plants, one alternative is to examine the potential of herbal medicines to support COVID therapy. This study aims to identify potential compound candidates in Indonesian herbal using a machine learning and pharmacophore modeling approaches.

Methods

We used three classification methods that had different decision-making processes: support vector machine (SVM), multilayer perceptron (MLP), and random forest (RF). For the pharmacophore modeling approach, we performed a structure-based analysis on the 3D structure of the main protease SARS-CoV-2 (3CLPro) and repurposed SARS, MERS, and SARS-CoV-2 drugs identified from the literature as datasets in the ligand-based method. Lastly, we used molecular docking to analyze the interactions between the 3CLpro and 14 hit compounds from the Indonesian Herbal Database (HerbalDB), with lopinavir as a positive control.

Results

From the molecular docking analysis, we found six potential compounds that may act as the main proteases of the SARS-CoV-2 inhibitor: hesperidin, kaempferol-3,4'-di-O-methyl ether (Ermanin); myricetin-3-glucoside, peonidin 3-(4’-arabinosylglucoside); quercetin 3-(2G-rhamnosylrutinoside); and rhamnetin 3-mannosyl-(1-2)-alloside.

Conclusions

Our layered virtual screening with machine learning and pharmacophore modeling approaches provided a more objective and optimal virtual screening and avoided subjective decision making of the results. Herbal compounds from the screening, i.e. hesperidin, kaempferol-3,4'-di-O-methyl ether (Ermanin); myricetin-3-glucoside, peonidin 3-(4’-arabinosylglucoside); quercetin 3-(2G-rhamnosylrutinoside); and rhamnetin 3-mannosyl-(1-2)-alloside are potential antiviral candidates for SARS-CoV-2. Moringa oleifera and Psidium guajava that consist of those compounds, could be an alternative option as COVID-19 herbal preventions.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12906-022-03686-y.

Collapse

Modern research thoughts and methods on bio-active components of TCM formulae. Chin J Nat Med 2022;20:481-493. [DOI: 10.1016/s1875-5364(22)60206-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Indexed: 12/24/2022]

Azevedo L, Serafim MSM, Maltarollo VG, Grabrucker AM, Granato D. Atherosclerosis fate in the era of tailored functional foods: Evidence-based guidelines elicited from structure- and ligand-based approaches. Trends Food Sci Technol 2022. [DOI: 10.1016/j.tifs.2022.07.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Pantaleão SQ, Fernandes PO, Gonçalves JE, Maltarollo VG, Honorio KM. Recent Advances in the Prediction of Pharmacokinetics Properties in Drug Design Studies: A Review. ChemMedChem 2021;17:e202100542. [PMID: 34655454 DOI: 10.1002/cmdc.202100542] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Revised: 10/07/2021] [Indexed: 12/11/2022]

Shaker B, Ahmad S, Lee J, Jung C, Na D. In silico methods and tools for drug discovery. Comput Biol Med 2021;137:104851. [PMID: 34520990 DOI: 10.1016/j.compbiomed.2021.104851] [Citation(s) in RCA: 138] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 09/05/2021] [Accepted: 09/05/2021] [Indexed: 12/28/2022]

Dhall A, Patiyal S, Sharma N, Devi NL, Raghava GPS. Computer-aided prediction of inhibitors against STAT3 for managing COVID-19 associated cytokine storm. Comput Biol Med 2021;137:104780. [PMID: 34450382 PMCID: PMC8378993 DOI: 10.1016/j.compbiomed.2021.104780] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2021] [Revised: 08/11/2021] [Accepted: 08/18/2021] [Indexed: 12/27/2022]

Sharma N, Patiyal S, Dhall A, Devi NL, Raghava GPS. ChAlPred: A web server for prediction of allergenicity of chemical compounds. Comput Biol Med 2021;136:104746. [PMID: 34388468 DOI: 10.1016/j.compbiomed.2021.104746] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Revised: 08/04/2021] [Accepted: 08/04/2021] [Indexed: 11/28/2022]

Abstract

BACKGROUND

Allergy is the abrupt reaction of the immune system that may occur after the exposure to allergens such as proteins, peptides, or chemicals. In the past, various methods have been generated for predicting allergenicity of proteins and peptides. In contrast, there is no method that can predict allergenic potential of chemicals. In this paper, we described a method ChAlPred developed for predicting chemical allergens as well as for designing chemical analogs with desired allergenicity.

METHOD

In this study, we have used 403 allergenic and 1074 non-allergenic chemical compounds obtained from IEDB database. The PaDEL software was used to compute the molecular descriptors of the chemical compounds to develop different prediction models. All the models were trained and tested on the 80% training data and evaluated on the 20% validation data using the 2D, 3D and FP descriptors.

RESULTS

In this study, we have developed different prediction models using several machine learning approaches. It was observed that the Random Forest based model developed using hybrid descriptors performed the best, and achieved the maximum accuracy of 83.39% and AUC of 0.93 on validation dataset. The fingerprint analysis of the dataset indicates that certain chemical fingerprints are more abundant in allergens that include PubChemFP129 and GraphFP1014. We have also predicted allergenicity potential of FDA-approved drugs using our best model and identified the drugs causing allergic symptoms (e.g., Cefuroxime, Spironolactone, Tioconazole). Our results agreed with allergenicity of these drugs reported in literature.

CONCLUSIONS

To aid the research community, we developed a smart-device compatible web server ChAlPred (https://webs.iiitd.edu.in/raghava/chalpred/) that allows to predict and design the chemicals with allergenic properties.

Collapse

Oselusi SO, Christoffels A, Egieyeh SA. Cheminformatic Characterization of Natural Antimicrobial Products for the Development of New Lead Compounds. Molecules 2021;26:molecules26133970. [PMID: 34209681 PMCID: PMC8271829 DOI: 10.3390/molecules26133970] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2021] [Revised: 05/29/2021] [Accepted: 06/02/2021] [Indexed: 12/26/2022] Open

Peerzada MN, Hamel E, Bai R, Supuran CT, Azam A. Deciphering the key heterocyclic scaffolds in targeting microtubules, kinases and carbonic anhydrases for cancer drug development. Pharmacol Ther 2021;225:107860. [PMID: 33895188 DOI: 10.1016/j.pharmthera.2021.107860] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 03/31/2021] [Accepted: 04/06/2021] [Indexed: 12/17/2022]

Characterisation of twelve newly synthesised N-(substituted phenyl)-2-chloroacetamides with QSAR analysis and antimicrobial activity tests. Arh Hig Rada Toksikol 2021;72:70-79. [PMID: 33787186 PMCID: PMC8191425 DOI: 10.2478/aiht-2021-72-3483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Accepted: 02/01/2021] [Indexed: 11/20/2022] Open

Wang C, Wang W, Lu K, Zhang J, Chen P, Wang B. Predicting Drug-Target Interactions with Electrotopological State Fingerprints and Amphiphilic Pseudo Amino Acid Composition. Int J Mol Sci 2020;21:ijms21165694. [PMID: 32784497 PMCID: PMC7570185 DOI: 10.3390/ijms21165694] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2020] [Revised: 08/05/2020] [Accepted: 08/06/2020] [Indexed: 12/13/2022] Open

Kar S, Leszczynski J. Open access in silico tools to predict the ADMET profiling of drug candidates. Expert Opin Drug Discov 2020;15:1473-1487. [PMID: 32735147 DOI: 10.1080/17460441.2020.1798926] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

Cadow J, Born J, Manica M, Oskooei A, Rodríguez Martínez M. PaccMann: a web service for interpretable anticancer compound sensitivity prediction. Nucleic Acids Res 2020;48:W502-W508. [PMID: 32402082 PMCID: PMC7319576 DOI: 10.1093/nar/gkaa327] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Revised: 04/06/2020] [Accepted: 04/22/2020] [Indexed: 12/19/2022] Open

Gupta MK, Vemula S, Donde R, Gouda G, Behera L, Vadde R. In-silico approaches to detect inhibitors of the human severe acute respiratory syndrome coronavirus envelope protein ion channel. J Biomol Struct Dyn 2020;39:2617-2627. [PMID: 32238078 PMCID: PMC7171389 DOI: 10.1080/07391102.2020.1751300] [Citation(s) in RCA: 151] [Impact Index Per Article: 37.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Distinguishing drug/non-drug-like small molecules in drug discovery using deep belief network. Mol Divers 2020;25:827-838. [PMID: 32193758 DOI: 10.1007/s11030-020-10065-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2020] [Accepted: 02/26/2020] [Indexed: 10/24/2022]

Gupta MK, Vadde R. A computational structural biology study to understand the impact of mutation on structure-function relationship of inward-rectifier potassium ion channel Kir6.2 in human. J Biomol Struct Dyn 2020;39:1447-1460. [PMID: 32089084 DOI: 10.1080/07391102.2020.1733666] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]

A drug-likeness toolbox facilitates ADMET study in drug discovery. Drug Discov Today 2019;25:248-258. [PMID: 31705979 DOI: 10.1016/j.drudis.2019.10.014] [Citation(s) in RCA: 167] [Impact Index Per Article: 33.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Revised: 10/18/2019] [Accepted: 10/30/2019] [Indexed: 01/12/2023]

Gupta MK, Vadde R. Insights into the structure–function relationship of both wild and mutant zinc transporter ZnT8 in human: a computational structural biology approach. J Biomol Struct Dyn 2019;38:137-151. [DOI: 10.1080/07391102.2019.1567391] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Gupta MK, Vadde R. In silico identification of natural product inhibitors for γ‐secretase activating protein, a therapeutic target for Alzheimer's disease. J Cell Biochem 2018;120:10323-10336. [PMID: 30565717 DOI: 10.1002/jcb.28316] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Accepted: 11/28/2018] [Indexed: 12/19/2022]

Yosipof A, Guedes RC, García-Sosa AT. Data Mining and Machine Learning Models for Predicting Drug Likeness and Their Disease or Organ Category. Front Chem 2018;6:162. [PMID: 29868564 PMCID: PMC5954128 DOI: 10.3389/fchem.2018.00162] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2018] [Accepted: 04/20/2018] [Indexed: 12/11/2022] Open

Abstract

Data mining approaches can uncover underlying patterns in chemical and pharmacological property space decisive for drug discovery and development. Two of the most common approaches are visualization and machine learning methods. Visualization methods use dimensionality reduction techniques in order to reduce multi-dimension data into 2D or 3D representations with a minimal loss of information. Machine learning attempts to find correlations between specific activities or classifications for a set of compounds and their features by means of recurring mathematical models. Both models take advantage of the different and deep relationships that can exist between features of compounds, and helpfully provide classification of compounds based on such features or in case of visualization methods uncover underlying patterns in the feature space. Drug-likeness has been studied from several viewpoints, but here we provide the first implementation in chemoinformatics of the t-Distributed Stochastic Neighbor Embedding (t-SNE) method for the visualization and the representation of chemical space, and the use of different machine learning methods separately and together to form a new ensemble learning method called AL Boost. The models obtained from AL Boost synergistically combine decision tree, random forests (RF), support vector machine (SVM), artificial neural network (ANN), k nearest neighbors (kNN), and logistic regression models. In this work, we show that together they form a predictive model that not only improves the predictive force but also decreases bias. This resulted in a corrected classification rate of over 0.81, as well as higher sensitivity and specificity rates for the models. In addition, separation and good models were also achieved for disease categories such as antineoplastic compounds and nervous system diseases, among others. Such models can be used to guide decision on the feature landscape of compounds and their likeness to either drugs or other characteristics, such as specific or multiple disease-category(ies) or organ(s) of action of a molecule.

Collapse

Singh H, Kumar R, Singh S, Chaudhary K, Gautam A, Raghava GPS. Prediction of anticancer molecules using hybrid model developed on molecules screened against NCI-60 cancer cell lines. BMC Cancer 2016;16:77. [PMID: 26860193 PMCID: PMC4748564 DOI: 10.1186/s12885-016-2082-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2015] [Accepted: 01/21/2016] [Indexed: 11/16/2022] Open

Abstract

Background

In past, numerous quantitative structure-activity relationship (QSAR) based models have been developed for predicting anticancer activity for a specific class of molecules against different cancer drug targets. In contrast, limited attempt have been made to predict the anticancer activity of a diverse class of chemicals against a wide variety of cancer cell lines. In this study, we described a hybrid method developed on thousands of anticancer and non-anticancer molecules tested against National Cancer Institute (NCI) 60 cancer cell lines.

Results

Our analysis of anticancer molecules revealed that majority of anticancer molecules contains 18–24 carbon atoms and are dominated by functional groups like R₂NH, R₃N, ROH, RCOR, and ROR. It was also observed that certain substructures (e.g., 1-methoxy-4-methylbenzene, 1-methoxy benzene, Nitrobenzene, Indole, Propenyl benzene) are more abundant in anticancer molecules. Next, we developed anticancer molecule prediction models using various machine-learning techniques and achieved maximum matthews correlation coefficient (MCC) of 0.81 with 90.40 % accuracy using support vector machine (SVM) based models. In another approach, a novel similarity or potency score based method has been developed using selected fragments/fingerprints and achieved maximum MCC of 0.82 with 90.65 % accuracy. Finally, we combined the strength of above methods and developed a hybrid method with maximum MCC of 0.85 with 92.47 % accuracy.

Conclusions

We developed a hybrid method utilizing the best of machine learning and potency score based method. The highly accurate hybrid method can be used for classification of anticancer and non-anticancer molecules. In order to facilitate scientific community working in the field of anticancer drug discovery, we integrate hybrid and potency method in a web server CancerIN. This server provides various facilities that includes; virtual screening of anticancer molecules, analog based drug design, and similarity with known anticancer molecules (http://crdd.osdd.net/oscadd/cancerin).

Electronic supplementary material

The online version of this article (doi:10.1186/s12885-016-2082-y) contains supplementary material, which is available to authorized users.

Collapse

Drug-symptom networking: Linking drug-likeness screening to drug discovery. Pharmacol Res 2015;103:105-13. [PMID: 26615785 DOI: 10.1016/j.phrs.2015.11.015] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/24/2015] [Revised: 10/26/2015] [Accepted: 11/11/2015] [Indexed: 01/19/2023]

Mead B, Morgan H, Mann-Knowlton A, Tedeschi L, Sloan C, Lang S, Hines C, Gragg M, Stofer J, Riemann K, Derr T, Heller E, Collins D, Landis P, Linna N, Jones D. Reveromycin A-Induced Apoptosis in Osteoclasts Is Not Accompanied by Necrosis. J Cell Biochem 2015;116:1646-57. [DOI: 10.1002/jcb.25125] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2015] [Accepted: 02/02/2015] [Indexed: 12/23/2022]

Affiliation(s)

Brittany Mead Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Heather Morgan Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Alyssa Mann-Knowlton Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Laura Tedeschi Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Chris Sloan Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Spenser Lang Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Cory Hines Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Megan Gragg Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Jonathan Stofer Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Kaitlin Riemann Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Tyler Derr Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Emily Heller Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
David Collins Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Paul Landis Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Nathan Linna Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana
Daniel Jones Division of Natural Sciences; Department of Biology; Indiana Wesleyan University; South Washington Street Marion Indiana

Collapse

Hernández Alvarez L, Naranjo Feliciano D, Hernández González JE, de Oliveira Soares R, Barreto Gomes DE, Pascutti PG. Insights into the Interactions of Fasciola hepatica Cathepsin L3 with a Substrate and Potential Novel Inhibitors through In Silico Approaches. PLoS Negl Trop Dis 2015;9:e0003759. [PMID: 25978322 PMCID: PMC4433193 DOI: 10.1371/journal.pntd.0003759] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2014] [Accepted: 04/14/2015] [Indexed: 11/19/2022] Open

Abstract

Background

Fasciola hepatica is the causative agent of fascioliasis, a disease affecting grazing animals, causing economic losses in global agriculture and currently being an important human zoonosis. Overuse of chemotherapeutics against fascioliasis has increased the populations of drug resistant parasites. F. hepatica cathepsin L3 is a protease that plays important roles during the life cycle of fluke. Due to its particular collagenolytic activity it is considered an attractive target against the infective phase of F. hepatica.

Methodology/Principal Findings

Starting with a three dimensional model of FhCL3 we performed a structure-based design of novel inhibitors through a computational study that combined virtual screening, molecular dynamics simulations, and binding free energy (ΔG_bind) calculations. Virtual screening was carried out by docking inhibitors obtained from the MYBRIDGE-HitFinder database inside FhCL3 and human cathepsin L substrate-binding sites. On the basis of dock-scores, five compounds were predicted as selective inhibitors of FhCL3. Molecular dynamic simulations were performed and, subsequently, an end-point method was employed to predict ΔG_bind values. Two compounds with the best ΔG_bind values (-10.68 kcal/mol and -7.16 kcal/mol), comparable to that of the positive control (-10.55 kcal/mol), were identified. A similar approach was followed to structurally and energetically characterize the interface of FhCL3 in complex with a peptidic substrate. Finally, through pair-wise and per-residue free energy decomposition we identified residues that are critical for the substrate/ligand binding and for the enzyme specificity.

Conclusions/Significance

The present study is the first computer-aided drug design approach against F. hepatica cathepsins. Here we predict the principal determinants of binding of FhCL3 in complex with a natural substrate by detailed energetic characterization of protease interaction surface. We also propose novel compounds as FhCL3 inhibitors. Overall, these results will foster the future rational design of new inhibitors against FhCL3, as well as other F. hepatica cathepsins.

Fascioliosis is considered an emerging disease in humans, causing important losses in global agriculture through the infection of livestock animals. The outcome of resistant parasites has increased the search for new drugs which may contribute to disease control. In recent decades, Fasciola cathepsins (FhCs) have been defined as the principal virulence factors of this parasite. Despite being in the same protein family, they have different specificities and, thus, distinct roles throughout the fluke life cycle. Differences in specificity have been attributed to a few variations in the sequence of key FhCs subsites. Currently, the structure-based drug design of inhibitors against Fasciola cathepsin Ls (FhCLs) with unknown structures is possible due to the availability of the three-dimensional structure of FhCL1. Our detailed structural analysis of the major infective juvenile enzyme (FhCL3) identifies the molecular determinants for protein binding. Also, novel potential inhibitors against FhCL3 are proposed, which might reduce host invasion and penetration processes. These compounds are predicted to interact with the binding site of the enzyme, therefore they could prevent substrate processing by competitive inhibition. The structure-based drug design strategy described here will be useful for the development of new potent and selective inhibitors against other FhCs.

Collapse

Singh H, Singh S, Singla D, Agarwal SM, Raghava GPS. QSAR based model for discriminating EGFR inhibitors and non-inhibitors using Random forest. Biol Direct 2015;10:10. [PMID: 25880749 PMCID: PMC4372225 DOI: 10.1186/s13062-015-0046-9] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2014] [Accepted: 03/06/2015] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Epidermal Growth Factor Receptor (EGFR) is a well-characterized cancer drug target. In the past, several QSAR models have been developed for predicting inhibition activity of molecules against EGFR. These models are useful to a limited set of molecules for a particular class like quinazoline-derivatives. In this study, an attempt has been made to develop prediction models on a large set of molecules (~3500 molecules) that include diverse scaffolds like quinazoline, pyrimidine, quinoline and indole.

RESULTS

We train, test and validate our classification models on a dataset called EGFR10 that contains 508 inhibitors (having inhibition activity IC50 less than 10 nM) and 2997 non-inhibitors. Our Random forest based model achieved maximum MCC 0.49 with accuracy 83.7% on a validation set using 881 PubChem fingerprints. In this study, frequency-based feature selection technique has been used to identify best fingerprints. It was observed that PubChem fingerprints FP380 (C(~O) (~O)), FP579 (O = C-C-C-C), FP388 (C(:C) (:N) (:N)) and FP 816 (ClC1CC(Br)CCC1) are more frequent in the inhibitors in comparison to non-inhibitors. In addition, we created different datasets namely EGFR100 containing inhibitors having IC50 < 100 nM and EGFR1000 containing inhibitors having IC50 < 1000 nM. We trained, test and validate our models on datasets EGFR100 and EGFR1000 datasets and achieved and maximum MCC 0.58 and 0.71 respectively. In addition, models were developed for predicting quinazoline and pyrimidine based EGFR inhibitors.

CONCLUSIONS

In summary, models have been developed on a large set of molecules of various classes for discriminating EGFR inhibitors and non-inhibitors. These highly accurate prediction models can be used to design and discover novel EGFR inhibitors. In order to provide service to the scientific community, a web server/standalone EGFRpred also has been developed ( http://crdd.osdd.net/oscadd/egfrpred/ ).

Collapse

Prediction of cancer proteins by integrating protein interaction, domain frequency, and domain interaction data using machine learning algorithms. BIOMED RESEARCH INTERNATIONAL 2015;2015:312047. [PMID: 25866773 PMCID: PMC4381656 DOI: 10.1155/2015/312047] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/02/2014] [Revised: 02/25/2015] [Accepted: 03/03/2015] [Indexed: 12/23/2022]

Xiong M, Guo Z, Han B, Chen M. Combating multidrug resistance in bacterial infection by targeting functional proteome with natural products. Nat Prod Res 2014;29:1624-9. [DOI: 10.1080/14786419.2014.991926] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

Ahmad S, Seebacher W, Wolkinger V, Presser A, Faist J, Kaiser M, Brun R, Saf R, Weis R. Synthesis and antiprotozoal activities of new 3-azabicyclo[3.2.2]nonanes. Arch Pharm Res 2014;38:1455-67. [PMID: 25433423 DOI: 10.1007/s12272-014-0523-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2014] [Accepted: 11/24/2014] [Indexed: 11/30/2022]

O′Hagan S, Swainston N, Handl J, Kell DB. A 'rule of 0.5' for the metabolite-likeness of approved pharmaceutical drugs. Metabolomics 2014;11:323-339. [PMID: 25750602 PMCID: PMC4342520 DOI: 10.1007/s11306-014-0733-z] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/04/2014] [Accepted: 09/08/2014] [Indexed: 12/20/2022]

Gupta S, Chavan S, Deobagkar DN, Deobagkar DD. Bio/chemoinformatics in India: an outlook. Brief Bioinform 2014;16:710-31. [PMID: 25159593 DOI: 10.1093/bib/bbu028] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2014] [Accepted: 07/28/2014] [Indexed: 12/25/2022] Open