Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Chauhan JS, Dhanda SK, Singla D, Agarwal SM, Raghava GPS. QSAR-based models for designing quinazoline/imidazothiazoles/pyrazolopyrimidines based inhibitors against wild and mutant EGFR. PLoS One 2014;9:e101079. [PMID: 24992720 PMCID: PMC4081576 DOI: 10.1371/journal.pone.0101079] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Received: 02/18/2014] [Accepted: 06/02/2014] [Indexed: 01/19/2023] Open

Cited by Other Article(s)

Boonyarit B, Yamprasert N, Kaewnuratchadasorn P, Kinchagawat J, Prommin C, Rungrotmongkol T, Nutanong S. GraphEGFR: Multi-task and transfer learning based on molecular graph attention mechanism and fingerprints improving inhibitor bioactivity prediction for EGFR family proteins on data scarcity. J Comput Chem 2024;45:2001-2023. [PMID: 38713612 DOI: 10.1002/jcc.27388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/08/2024] [Revised: 04/16/2024] [Accepted: 04/19/2024] [Indexed: 05/09/2024]

Abstract

The proteins within the human epidermal growth factor receptor (EGFR) family, members of the tyrosine kinase receptor family, play a pivotal role in the molecular mechanisms driving the development of various tumors. Tyrosine kinase inhibitors, key compounds in targeted therapy, encounter challenges in cancer treatment due to emerging drug resistance mutations. Consequently, machine learning has undergone significant evolution to address the challenges of cancer drug discovery related to EGFR family proteins. However, the application of deep learning in this area is hindered by inherent difficulties associated with small-scale data, particularly the risk of overfitting. Moreover, the design of a model architecture that facilitates learning through multi-task and transfer learning, coupled with appropriate molecular representation, poses substantial challenges. In this study, we introduce GraphEGFR, a deep learning regression model designed to enhance molecular representation and model architecture for predicting the bioactivity of inhibitors against both wild-type and mutant EGFR family proteins. GraphEGFR integrates a graph attention mechanism for molecular graphs with deep and convolutional neural networks for molecular fingerprints. We observed that GraphEGFR models employing multi-task and transfer learning strategies generally achieve predictive performance comparable to existing competitive methods. The integration of molecular graphs and fingerprints adeptly captures relationships between atoms and enables both global and local pattern recognition. We further validated potential multi-targeted inhibitors for wild-type and mutant HER1 kinases, exploring key amino acid residues through molecular dynamics simulations to understand molecular interactions. 
This predictive model offers a robust strategy that could significantly contribute to overcoming the challenges of developing deep learning models for drug discovery with limited data and exploring new frontiers in multi-targeted kinase drug discovery for EGFR family proteins.

Chang H, Zhang Z, Tian J, Bai T, Xiao Z, Wang D, Qiao R, Li C. Machine Learning-Based Virtual Screening and Identification of the Fourth-Generation EGFR Inhibitors. ACS Omega 2024;9:2314-2324. [PMID: 38250375 PMCID: PMC10795152 DOI: 10.1021/acsomega.3c06225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/22/2023] [Revised: 11/06/2023] [Accepted: 11/15/2023] [Indexed: 01/23/2024]

Choudhary R, Walhekar V, Muthal A, Kumar D, Bagul C, Kulkarni R. Machine learning facilitated structural activity relationship approach for the discovery of novel inhibitors targeting EGFR. J Biomol Struct Dyn 2023;41:12445-12463. [PMID: 36762704 DOI: 10.1080/07391102.2023.2175263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/10/2022] [Accepted: 01/03/2023] [Indexed: 02/11/2023]

Abstract

This research manuscript aims to find the most effective epidermal growth factor receptor (EGFR) inhibitors from millions of in-house compounds through Machine Learning (ML) techniques. ML-based structure activity relationship (SAR) models were validated to predict biological activity of untested novel molecules. Six ML algorithms, including k nearest neighbour (KNN), decision tree (DT), Logistic Regression, support vector machine (SVM), multilinear regression (MLR), and random forest (RF), were used to build models for activity prediction. Among these, the RF classifier (accuracy of 90% and 81% on the training and test sets) and the RF regressor (R² and MSE of 0.83 and 0.29 on the training set, and 0.69 and 0.46 on the test set) showed good predictive performance. Also, the six most essential features that affect the biological activity parameter and highly contribute to model development were successfully selected by the variable importance technique. The RF regression model was used to predict the biological activity, expressed as pIC50, of nearly ten million molecules, while the RF classification model classified those molecules into active, moderately active, and least active according to their predicted pIC50. Based on the two models, a thousand molecules with higher predicted pIC50 values that were classified as active were selected from these millions for molecular docking. Based on the docking scores, predicted pIC50, and binding interactions with the MET769 residue, three compounds, Zinc257233137, Zinc257232249, and Zinc101379788, were identified as potential EGFR inhibitors with predicted pIC50 values of 7.72, 7.85, and 7.70. Dynamics studies were also performed on Zinc257233137 to illustrate that it has good binding free energy and stable hydrogen bonding interactions with EGFR. These molecules can be used for further research and may prove to be novel drugs for EGFR in cancer treatment. Communicated by Ramaswamy H. Sarma.
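The pIC50 values quoted in this abstract can be read back as concentrations. The following is a minimal Python sketch, assuming the standard definition pIC50 = -log10(IC50 in mol/L); the function names are illustrative and do not come from the cited paper.

```python
import math


def ic50_nm_to_pic50(ic50_nm: float) -> float:
    """Convert an IC50 in nanomolar to pIC50, i.e. -log10(IC50 in mol/L)."""
    if ic50_nm <= 0:
        raise ValueError("IC50 must be positive")
    # 1 nM = 1e-9 mol/L, so -log10(x * 1e-9) = 9 - log10(x)
    return 9.0 - math.log10(ic50_nm)


def pic50_to_ic50_nm(pic50: float) -> float:
    """Convert a pIC50 back to an IC50 in nanomolar."""
    return 10.0 ** (9.0 - pic50)


# Predicted pIC50 values quoted in the abstract above
for pic50 in (7.72, 7.85, 7.70):
    print(f"pIC50 {pic50:.2f} corresponds to about {pic50_to_ic50_nm(pic50):.1f} nM")
```

Under this definition, each unit of pIC50 is a tenfold change in potency, so a pIC50 of 8 corresponds to an IC50 of 10 nM.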

Vetrivel A, Ramasamy J, Natchimuthu S, Senthil K, Ramasamy M, Murugesan R. Combined machine learning and pharmacophore based virtual screening approaches to screen for antibiofilm inhibitors targeting LasR of Pseudomonas aeruginosa. J Biomol Struct Dyn 2022;41:4124-4142. [PMID: 35451916 DOI: 10.1080/07391102.2022.2064331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/18/2022]

Abstract

Pseudomonas aeruginosa, a virulent pathogen, affects patients with cystic fibrosis and nosocomial infections. The quorum sensing (QS) mechanism plays a crucial role in causing these ailments by mediating biofilm formation and expressing virulent genes. A novel approach to circumvent this bacterial infection is to hinder its QS network. Targeting LasR of the las system is beneficial, as it holds the top position in the QS system cascade. Here, we have integrated machine learning, pharmacophore based virtual screening, molecular docking and simulation studies to look for new leads as inhibitors for LasR. A support vector machine (SVM) learning algorithm was used to generate QSAR models from a dataset of 66 antagonists. The top three models resulted in correlation coefficient (R²) values of 0.67, 0.86, and 0.91, respectively. The correlation coefficient (R²_test) values on the external test set were found to be 0.62, 0.57, and 0.55, respectively. A four-point pharmacophore model was developed. The pharmacophore hypothesis AAAD_1 was used to screen for potential leads against the MolPort database in ZincPharmer. The leads which showed a predicted pIC50 value of >8.00 by the SVM models were subjected to docking analysis that reranked the compounds based on docking scores. Four top leads, namely ZINC3851967 N-[3,5-bis(trifluoromethyl)phenyl]-5-tert-butyl-6-chloropyrazine-2-carboxamide, ZINC4024175 4-Amino-1-[(2R,3S,4S,5S)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-2-oxopyrimidine-5-carbonitrile, ZINC2125703 N-[(5-Methoxy-4,7-dimethyl-2-oxo-2H-chromen-3-yl)acetyl]-beta-alanine, and ZINC3851966 N-[3,5-Bis(trifluoromethyl)phenyl]-5-tert-butylpyrazine-2-carboxamide, were selected. These compounds were checked for their stability by performing a molecular dynamics simulation for a period of 100 ns. The ADME properties of the leads were also determined. Hence, the compounds identified in this study can be used as possible leads for developing a novel inhibitor for LasR. Communicated by Ramaswamy H. Sarma.

Nguyen L, Nguyen Vo TH, Trinh QH, Nguyen BH, Nguyen-Hoang PU, Le L, Nguyen BP. iANP-EC: Identifying Anticancer Natural Products Using Ensemble Learning Incorporated with Evolutionary Computation. J Chem Inf Model 2022;62:5080-5089. [PMID: 35157472 DOI: 10.1021/acs.jcim.1c00920] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Indexed: 12/24/2022]

Quantitative structure activity relationship and artificial neural network as vital tools in predicting coordination capabilities of organic compounds with metal surface: A review. Coord Chem Rev 2021. [DOI: 10.1016/j.ccr.2021.214101] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Indexed: 11/13/2022]

Building 2D classification models and 3D CoMSIA models on small-molecule inhibitors of both wild-type and T790M/L858R double-mutant EGFR. Mol Divers 2021;26:1715-1730. [PMID: 34636023 DOI: 10.1007/s11030-021-10300-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 07/03/2021] [Accepted: 08/17/2021] [Indexed: 10/20/2022]

Dhall A, Patiyal S, Sharma N, Devi NL, Raghava GPS. Computer-aided prediction of inhibitors against STAT3 for managing COVID-19 associated cytokine storm. Comput Biol Med 2021;137:104780. [PMID: 34450382 PMCID: PMC8378993 DOI: 10.1016/j.compbiomed.2021.104780] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 06/20/2021] [Revised: 08/11/2021] [Accepted: 08/18/2021] [Indexed: 12/27/2022]

Saini R, Fatima S, Agarwal SM. TMLRpred: A machine learning classification model to distinguish reversible EGFR double mutant inhibitors. Chem Biol Drug Des 2021;96:921-930. [PMID: 33058464 DOI: 10.1111/cbdd.13697] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 12/01/2019] [Revised: 03/26/2020] [Accepted: 04/03/2020] [Indexed: 12/26/2022]

Jiang X, Ren M, Shuang X, Yang H, Shi D, Lai Q, Dong Y. Multiparametric MRI-Based Radiomics Approaches for Preoperative Prediction of EGFR Mutation Status in Spinal Bone Metastases in Patients with Lung Adenocarcinoma. J Magn Reson Imaging 2021;54:497-507. [PMID: 33638577 DOI: 10.1002/jmri.27579] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Received: 12/23/2020] [Revised: 02/10/2021] [Accepted: 02/12/2021] [Indexed: 12/30/2022] Open

Abstract

BACKGROUND

Preoperative prediction of epidermal growth factor receptor (EGFR) mutation status in patients with spinal bone metastases (SBM) from primary lung adenocarcinoma is potentially important for treatment decisions.

PURPOSE

To develop and validate multiparametric magnetic resonance imaging (MRI)-based radiomics methods for preoperative prediction of EGFR mutation based on MRI of SBM.

STUDY TYPE

Retrospective.

POPULATION

A total of 97 preoperative patients with lumbar SBM from lung adenocarcinoma (77 in training set and 20 in validation set).

FIELD STRENGTH/SEQUENCE

T1-weighted, T2-weighted, and T2-weighted fat-suppressed fast spin echo sequences at 3.0 T.

ASSESSMENT

Radiomics handcrafted and deep learning-based features were extracted and selected from each MRI sequence. The abilities of the features to predict EGFR mutation status were analyzed and compared. A radiomics nomogram was constructed integrating the selected features.

STATISTICAL TESTS

The Mann-Whitney U test and χ² test were employed for evaluating associations between clinical characteristics and EGFR mutation status for continuous and discrete variables, respectively. Least absolute shrinkage and selection operator was used for selection of predictive features. Sensitivity (SEN), specificity (SPE), and area under the receiver operating characteristic curve (AUC) were used to evaluate the ability of radiomics models to predict the EGFR mutation. Calibration and decision curve analysis (DCA) were performed to assess and validate nomogram results.

RESULTS

The radiomics signature comprised five handcrafted and one deep learning-based features and achieved good performance for predicting EGFR mutation status, with AUCs of 0.891 (95% confidence interval [CI], 0.820-0.962, SEN = 0.913, SPE = 0.710) in the training group and 0.771 (95% CI, 0.551-0.991, SEN = 0.750, SPE = 0.875) in the validation group. DCA confirmed the potential clinical usefulness of the radiomics models.

DATA CONCLUSION

Multiparametric MRI-based radiomics is potentially clinically valuable for predicting EGFR mutation status in patients with SBM from lung adenocarcinoma.

LEVEL OF EVIDENCE

3

TECHNICAL EFFICACY

2

Kumar M, Joshi G, Chatterjee J, Kumar R. Epidermal Growth Factor Receptor and its Trafficking Regulation by Acetylation: Implication in Resistance and Exploring the Newer Therapeutic Avenues in Cancer. Curr Top Med Chem 2021;20:1105-1123. [PMID: 32031073 DOI: 10.2174/1568026620666200207100227] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 11/04/2019] [Revised: 01/17/2020] [Accepted: 01/24/2020] [Indexed: 12/13/2022]

QSAR study of human epidermal growth factor receptor (EGFR) inhibitors: conformation-independent models. Med Chem Res 2019. [DOI: 10.1007/s00044-019-02437-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 01/02/2023]

Rajput A, Kumar A, Kumar M. Computational Identification of Inhibitors Using QSAR Approach Against Nipah Virus. Front Pharmacol 2019;10:71. [PMID: 30809147 PMCID: PMC6379726 DOI: 10.3389/fphar.2019.00071] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 10/12/2018] [Accepted: 01/21/2019] [Indexed: 12/26/2022] Open

Sharma A, Sharma S, Gupta M, Fatima S, Saini R, Agarwal SM. Pharmacokinetic profiling of anticancer phytocompounds using computational approach. Phytochem Anal 2018;29:559-568. [PMID: 29667756 DOI: 10.1002/pca.2767] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Received: 08/31/2017] [Revised: 02/15/2018] [Accepted: 02/17/2018] [Indexed: 06/08/2023]

Fatima S, Agarwal SM. Unraveling structural requirements of amino-pyrimidine T790M/L858R double mutant EGFR inhibitors: 2D and 3D QSAR study. J Recept Signal Transduct Res 2018;38:299-306. [DOI: 10.1080/10799893.2018.1494740] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 01/10/2023]

Fatima S, Gupta P, Agarwal SM. Insight into structural requirements of antiamoebic flavonoids: 3D-QSAR and G-QSAR studies. Chem Biol Drug Des 2018;92:1743-1749. [DOI: 10.1111/cbdd.13343] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 02/06/2018] [Revised: 05/03/2018] [Accepted: 05/12/2018] [Indexed: 01/05/2023]

Anoosha P, Sakthivel R, Gromiha MM. Investigating mutation-specific biological activities of small molecules using quantitative structure-activity relationship for epidermal growth factor receptor in cancer. Mutat Res 2017;806:19-26. [PMID: 28938109 DOI: 10.1016/j.mrfmmm.2017.08.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Received: 07/14/2017] [Revised: 08/21/2017] [Accepted: 08/22/2017] [Indexed: 06/07/2023]

Speck-Planche A, Dias Soeiro Cordeiro MN. Speeding up Early Drug Discovery in Antiviral Research: A Fragment-Based in Silico Approach for the Design of Virtual Anti-Hepatitis C Leads. ACS Comb Sci 2017;19:501-512. [PMID: 28437091 DOI: 10.1021/acscombsci.7b00039] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Indexed: 01/24/2023]

Discovery of Indeno[1,2-c]quinoline Derivatives as Potent Dual Antituberculosis and Anti-Inflammatory Agents. Molecules 2017. [PMID: 28621733 PMCID: PMC6152673 DOI: 10.3390/molecules22061001] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Indexed: 02/04/2023] Open

Abbasi M, Sadeghi-Aliabadi H, Amanlou M. 3D-QSAR, molecular docking, and molecular dynamic simulations for prediction of new Hsp90 inhibitors based on isoxazole scaffold. J Biomol Struct Dyn 2017;36:1463-1478. [PMID: 28482755 DOI: 10.1080/07391102.2017.1326319] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Indexed: 02/07/2023]

Qureshi A, Kaur G, Kumar M. AVCpred: an integrated web server for prediction and design of antiviral compounds. Chem Biol Drug Des 2017;89:74-83. [PMID: 27490990 PMCID: PMC7162012 DOI: 10.1111/cbdd.12834] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Received: 04/13/2016] [Revised: 07/21/2016] [Accepted: 07/25/2016] [Indexed: 12/11/2022]

Singh H, Kumar R, Singh S, Chaudhary K, Gautam A, Raghava GPS. Prediction of anticancer molecules using hybrid model developed on molecules screened against NCI-60 cancer cell lines. BMC Cancer 2016;16:77. [PMID: 26860193 PMCID: PMC4748564 DOI: 10.1186/s12885-016-2082-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Received: 08/17/2015] [Accepted: 01/21/2016] [Indexed: 11/16/2022] Open

Abstract

Background

In the past, numerous quantitative structure-activity relationship (QSAR) based models have been developed for predicting anticancer activity for a specific class of molecules against different cancer drug targets. In contrast, limited attempts have been made to predict the anticancer activity of a diverse class of chemicals against a wide variety of cancer cell lines. In this study, we describe a hybrid method developed on thousands of anticancer and non-anticancer molecules tested against National Cancer Institute (NCI) 60 cancer cell lines.

Results

Our analysis of anticancer molecules revealed that the majority of anticancer molecules contain 18–24 carbon atoms and are dominated by functional groups like R₂NH, R₃N, ROH, RCOR, and ROR. It was also observed that certain substructures (e.g., 1-methoxy-4-methylbenzene, 1-methoxy benzene, nitrobenzene, indole, propenyl benzene) are more abundant in anticancer molecules. Next, we developed anticancer molecule prediction models using various machine-learning techniques and achieved a maximum Matthews correlation coefficient (MCC) of 0.81 with 90.40% accuracy using support vector machine (SVM) based models. In another approach, a novel similarity or potency score based method was developed using selected fragments/fingerprints and achieved a maximum MCC of 0.82 with 90.65% accuracy. Finally, we combined the strengths of the above methods and developed a hybrid method with a maximum MCC of 0.85 and 92.47% accuracy.

Conclusions

We developed a hybrid method utilizing the best of the machine learning and potency score based methods. The highly accurate hybrid method can be used for classification of anticancer and non-anticancer molecules. To assist the scientific community working in the field of anticancer drug discovery, we integrated the hybrid and potency methods into a web server, CancerIN. This server provides various facilities, including virtual screening of anticancer molecules, analog based drug design, and similarity with known anticancer molecules (http://crdd.osdd.net/oscadd/cancerin).

Electronic supplementary material

The online version of this article (doi:10.1186/s12885-016-2082-y) contains supplementary material, which is available to authorized users.

Bérubé G. An overview of molecular hybrids in drug discovery. Expert Opin Drug Discov 2016;11:281-305. [PMID: 26727036 DOI: 10.1517/17460441.2016.1135125] [Citation(s) in RCA: 188] [Impact Index Per Article: 23.5] [Indexed: 12/26/2022]

Dhiman K, Agarwal SM. NPred: QSAR classification model for identifying plant based naturally occurring anti-cancerous inhibitors. RSC Adv 2016. [DOI: 10.1039/c6ra02772e] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Indexed: 01/29/2023] Open

Agarwal SM, Sharma M, Fatima S. VOCC: a database of volatile organic compounds in cancer. RSC Adv 2016. [DOI: 10.1039/c6ra24414a] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Indexed: 01/04/2023] Open

Sharma VK, Nandekar PP, Sangamwar A, Pérez-Sánchez H, Agarwal SM. Structure guided design and binding analysis of EGFR inhibiting analogues of erlotinib and AEE788 using ensemble docking, molecular dynamics and MM-GBSA. RSC Adv 2016. [DOI: 10.1039/c6ra08517b] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Indexed: 11/21/2022] Open

Dearden JC, Hewitt M, Roberts DW, Enoch SJ, Rowe PH, Przybylak KR, Vaughan-Williams GD, Smith ML, Pillai GG, Katritzky AR. Mechanism-Based QSAR Modeling of Skin Sensitization. Chem Res Toxicol 2015;28:1975-86. [PMID: 26382665 DOI: 10.1021/acs.chemrestox.5b00197] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Indexed: 11/29/2022]

Singh H, Singh S, Singla D, Agarwal SM, Raghava GPS. QSAR based model for discriminating EGFR inhibitors and non-inhibitors using Random forest. Biol Direct 2015;10:10. [PMID: 25880749 PMCID: PMC4372225 DOI: 10.1186/s13062-015-0046-9] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Received: 11/03/2014] [Accepted: 03/06/2015] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Epidermal Growth Factor Receptor (EGFR) is a well-characterized cancer drug target. In the past, several QSAR models have been developed for predicting inhibition activity of molecules against EGFR. These models are useful to a limited set of molecules for a particular class like quinazoline-derivatives. In this study, an attempt has been made to develop prediction models on a large set of molecules (~3500 molecules) that include diverse scaffolds like quinazoline, pyrimidine, quinoline and indole.

RESULTS

We trained, tested and validated our classification models on a dataset called EGFR10 that contains 508 inhibitors (having inhibition activity IC50 less than 10 nM) and 2997 non-inhibitors. Our Random forest based model achieved a maximum MCC of 0.49 with an accuracy of 83.7% on a validation set using 881 PubChem fingerprints. In this study, a frequency-based feature selection technique was used to identify the best fingerprints. It was observed that PubChem fingerprints FP380 (C(~O) (~O)), FP579 (O = C-C-C-C), FP388 (C(:C) (:N) (:N)) and FP816 (ClC1CC(Br)CCC1) are more frequent in the inhibitors than in the non-inhibitors. In addition, we created further datasets, namely EGFR100, containing inhibitors having IC50 < 100 nM, and EGFR1000, containing inhibitors having IC50 < 1000 nM. We trained, tested and validated our models on the EGFR100 and EGFR1000 datasets and achieved maximum MCC values of 0.58 and 0.71, respectively. In addition, models were developed for predicting quinazoline and pyrimidine based EGFR inhibitors.

CONCLUSIONS

In summary, models have been developed on a large set of molecules of various classes for discriminating EGFR inhibitors and non-inhibitors. These highly accurate prediction models can be used to design and discover novel EGFR inhibitors. To serve the scientific community, a web server and standalone tool, EGFRpred, has also been developed (http://crdd.osdd.net/oscadd/egfrpred/).
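The abstract reports MCC alongside accuracy, which matters when classes are as unbalanced as in EGFR10 (508 inhibitors against 2997 non-inhibitors). The sketch below, in Python, computes both metrics from confusion-matrix counts using the standard MCC formula. The "label everything a non-inhibitor" baseline is a hypothetical illustration, not a model from the cited paper, and returning 0 when the denominator vanishes is a common convention assumed here.

```python
import math


def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Assumed convention: report 0 when a predicted or true class is empty
        return 0.0
    return (tp * tn - fp * fn) / denom


# Class sizes taken from the abstract (EGFR10 dataset)
n_inhibitors, n_non_inhibitors = 508, 2997

# Hypothetical baseline: every molecule is predicted to be a non-inhibitor
tp, fn = 0, n_inhibitors
tn, fp = n_non_inhibitors, 0

baseline_accuracy = accuracy(tp, tn, fp, fn)
baseline_mcc = mcc(tp, tn, fp, fn)
print(f"baseline accuracy = {baseline_accuracy:.3f}, baseline MCC = {baseline_mcc:.2f}")
```

With this class balance, the trivial baseline already reaches an accuracy close to that of a real model while its MCC stays at zero, which is why an accuracy figure on such data is usually read together with MCC.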
