Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Fernández-de Gortari E, García-Jacas CR, Martinez-Mayorga K, Medina-Franco JL. Database fingerprint (DFP): an approach to represent molecular databases. J Cheminform 2017;9:9. [PMID: 28224019 PMCID: PMC5293704 DOI: 10.1186/s13321-017-0195-1] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Received: 09/30/2016] [Accepted: 01/23/2017] [Indexed: 01/19/2023] Open

Cited by Other Article(s)

Lin M, Cai J, Wei Y, Peng X, Luo Q, Li B, Chen Y, Wang L. MalariaFlow: A comprehensive deep learning platform for multistage phenotypic antimalarial drug discovery. Eur J Med Chem 2024;277:116776. [PMID: 39173285 DOI: 10.1016/j.ejmech.2024.116776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/11/2024] [Revised: 07/31/2024] [Accepted: 08/01/2024] [Indexed: 08/24/2024]

Abstract

Malaria remains a significant global health challenge due to the growing drug resistance of Plasmodium parasites and the failure to block transmission within the human host. While machine learning (ML) and deep learning (DL) methods have shown promise in accelerating antimalarial drug discovery, the performance of deep learning models based on molecular graphs and other co-representation approaches warrants further exploration. Current research has overlooked mutant strains of the malaria parasite with varying degrees of sensitivity or resistance, and has not covered the prediction of inhibitory activities across the three major life cycle stages (liver, asexual blood, and gametocyte) within the human host, which is crucial for both treatment and transmission blocking. In this study, we manually curated a benchmark antimalarial activity dataset comprising 407,404 unique compounds and 410,654 bioactivity data points across ten Plasmodium phenotypes and three stages. Performance was systematically compared among two fingerprint-based ML models (RF::Morgan and XGBoost::Morgan), four graph-based DL models (GCN, GAT, MPNN, and Attentive FP), and three co-representation DL models (FP-GNN, HiGNN, and FG-BERT), revealing that: 1) the FP-GNN model achieved the best predictive performance, outperforming the other methods in distinguishing active and inactive compounds across balanced, more positive, and more negative datasets, with an overall AUROC of 0.900; 2) fingerprint-based ML models outperformed graph-based DL models on large datasets (>1000 compounds), but the three co-representation DL models were able to incorporate domain-specific chemical knowledge to bridge this gap, achieving better predictive performance. These findings provide valuable guidance for selecting appropriate ML and DL methods for antimalarial activity prediction tasks.
The interpretability analysis of the FP-GNN model revealed its ability to accurately capture the key structural features responsible for the liver- and blood-stage activities of the known antimalarial drug atovaquone. Finally, we developed a web server, MalariaFlow, incorporating these high-quality models for antimalarial activity prediction, virtual screening, and similarity search, successfully predicting novel triple-stage antimalarial hits validated through experimental testing, demonstrating its effectiveness and value in discovering potential multistage antimalarial drug candidates.
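The abstract above summarizes model quality as an AUROC. As a minimal illustration of what that number measures (not the authors' evaluation code), the sketch below computes AUROC as the fraction of active/inactive pairs in which the active compound receives the higher score, counting ties as half. The function name `auroc` and the toy labels and scores are hypothetical.

```python
def auroc(labels, scores):
    """AUROC as the probability that a random positive outscores a random negative.

    labels: iterable of 0/1 (1 = active); scores: predicted scores, same length.
    Ties between a positive and a negative count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy data (made up): 4 of the 6 active/inactive pairs are ranked correctly.
toy_labels = [1, 1, 0, 0, 1]
toy_scores = [0.9, 0.4, 0.35, 0.8, 0.7]
print(auroc(toy_labels, toy_scores))
```

This pairwise form is quadratic in the number of compounds; for large datasets a rank-based implementation from an established library would normally be used instead.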

Madushanka A, Laird E, Clark C, Kraka E. SmartCADD: AI-QM Empowered Drug Discovery Platform with Explainability. J Chem Inf Model 2024;64:6799-6813. [PMID: 39177478 DOI: 10.1021/acs.jcim.4c00720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 08/24/2024]

Srinivasan K, Puliyanda A, Prasad V. Identification of Reaction Network Hypotheses for Complex Feedstocks from Spectroscopic Measurements with Minimal Human Intervention. J Phys Chem A 2024;128:4714-4729. [PMID: 38836378 DOI: 10.1021/acs.jpca.4c01592] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/06/2024]

López-Pérez K, Kim TD, Miranda-Quintana RA. iSIM: instant similarity. Digital Discovery 2024;3:1160-1171. [PMID: 38873032 PMCID: PMC11167700 DOI: 10.1039/d4dd00041b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/06/2024] [Accepted: 05/06/2024] [Indexed: 06/15/2024]

Liu H, Chen P, Hu B, Wang S, Wang H, Luan J, Wang J, Lin B, Cheng M. FaissMolLib: An efficient and easy deployable tool for ligand-based virtual screening. Comput Biol Chem 2024;110:108057. [PMID: 38581840 DOI: 10.1016/j.compbiolchem.2024.108057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/23/2023] [Revised: 03/06/2024] [Accepted: 03/20/2024] [Indexed: 04/08/2024]

Affiliation(s)

Haihan Liu Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
Peiying Chen Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
Baichun Hu Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
Shizun Wang Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
Hanxun Wang Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
Jiasi Luan Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Medical Devices, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
Jian Wang Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
Bin Lin Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
Maosheng Cheng Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China

Sardar S, Bhattacharya A, Amin SA, Jha T, Gayen S. Exploring molecular fingerprints of different drugs having bile interaction: a stepping stone towards better drug delivery. Mol Divers 2024;28:1471-1483. [PMID: 37369957 DOI: 10.1007/s11030-023-10670-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/20/2023] [Accepted: 06/10/2023] [Indexed: 06/29/2023]

Vogt M. Chemoinformatic approaches for navigating large chemical spaces. Expert Opin Drug Discov 2024;19:403-414. [PMID: 38300511 DOI: 10.1080/17460441.2024.2313475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/12/2023] [Accepted: 01/30/2024] [Indexed: 02/02/2024]

Avellaneda-Tamayo JF, Chávez-Hernández AL, Prado-Romero DL, Medina-Franco JL. Chemical Multiverse and Diversity of Food Chemicals. J Chem Inf Model 2024;64:1229-1244. [PMID: 38356237 PMCID: PMC10900296 DOI: 10.1021/acs.jcim.3c01617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/05/2023] [Revised: 02/03/2024] [Accepted: 02/06/2024] [Indexed: 02/16/2024]

Siddharth T, Lewis NE. Predicting pathways for old and new metabolites through clustering. J Theor Biol 2024;578:111684. [PMID: 38048983 PMCID: PMC11139542 DOI: 10.1016/j.jtbi.2023.111684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/08/2023] [Revised: 11/17/2023] [Accepted: 11/29/2023] [Indexed: 12/06/2023]

Barrera-Vázquez OS, Escobar-Ramírez JL, Santiago-Mejía J, Carrasco-Ortega OF, Magos-Guerrero GA. Discovering Potential Compounds for Venous Disease Treatment through Virtual Screening and Network Pharmacology Approach. Molecules 2023;28:7937. [PMID: 38138427 PMCID: PMC10745828 DOI: 10.3390/molecules28247937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/31/2023] [Revised: 11/28/2023] [Accepted: 11/30/2023] [Indexed: 12/24/2023] Open

Li X, Yuan H, Wu X, Wang C, Wu M, Shi H, Lv Y. MultiDS-MDA: Integrating multiple data sources into heterogeneous network for predicting novel metabolite-drug associations. Comput Biol Med 2023;162:107067. [PMID: 37276756 DOI: 10.1016/j.compbiomed.2023.107067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/24/2023] [Revised: 05/15/2023] [Accepted: 05/27/2023] [Indexed: 06/07/2023]

Pikalyova R, Zabolotna Y, Horvath D, Marcou G, Varnek A. Chemical Library Space: Definition and DNA-Encoded Library Comparison Study Case. J Chem Inf Model 2023. [PMID: 37368824 DOI: 10.1021/acs.jcim.3c00520] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Indexed: 06/29/2023]

Minh Quang N, Tran Thai H, Le Thi H, Duc Cuong N, Hien NQ, Hoang D, Ngoc VTB, Ky Minh V, Van Tat P. Novel Thiosemicarbazone Quantum Dots in the Treatment of Alzheimer's Disease Combining In Silico Models Using Fingerprints and Physicochemical Descriptors. ACS Omega 2023;8:11076-11099. [PMID: 37008140 PMCID: PMC10061515 DOI: 10.1021/acsomega.2c07934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/13/2022] [Accepted: 03/07/2023] [Indexed: 06/19/2023]

Pope JD, Drummer OH, Schneider HG. False-Positive Amphetamines in Urine Drug Screens: A 6-Year Review. J Anal Toxicol 2023;47:263-270. [PMID: 36367744 DOI: 10.1093/jat/bkac089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/09/2022] [Revised: 09/14/2022] [Accepted: 11/10/2022] [Indexed: 11/13/2022] Open

Caballero Alfonso AY, Chayawan C, Gadaleta D, Roncaglioni A, Benfenati E. A KNIME Workflow to Assist the Analogue Identification for Read-Across, Applied to Aromatase Activity. Molecules 2023;28:1832. [PMID: 36838826 PMCID: PMC9961311 DOI: 10.3390/molecules28041832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/29/2022] [Revised: 02/07/2023] [Accepted: 02/10/2023] [Indexed: 02/18/2023] Open

Erlina L, Paramita RI, Kusuma WA, Fadilah F, Tedjo A, Pratomo IP, Ramadhanti NS, Nasution AK, Surado FK, Fitriawan A, Istiadi KA, Yanuar A. Virtual screening of Indonesian herbal compounds as COVID-19 supportive therapy: machine learning and pharmacophore modeling approaches. BMC Complement Med Ther 2022;22:207. [PMID: 35922786 PMCID: PMC9347098 DOI: 10.1186/s12906-022-03686-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/07/2021] [Accepted: 07/21/2022] [Indexed: 11/10/2022] Open

Abstract

Background

The number of COVID-19 cases continues to grow in Indonesia. This phenomenon motivates researchers to find alternative drugs for prevention or treatment. Given the rich biodiversity of Indonesian medicinal plants, one alternative is to examine the potential of herbal medicines to support COVID-19 therapy. This study aims to identify potential candidate compounds among Indonesian herbal compounds using machine learning and pharmacophore modeling approaches.

Methods

We used three classification methods with different decision-making processes: support vector machine (SVM), multilayer perceptron (MLP), and random forest (RF). For the pharmacophore modeling approach, we performed a structure-based analysis on the 3D structure of the SARS-CoV-2 main protease (3CLpro) and, in the ligand-based method, used repurposed SARS, MERS, and SARS-CoV-2 drugs identified from the literature as datasets. Lastly, we used molecular docking to analyze the interactions between 3CLpro and 14 hit compounds from the Indonesian Herbal Database (HerbalDB), with lopinavir as a positive control.

Results

From the molecular docking analysis, we found six potential compounds that may act as inhibitors of the SARS-CoV-2 main protease: hesperidin; kaempferol-3,4'-di-O-methyl ether (Ermanin); myricetin-3-glucoside; peonidin 3-(4'-arabinosylglucoside); quercetin 3-(2G-rhamnosylrutinoside); and rhamnetin 3-mannosyl-(1-2)-alloside.

Conclusions

Our layered virtual screening with machine learning and pharmacophore modeling approaches provided a more objective and optimal virtual screening and avoided subjective decision making about the results. Herbal compounds from the screening, i.e. hesperidin; kaempferol-3,4'-di-O-methyl ether (Ermanin); myricetin-3-glucoside; peonidin 3-(4'-arabinosylglucoside); quercetin 3-(2G-rhamnosylrutinoside); and rhamnetin 3-mannosyl-(1-2)-alloside, are potential antiviral candidates for SARS-CoV-2. Moringa oleifera and Psidium guajava, which contain these compounds, could be an alternative option for COVID-19 herbal prevention.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12906-022-03686-y.

Yang R, Zha X, Gao X, Wang K, Cheng B, Yan B. Multi-stage virtual screening of natural products against p38α mitogen-activated protein kinase: predictive modeling by machine learning, docking study and molecular dynamics simulation. Heliyon 2022;8:e10495. [PMID: 36105464 PMCID: PMC9465123 DOI: 10.1016/j.heliyon.2022.e10495] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 01/19/2022] [Revised: 03/20/2022] [Accepted: 08/25/2022] [Indexed: 11/20/2022] Open

Panda G, Mishra N, Sharma D, Kutum R, Bhoyar RC, Jain A, Imran M, Senthilvel V, Divakar MK, Mishra A, Garg P, Banerjee P, Sivasubbu S, Scaria V, Ray A. Comprehensive Assessment of Indian Variations in the Druggable Kinome Landscape Highlights Distinct Insights at the Sequence, Structure and Pharmacogenomic Stratum. Front Pharmacol 2022;13:858345. [PMID: 35865963 PMCID: PMC9294532 DOI: 10.3389/fphar.2022.858345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/19/2022] [Accepted: 06/07/2022] [Indexed: 11/13/2022] Open

Abstract

India accounts for more than 17% of the world's population and has a diverse genetic makeup, with several clinically relevant rare mutations belonging to many sub-groups that are underrepresented in global sequencing datasets such as the 1000 Genome data (1KG), which contains limited samples of Indian ethnicity. Such databases are critical for the pharmaceutical and drug development industry, where diversity plays a crucial role in identifying genetic disposition towards adverse drug reactions. A qualitative and comparative sequence and structural study utilizing variant information present in the recently published, largest curated Indian genome database (IndiGen) and the 1000 Genome data was performed for variants belonging to the kinase coding genes, the second most targeted group of drug targets. The sequence-level analysis identified similarities and differences among different populations based on the nsSNVs and amino acid exchange frequencies, whereas a comparative structural analysis of IndiGen variants was performed with pathogenic variants reported in UniProtKB Humsavar data. The influence of these variations on structural features of the protein, such as structural stability, solvent accessibility, hydrophobicity, and the hydrogen-bond network, was investigated. In-silico screening of the known drugs against these Indian variation-containing proteins reveals critical differences in binding strength due to the variations present in the Indian population. In conclusion, this study constitutes a comprehensive investigation of common variations present in the second largest population in the world and of their implications for the sequence, structural and pharmacogenomic landscape.
The preliminary investigation reported in this paper, supporting the screening and detection of ADRs specific to the Indian population, could aid in the development of techniques for pre-clinical and post-market screening of drug-related adverse events in the Indian population.

Affiliation(s)

Gayatri Panda Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla, India
Neha Mishra Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla, India
Disha Sharma Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India; CSIR-Institute of Genomics and Integrative Biology, Delhi, India
Rintu Kutum Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India; CSIR-Institute of Genomics and Integrative Biology, Delhi, India; Ashoka University, Sonipat, India
Rahul C. Bhoyar CSIR-Institute of Genomics and Integrative Biology, Delhi, India
Abhinav Jain Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India; CSIR-Institute of Genomics and Integrative Biology, Delhi, India
Mohamed Imran Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India; CSIR-Institute of Genomics and Integrative Biology, Delhi, India
Vigneshwar Senthilvel Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India; CSIR-Institute of Genomics and Integrative Biology, Delhi, India
Mohit Kumar Divakar Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India; CSIR-Institute of Genomics and Integrative Biology, Delhi, India
Anushree Mishra CSIR-Institute of Genomics and Integrative Biology, Delhi, India
Parth Garg Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla, India
Priyanka Banerjee Institute for Physiology, Charité-University Medicine Berlin, Berlin, Germany
Sridhar Sivasubbu Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India; CSIR-Institute of Genomics and Integrative Biology, Delhi, India
Vinod Scaria Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India; CSIR-Institute of Genomics and Integrative Biology, Delhi, India
Arjun Ray Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla, India; *Correspondence: Arjun Ray

Yang R, Zhao G, Cheng B, Yan B. Identification of potential matrix metalloproteinase-2 inhibitors from natural products through advanced machine learning-based cheminformatics approaches. Mol Divers 2022:10.1007/s11030-022-10467-9. [PMID: 35773549 DOI: 10.1007/s11030-022-10467-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/20/2022] [Accepted: 05/20/2022] [Indexed: 11/29/2022]

Machine Learning for the Prediction of Antiviral Compounds Targeting Avian Influenza A/H9N2 Viral Proteins. Symmetry (Basel) 2022. [DOI: 10.3390/sym14061114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/04/2023] Open

Zhu Y, Du C, Zheng H, Wang F, Tian F, Liu X, Li D. Molecular representation of coal-derived asphaltene based on high resolution mass spectrometry. Arab J Chem 2022. [DOI: 10.1016/j.arabjc.2021.103531] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 01/23/2023] Open

Screening of Potential Indonesia Herbal Compounds Based on Multi-Label Classification for 2019 Coronavirus Disease. Big Data and Cognitive Computing 2021. [DOI: 10.3390/bdcc5040075] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 12/15/2022]

Lee Y, Nam S. Performance Comparisons of AlexNet and GoogLeNet in Cell Growth Inhibition IC50 Prediction. Int J Mol Sci 2021;22:7721. [PMID: 34299341 PMCID: PMC8305019 DOI: 10.3390/ijms22147721] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 05/12/2021] [Revised: 07/09/2021] [Accepted: 07/16/2021] [Indexed: 12/17/2022] Open

Miranda-Quintana RA, Rácz A, Bajusz D, Héberger K. Extended similarity indices: the benefits of comparing more than two objects simultaneously. Part 2: speed, consistency, diversity selection. J Cheminform 2021;13:33. [PMID: 33892799 PMCID: PMC8067665 DOI: 10.1186/s13321-021-00504-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 11/20/2020] [Accepted: 03/12/2021] [Indexed: 11/10/2022] Open

Abstract

Despite being a central concept in cheminformatics, molecular similarity has so far been limited to the comparison of only two molecules at a time using one index, generally the Tanimoto coefficient. In a recent contribution we have not only introduced a complete mathematical framework for extended similarity calculations (i.e. comparisons of more than two molecules at a time) but also defined a series of novel indices. Part 1 is a detailed analysis of the effects of various parameters on the similarity values calculated by the extended formulas. Their features were revealed by sum of ranking differences and ANOVA. Here, in addition to characterizing several important aspects of the newly introduced similarity metrics, we will highlight their applicability and utility in real-life scenarios using datasets with popular molecular fingerprints. Remarkably, for large datasets, the use of extended similarity measures provides an unprecedented speed-up over “traditional” pairwise similarity matrix calculations. We also provide illustrative examples of a more direct algorithm based on the extended Tanimoto similarity to select diverse compound sets, resulting in much higher levels of diversity than traditional approaches. We discuss the inner and outer consistency of our indices, which are key in practical applications, showing whether the n-ary and binary indices rank the data in the same way. We demonstrate the use of the new n-ary similarity metrics on t-distributed stochastic neighbor embedding (t-SNE) plots of datasets of varying diversity, or corresponding to ligands of different pharmaceutical targets, which show that our indices provide a better measure of set compactness than standard binary measures. We also present a conceptual example of the applicability of our indices in agglomerative hierarchical algorithms. The Python code for calculating the extended similarity metrics is freely available at: https://github.com/ramirandaq/MultipleComparisons
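For context on the "traditional" pairwise baseline that the abstract contrasts with its n-ary indices, the sketch below computes the binary Tanimoto coefficient and averages it over all pairs in a set. It is only the pairwise baseline, not the authors' extended similarity indices (their code is in the repository linked above). Fingerprints are represented here as Python sets of on-bit positions; the function names and toy fingerprints are hypothetical.

```python
from itertools import combinations


def tanimoto(a, b):
    """Binary Tanimoto coefficient |A ∩ B| / |A ∪ B| for two sets of on-bit indices."""
    union = len(a | b)
    if union == 0:
        # Assumed convention for this sketch: two empty fingerprints are identical.
        return 1.0
    return len(a & b) / union


def mean_pairwise_tanimoto(fingerprints):
    """Average Tanimoto over all n*(n-1)/2 unordered pairs of fingerprints."""
    pairs = list(combinations(fingerprints, 2))
    if not pairs:
        raise ValueError("need at least two fingerprints")
    return sum(tanimoto(a, b) for a, b in pairs) / len(pairs)


# Toy fingerprints (made up), each a set of on-bit positions.
toy_fps = [{0, 1, 2, 3}, {2, 3, 4}, {0, 3, 5, 6}]
print(tanimoto(toy_fps[0], toy_fps[1]))
print(mean_pairwise_tanimoto(toy_fps))
```

Because the number of pairs grows as n(n-1)/2, this pairwise approach scales quadratically with the number of molecules, which is the cost the abstract says the extended measures avoid.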

Shen WX, Zeng X, Zhu F, Wang YL, Qin C, Tan Y, Jiang YY, Chen YZ. Out-of-the-box deep learning prediction of pharmaceutical properties by broadly learned knowledge-based molecular representations. Nat Mach Intell 2021. [DOI: 10.1038/s42256-021-00301-6] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Indexed: 01/04/2023]

Čmelo I, Voršilák M, Svozil D. Profiling and analysis of chemical compounds using pointwise mutual information. J Cheminform 2021;13:3. [PMID: 33423694 PMCID: PMC7798221 DOI: 10.1186/s13321-020-00483-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 10/05/2020] [Accepted: 12/24/2020] [Indexed: 12/21/2022] Open

Abstract

Pointwise mutual information (PMI) is a measure of association used in information theory. In this paper, PMI is used to characterize several publicly available databases (DrugBank, ChEMBL, PubChem and ZINC) in terms of association strength between compound structural features, resulting in database PMI interrelation profiles. As structural features, substructure fragments obtained by coding individual compounds as MACCS, PubChemKey and ECFP fingerprints are used. The analysis of publicly available databases reveals, in accord with other studies, unusual properties of DrugBank compounds, which further confirms the validity of the PMI profiling approach. Z-standardized relative feature tightness (ZRFT), a PMI-derived measure that quantifies how well a given compound's feature combinations fit those in a particular compound set, is applied to the analysis of compound synthetic accessibility (SA), as well as to the classification of compounds as easy (ES) or hard (HS) to synthesize. ZRFT value distributions are compared with those of SYBA and SAScore. The analysis of ZRFT values of structurally complex compounds in the SAVI database reveals oligopeptide structures that are mispredicted by SAScore as HS, while correctly predicted by ZRFT and SYBA as ES. Compared to SAScore, SYBA and random forest, ZRFT predictions are less accurate, though by a narrow margin (AccZRFT = 94.5%, AccSYBA = 98.8%, AccSAScore = 99.0%, AccRF = 97.3%). However, the ability of ZRFT to distinguish between ES and HS compounds is surprisingly high considering that, while SYBA, SAScore and random forest are dedicated SA models, ZRFT is a generic measurement that merely quantifies the strength of interrelations between structural feature pairs. The results presented in the current work indicate that structural feature co-occurrence, quantified by PMI or ZRFT, contains a significant amount of information relevant to physico-chemical properties of organic compounds.
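To make the notion of PMI between structural features concrete, the sketch below applies the standard definition, PMI(i, j) = log2( p(i, j) / (p(i) p(j)) ), to the co-occurrence of two fingerprint bits across a compound set. It illustrates the textbook formula only; it is not the authors' profiling pipeline and does not implement ZRFT. The function name `pmi`, the set-of-on-bits representation, and the toy data are assumptions.

```python
import math


def pmi(fingerprints, i, j):
    """Pointwise mutual information (base 2) between fingerprint bits i and j.

    fingerprints: list of sets of on-bit indices, one set per compound.
    Probabilities are estimated as simple relative frequencies over the set.
    Returns -inf when the two bits never co-occur.
    """
    n = len(fingerprints)
    n_i = sum(1 for fp in fingerprints if i in fp)
    n_j = sum(1 for fp in fingerprints if j in fp)
    n_ij = sum(1 for fp in fingerprints if i in fp and j in fp)
    if n_i == 0 or n_j == 0:
        raise ValueError("PMI is undefined when a bit never occurs")
    if n_ij == 0:
        return float("-inf")
    return math.log2((n_ij / n) / ((n_i / n) * (n_j / n)))


# Toy compound set (made up): bit 1 only ever appears together with bit 0.
toy_set = [{0, 1}, {0, 1}, {0}, {2}]
print(pmi(toy_set, 0, 1))  # positive: the bits co-occur more often than chance
print(pmi(toy_set, 0, 2))  # -inf: the bits never co-occur
```

A positive value means the two features appear together more often than independence would predict; a negative value means less often.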

Choi KE, Balupuri A, Kang NS. The Study on the hERG Blocker Prediction Using Chemical Fingerprint Analysis. Molecules 2020;25:E2615. [PMID: 32512802 PMCID: PMC7321128 DOI: 10.3390/molecules25112615] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Received: 05/06/2020] [Revised: 06/01/2020] [Accepted: 06/02/2020] [Indexed: 01/31/2023] Open

Kammeraad JA, Goetz J, Walker EA, Tewari A, Zimmerman PM. What Does the Machine Learn? Knowledge Representations of Chemical Reactivity. J Chem Inf Model 2020;60:1290-1301. [PMID: 32091880 DOI: 10.1021/acs.jcim.9b00721] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Indexed: 12/30/2022]

Yang Y, Zhang Y, Hua Y, Chen X, Fan Y, Wang Y, Liang L, Deng C, Lu T, Chen Y, Liu H. In Silico Design and Analysis of a Kinase-Focused Combinatorial Library Considering Diversity and Quality. J Chem Inf Model 2020;60:92-107. [PMID: 31886658 DOI: 10.1021/acs.jcim.9b00841] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Indexed: 12/20/2022]

Affiliation(s)

Yan Yang: Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Yanmin Zhang: Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Yi Hua: Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Xingye Chen: Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Yuanrong Fan: Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Yuchen Wang: Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Li Liang: Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Chenglong Deng: Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Tao Lu: Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China; State Key Laboratory of Natural Medicines, China Pharmaceutical University, 24 Tongjiaxiang, Nanjing 210009, China
Yadong Chen: Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
Haichun Liu: Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China

Vo AH, Van Vleet TR, Gupta RR, Liguori MJ, Rao MS. An Overview of Machine Learning and Big Data for Drug Toxicity Evaluation. Chem Res Toxicol 2019;33:20-37. [DOI: 10.1021/acs.chemrestox.9b00227] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Indexed: 02/07/2023]

Walker E, Kammeraad J, Goetz J, Robo MT, Tewari A, Zimmerman PM. Learning To Predict Reaction Conditions: Relationships between Solvent, Molecular Structure, and Catalyst. J Chem Inf Model 2019;59:3645-3654. [PMID: 31381340 DOI: 10.1021/acs.jcim.9b00313] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Indexed: 11/29/2022]

In Silico Drug-Target Profiling. Methods Mol Biol 2019;1953:89-103. [PMID: 30912017 DOI: 10.1007/978-1-4939-9145-7_6] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Indexed: 12/13/2022]

A review of ligand-based virtual screening web tools and screening algorithms in large molecular databases in the age of big data. Future Med Chem 2018;10:2641-2658. [PMID: 30499744 DOI: 10.4155/fmc-2018-0076] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Indexed: 12/22/2022] Open

Sánchez-Cruz N, Medina-Franco JL. Statistical-based database fingerprint: chemical space dependent representation of compound databases. J Cheminform 2018;10:55. [PMID: 30467740 PMCID: PMC6755589 DOI: 10.1186/s13321-018-0311-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 09/08/2018] [Accepted: 11/14/2018] [Indexed: 11/30/2022] Open

Abstract

Background

Simplified representation of compound databases has several applications in cheminformatics. Herein, we introduce an alternative and general method to build single fingerprint representations of compound databases. The approach is inspired by the previously published modal fingerprints, which aim to capture the most significant bits of a fingerprint representation for a compound data set. The novelty of the statistical-based database fingerprint (SB-DFP) proposed herein is that it is generated from binomial proportion comparisons, taking as reference the distribution of “1” bits in a large representative set of the chemical space.

Results

To illustrate the method, SB-DFPs were constructed for 28 epigenetic target data sets retrieved from a recently published epigenomics database of interest in probe and drug discovery. For each target data set, the SB-DFPs were built based on two representative fingerprints of different design, using as reference a data set with more than 15 million compounds from ZINC. The application of SB-DFP was illustrated and compared to other methods through association relationships of the 28 epigenetic data sets and similarity searching. It was found that SB-DFPs captured, overall, the common features between data sets and the distinct features of each set. In similarity searching, SB-DFP equaled or outperformed other approaches for at least 20 out of the 28 sets.

Conclusions

SB-DFP is a general approach based on binomial proportion comparisons to represent a compound data set with a single fingerprint. SB-DFP can be developed, at least in principle, based on any fingerprint and reference data set. SB-DFP is a good alternative for exploring relationships between targets through their associated compound data sets and for performing similarity searching.
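The idea of a fingerprint built from binomial proportion comparisons can be sketched in Python as follows. This is only an illustration under stated assumptions: the function name `sb_dfp`, the use of a one-sided one-sample z-test that treats the reference proportion as known, the `z_threshold` value, and the toy data are all hypothetical choices, and the abstract does not specify the exact test or cutoff used by the authors.

```python
import math


def sb_dfp(dataset_fps, reference_on_fraction, z_threshold=2.58):
    """Build one database-level fingerprint from per-compound fingerprints.

    dataset_fps: list of equal-length lists of 0/1 bits, one per compound.
    reference_on_fraction: per-bit fraction of "1" bits in a large reference
        set (assumed precomputed, e.g. from a sample of chemical space).
    z_threshold: hypothetical significance cutoff for the z statistic.

    A bit is set to 1 when its "on" proportion in the data set is
    significantly higher than in the reference (an assumed decision rule).
    """
    n = len(dataset_fps)
    fingerprint = []
    for k, p_ref in enumerate(reference_on_fraction):
        p_set = sum(fp[k] for fp in dataset_fps) / n
        # Standard error of a proportion under the null p_set == p_ref.
        se = math.sqrt(p_ref * (1.0 - p_ref) / n)
        if se == 0.0:
            # Degenerate reference bit (always on or always off).
            fingerprint.append(1 if p_set > p_ref else 0)
            continue
        z = (p_set - p_ref) / se
        fingerprint.append(1 if z > z_threshold else 0)
    return fingerprint


# Hypothetical data set of 100 compounds with 3-bit fingerprints.
dataset = [[1, 0, 1]] * 80 + [[0, 0, 1]] * 20
reference = [0.10, 0.10, 0.95]
result = sb_dfp(dataset, reference)
print(result)
```

In this toy case the third bit is on in every compound of the data set but is also very common in the reference, so it is not selected. That is the property that distinguishes a reference-based fingerprint from one that simply keeps the most frequent bits.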

Electronic supplementary material

The online version of this article (10.1186/s13321-018-0311-x) contains supplementary material, which is available to authorized users.

Saldívar-González FI, Gómez-García A, Chávez-Ponce de León DE, Sánchez-Cruz N, Ruiz-Rios J, Pilón-Jiménez BA, Medina-Franco JL. Inhibitors of DNA Methyltransferases From Natural Sources: A Computational Perspective. Front Pharmacol 2018;9:1144. [PMID: 30364171 PMCID: PMC6191485 DOI: 10.3389/fphar.2018.01144] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Received: 08/08/2018] [Accepted: 09/21/2018] [Indexed: 12/15/2022] Open

Capuzzi SJ, Sun W, Muratov EN, Martínez-Romero C, He S, Zhu W, Li H, Tawa G, Fisher EG, Xu M, Shinn P, Qiu X, García-Sastre A, Zheng W, Tropsha A. Computer-Aided Discovery and Characterization of Novel Ebola Virus Inhibitors. J Med Chem 2018;61:3582-3594. [PMID: 29624387 DOI: 10.1021/acs.jmedchem.8b00035] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Indexed: 01/03/2023]

Affiliation(s)

Stephen J Capuzzi: Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
Wei Sun: National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland 20892, United States
Eugene N Muratov: Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States; Department of Chemical Technology, Odessa National Polytechnic University, Odessa 65000, Ukraine
Carles Martínez-Romero: Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, New York 10029, United States; Global Health and Emerging Pathogens Institute, Icahn School of Medicine at Mount Sinai, New York, New York 10029, United States
Shihua He: Special Pathogens Program, National Microbiology Laboratory, Public Health Agency of Canada, 1015 Arlington Street, Winnipeg, Manitoba R3E 3R2, Canada
Wenjun Zhu: Special Pathogens Program, National Microbiology Laboratory, Public Health Agency of Canada, 1015 Arlington Street, Winnipeg, Manitoba R3E 3R2, Canada; Department of Medical Microbiology, University of Manitoba, 745 Bannatyne Avenue, Winnipeg, Manitoba R3E 0J9, Canada
Hao Li: National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland 20892, United States
Gregory Tawa: National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland 20892, United States
Ethan G Fisher: National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland 20892, United States
Miao Xu: National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland 20892, United States
Paul Shinn: National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland 20892, United States
Xiangguo Qiu: Special Pathogens Program, National Microbiology Laboratory, Public Health Agency of Canada, 1015 Arlington Street, Winnipeg, Manitoba R3E 3R2, Canada; Department of Medical Microbiology, University of Manitoba, 745 Bannatyne Avenue, Winnipeg, Manitoba R3E 0J9, Canada
Adolfo García-Sastre: Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, New York 10029, United States; Global Health and Emerging Pathogens Institute, Icahn School of Medicine at Mount Sinai, New York, New York 10029, United States; Department of Medicine, Division of Infectious Diseases, Icahn School of Medicine at Mount Sinai, New York, New York 10029, United States
Wei Zheng: National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland 20892, United States
Alexander Tropsha: Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States

Naveja JJ, Medina-Franco JL. Insights from pharmacological similarity of epigenetic targets in epipolypharmacology. Drug Discov Today 2018;23:141-150. [DOI: 10.1016/j.drudis.2017.10.006] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Received: 08/03/2017] [Revised: 09/05/2017] [Accepted: 10/05/2017] [Indexed: 01/10/2023]

Naveja JJ, Oviedo-Osornio CI, Trujillo-Minero NN, Medina-Franco JL. Chemoinformatics: a perspective from an academic setting in Latin America. Mol Divers 2017;22:247-258. [PMID: 29204824 DOI: 10.1007/s11030-017-9802-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Received: 10/05/2017] [Accepted: 11/26/2017] [Indexed: 12/13/2022]

The potential role of in silico approaches to identify novel bioactive molecules from natural resources. Future Med Chem 2017;9:1665-1686. [PMID: 28841048 DOI: 10.4155/fmc-2017-0124] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Indexed: 12/27/2022] Open