1
|
Karasev DA, Sobolev BN, Filimonov DA, Lagunin A. Prediction of viral protease inhibitors using proteochemometrics approach. Comput Biol Chem 2024; 110:108061. [PMID: 38574417 DOI: 10.1016/j.compbiolchem.2024.108061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Revised: 03/21/2024] [Accepted: 03/23/2024] [Indexed: 04/06/2024]
Abstract
Being widely accepted tools in computational drug search, the (Q)SAR methods have limitations related to data incompleteness. The proteochemometrics (PCM) approach expands the applicability area by using description for both protein and ligand structures. The PCM algorithms are urgently required for the development of new antiviral agents. We suggest the PCM method using the TLMNA descriptors, combining the MNA descriptors of ligands and protein sequence N-grams. Our method was validated on the viral chymotrypsin-like proteases and their ligands. We have developed an original protocol allowing us to collect a comprehensive set of 15 protein sequences and more than 9000 ligands from the ChEMBL database. The N-grams were derived from the 3D-based alignment, accurately superposing ligand-binding regions. In testing the ligand set in SAR mode with MNA descriptors, an accuracy above 0.95 was determined that shows the perspective of the antiviral drug search in virtual chemical libraries. The effective PCM models were built with the TLMNA descriptor. The strong validation procedure with pair exclusion simulated the prediction of interactions between the new ligands and new targets, resulting in accuracy estimation up to 0.89. The PCM approach shows slightly lower accuracy caused by more uncertainty compared with SAR, but it overcomes the problem of data incompleteness.
Collapse
Affiliation(s)
- Dmitry A Karasev
- Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow 119121, Russia.
| | - Boris N Sobolev
- Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow 119121, Russia
| | - Dmitry A Filimonov
- Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow 119121, Russia
| | - Alexey Lagunin
- Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow 119121, Russia; Department of Bioinformatics, Pirogov Russian National Research Medical University, Moscow 117997, Russia
| |
Collapse
|
2
|
Abdullah Z, Chee HY, Yusof R, Mohd Fauzi F. Finding Lead Compounds for Dengue Antivirals from a Collection of Old Drugs through In Silico Target Prediction and Subsequent In Vitro Validation. ACS OMEGA 2023; 8:32483-32497. [PMID: 37720780 PMCID: PMC10500654 DOI: 10.1021/acsomega.3c02607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Accepted: 07/14/2023] [Indexed: 09/19/2023]
Abstract
Dengue virus (DENV) infection is one of the most widely spread flavivirus infections. Despite the fatality it could cause, no antiviral treatment is currently available to treat the disease. Hence, this study aimed to repurpose old drugs as novel DENV NS3 inhibitors. Ligand-based (L-B) and proteochemometric (PCM) prediction models were built using 62,354 bioactivity data to screen for potential NS3 inhibitors. Selected drugs were then subjected to the foci forming unit reduction assay (FFURA) and protease inhibition assay. Finally, molecular docking was performed to validate these results. The in silico studies revealed that both models performed well in the internal and external validations. However, the L-B model showed better accuracy in the external validation in terms of its sensitivity (0.671). In the in vitro validation, all drugs (zileuton, trimethadione, and linalool) were able to moderately inhibit the viral activities at the highest concentration tested. Zileuton showed comparable results with linalool when tested at 2 mM against the DENV NS3 protease, with a reduction of protease activity at 17.89 and 18.42%, respectively. Two new compounds were also proposed through the combination of the selected drugs, which are ziltri (zilueton + trimethadione) and zilool (zileuton + linalool). The molecular docking study confirms the in vitro observations where all drugs and proposed compounds were able to achieve binding affinity ≥ -4.1 kcal/mol, with ziltri showing the highest affinity at -7.7 kcal/mol, surpassing the control, panduratin A. The occupation of both S1 and S2 subpockets of NS2B-NS3 may be essential and a reason for the lower binding energy shown by the proposed compounds compared to the screened drugs. Based on the results, this study provided five potential new lead compounds (ziltri, zilool, zileuton, linalool, and trimethadione) for DENV that could be modified further.
Collapse
Affiliation(s)
- Zafirah
Liyana Abdullah
- Department
of Pharmaceutical Life Sciences, Faculty of Pharmacy, Universiti Teknologi MARA Selangor, Puncak Alam Campus, 42300 Bandar Puncak Alam, Selangor, Malaysia
| | - Hui-Yee Chee
- Department
of Medical Microbiology, Faculty of Medicine and Health Sciences, Universiti Putra Malaysia, 43400 UPM Serdang, Selangor, Malaysia
| | - Rohana Yusof
- Department
of Molecular Medicine, Faculty of Medicine, University of Malaya, 50603 Kuala Lumpur, Malaysia
| | - Fazlin Mohd Fauzi
- Department
of Pharmacology and Pharmaceutical Chemistry, Faculty of Pharmacy, UiTM Selangor, Puncak Alam Campus, 42300 Bandar Puncak Alam, Selangor, Malaysia
- Collaborative
Drug Discovery Research, Faculty of Pharmacy, Universiti Teknologi MARA Selangor, Puncak Alam Campus, 42300 Bandar Puncak Alam, Selangor, Malaysia
| |
Collapse
|
3
|
Guttman Y, Kerem Z. Computer-Aided (In Silico) Modeling of Cytochrome P450-Mediated Food–Drug Interactions (FDI). Int J Mol Sci 2022; 23:ijms23158498. [PMID: 35955630 PMCID: PMC9369352 DOI: 10.3390/ijms23158498] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2022] [Revised: 07/26/2022] [Accepted: 07/28/2022] [Indexed: 02/01/2023] Open
Abstract
Modifications of the activity of Cytochrome 450 (CYP) enzymes by compounds in food might impair medical treatments. These CYP-mediated food–drug interactions (FDI) play a major role in drug clearance in the intestine and liver. Inter-individual variation in both CYP expression and structure is an important determinant of FDI. Traditional targeted approaches have highlighted a limited number of dietary inhibitors and single-nucleotide variations (SNVs), each determining personal CYP activity and inhibition. These approaches are costly in time, money and labor. Here, we review computational tools and databases that are already available and are relevant to predicting CYP-mediated FDIs. Computer-aided approaches such as protein–ligand interaction modeling and the virtual screening of big data narrow down hundreds of thousands of items in databanks to a few putative targets, to which the research resources could be further directed. Structure-based methods are used to explore the structural nature of the interaction between compounds and CYP enzymes. However, while collections of chemical, biochemical and genetic data are available today and call for the implementation of big-data approaches, ligand-based machine-learning approaches for virtual screening are still scarcely used for FDI studies. This review of CYP-mediated FDIs promises to attract scientists and the general public.
Collapse
|
4
|
Wei Y, Li S, Li Z, Wan Z, Lin J. Interpretable-ADMET: a web service for ADMET prediction and optimization based on deep neural representation. Bioinformatics 2022; 38:2863-2871. [PMID: 35561160 DOI: 10.1093/bioinformatics/btac192] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 03/05/2022] [Accepted: 03/28/2022] [Indexed: 11/15/2022] Open
Abstract
MOTIVATION In the process of discovery and optimization of lead compounds, it is difficult for non-expert pharmacologists to intuitively determine the contribution of substructure to a particular property of a molecule. RESULTS In this work, we develop a user-friendly web service, named interpretable-absorption, distribution, metabolism, excretion and toxicity (ADMET), which predict 59 ADMET-associated properties using 90 qualitative classification models and 28 quantitative regression models based on graph convolutional neural network and graph attention network algorithms. In interpretable-ADMET, there are 250 729 entries associated with 59 kinds of ADMET-associated properties for 80 167 chemical compounds. In addition to making predictions, interpretable-ADMET provides interpretation models based on gradient-weighted class activation map for identifying the substructure, which is important to the particular property. Interpretable-ADMET also provides an optimize module to automatically generate a set of novel virtual candidates based on matched molecular pair rules. We believe that interpretable-ADMET could serve as a useful tool for lead optimization in drug discovery. AVAILABILITY AND IMPLEMENTATION Interpretable-ADMET is available at http://cadd.pharmacy.nankai.edu.cn/interpretableadmet/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yu Wei
- State Key Laboratory of Medicinal Chemical Biology, Frontiers Science Center for Cell Responses, College of Pharmacy and Tianjin Key Laboratory of Molecular Drug Research, Nankai University, Tianjin 300353, China
| | - Shanshan Li
- State Key Laboratory of Medicinal Chemical Biology, Frontiers Science Center for Cell Responses, College of Pharmacy and Tianjin Key Laboratory of Molecular Drug Research, Nankai University, Tianjin 300353, China
- Platform of Pharmaceutical Intelligence, Tianjin International Joint Academy of Biomedicine, Tianjin 300457, China
| | - Zhonglin Li
- State Key Laboratory of Medicinal Chemical Biology, Frontiers Science Center for Cell Responses, College of Pharmacy and Tianjin Key Laboratory of Molecular Drug Research, Nankai University, Tianjin 300353, China
- Platform of Pharmaceutical Intelligence, Tianjin International Joint Academy of Biomedicine, Tianjin 300457, China
| | - Ziwei Wan
- State Key Laboratory of Medicinal Chemical Biology, Frontiers Science Center for Cell Responses, College of Pharmacy and Tianjin Key Laboratory of Molecular Drug Research, Nankai University, Tianjin 300353, China
- Platform of Pharmaceutical Intelligence, Tianjin International Joint Academy of Biomedicine, Tianjin 300457, China
| | - Jianping Lin
- State Key Laboratory of Medicinal Chemical Biology, Frontiers Science Center for Cell Responses, College of Pharmacy and Tianjin Key Laboratory of Molecular Drug Research, Nankai University, Tianjin 300353, China
- Platform of Pharmaceutical Intelligence, Tianjin International Joint Academy of Biomedicine, Tianjin 300457, China
- Biodesign Center, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| |
Collapse
|
5
|
Guttman Y, Kerem Z. Dietary Inhibitors of CYP3A4 Are Revealed Using Virtual Screening by Using a New Deep-Learning Classifier. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2022; 70:2752-2761. [PMID: 35104412 PMCID: PMC8895463 DOI: 10.1021/acs.jafc.2c00237] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 01/17/2022] [Accepted: 01/20/2022] [Indexed: 05/29/2023]
Abstract
CYP3A4 is the main human enzyme responsible for phase I metabolism of dietary compounds, prescribed drugs and xenobiotics, steroid hormones, and bile acids. The inhibition of CYP3A4 activity might impair physiological mechanisms, including the endocrine system and response to drug admission. Here, we aimed to discover new CYP3A4 inhibitors from food and dietary supplements. A deep-learning model was built that classifies compounds as either an inhibitor or noninhibitor, with a high specificity of 0.997. We used this classifier to virtually screen ∼60,000 dietary compounds. Of the 115 identified potential inhibitors, only 31 were previously suggested. Many herbals, as predicted here, might cause impaired metabolism of drugs, and endogenous hormones and bile acids. Additionally, by applying Lipinski's rules of five, 17 compounds were also classified as potential intestine local inhibitors. New CYP3A4 inhibitors predicted by the model, bilobetin and picropodophyllin, were assayed in vitro.
Collapse
|
6
|
Unsupervised Representation Learning for Proteochemometric Modeling. Int J Mol Sci 2021; 22:ijms222312882. [PMID: 34884688 PMCID: PMC8657702 DOI: 10.3390/ijms222312882] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 11/25/2021] [Accepted: 11/26/2021] [Indexed: 11/18/2022] Open
Abstract
In silico protein–ligand binding prediction is an ongoing area of research in computational chemistry and machine learning based drug discovery, as an accurate predictive model could greatly reduce the time and resources necessary for the detection and prioritization of possible drug candidates. Proteochemometric modeling (PCM) attempts to create an accurate model of the protein–ligand interaction space by combining explicit protein and ligand descriptors. This requires the creation of information-rich, uniform and computer interpretable representations of proteins and ligands. Previous studies in PCM modeling rely on pre-defined, handcrafted feature extraction methods, and many methods use protein descriptors that require alignment or are otherwise specific to a particular group of related proteins. However, recent advances in representation learning have shown that unsupervised machine learning can be used to generate embeddings that outperform complex, human-engineered representations. Several different embedding methods for proteins and molecules have been developed based on various language-modeling methods. Here, we demonstrate the utility of these unsupervised representations and compare three protein embeddings and two compound embeddings in a fair manner. We evaluate performance on various splits of a benchmark dataset, as well as on an internal dataset of protein–ligand binding activities and find that unsupervised-learned representations significantly outperform handcrafted representations.
Collapse
|
7
|
Agyapong O, Miller WA, Wilson MD, Kwofie SK. Development of a proteochemometric-based support vector machine model for predicting bioactive molecules of tubulin receptors. Mol Divers 2021; 26:2231-2242. [PMID: 34626303 DOI: 10.1007/s11030-021-10329-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 09/23/2021] [Indexed: 11/26/2022]
Abstract
Microtubules are receiving enormous interest in drug discovery due to the important roles they play in cellular functions. Targeting tubulin polymerization presents an excellent opportunity for the development of anti-tubulin drugs. Drug resistance and high toxicity of currently used tubulin-binding agents have necessitated the pursuit of novel drug candidates with increased therapeutic potency. The design of novel drug candidates can be achieved using efficient computational techniques to support existing efforts. Proteochemometric (PCM) modeling is a computational technique that can be employed to elucidate the bioactivity relations between related targets and multiple ligands. We have developed a PCM-based Support Vector Machine (SVM) approach for predicting the bioactivity between tubulin receptors and small, drug-like molecules. The bioactivity datasets used for training the SVM algorithm were obtained from the Binding DB database. The SVM-based PCM model yielded a good overall predictive performance with an area under the curve (AUC) of 87%, Matthews correlation coefficient (MCC) of 72%, overall accuracy of 93%, and a classification error of 7%. The algorithm allows the prediction of the likelihood of new interactions based on confidence scores between the query datasets, comprising ligands in SMILES format and protein sequences of tubulin targets. The algorithm has been implemented as a web server known as TubPred, accessible via http://35.167.90.225:5000/ .
Collapse
Affiliation(s)
- Odame Agyapong
- Department of Biomedical Engineering, School of Engineering Sciences, College of Basic and Applied Sciences, University of Ghana, PMB LG 77, Legon, Accra, Ghana
- Department of Parasitology, Noguchi Memorial Institute for Medical Research (NMIMR), College of Health Sciences (CHS), University of Ghana, P.O. Box LG 581, Legon, Accra, Ghana
| | - Whelton A Miller
- Department of Medicine, Loyola University Medical Center, Maywood, IL, 60153, USA
- School of Engineering and Applied Science, Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Department of Molecular Pharmacology and Neuroscience, Loyola University Medical Center, Maywood, IL, 60153, USA
| | - Michael D Wilson
- Department of Parasitology, Noguchi Memorial Institute for Medical Research (NMIMR), College of Health Sciences (CHS), University of Ghana, P.O. Box LG 581, Legon, Accra, Ghana
- Department of Medicine, Loyola University Medical Center, Maywood, IL, 60153, USA
| | - Samuel K Kwofie
- Department of Biomedical Engineering, School of Engineering Sciences, College of Basic and Applied Sciences, University of Ghana, PMB LG 77, Legon, Accra, Ghana.
- West African Centre for Cell Biology of Infectious Pathogens, Department of Biochemistry, Cell and Molecular Biology, College of Basic and Applied Sciences, University of Ghana, Accra, Ghana.
| |
Collapse
|
8
|
Plonka W, Stork C, Šícho M, Kirchmair J. CYPlebrity: Machine learning models for the prediction of inhibitors of cytochrome P450 enzymes. Bioorg Med Chem 2021; 46:116388. [PMID: 34488021 DOI: 10.1016/j.bmc.2021.116388] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Revised: 08/19/2021] [Accepted: 08/24/2021] [Indexed: 10/20/2022]
Abstract
The vast majority of approved drugs are metabolized by the five major cytochrome P450 (CYP) isozymes, 1A2, 2C9, 2C19, 2D6 and 3A4. Inhibition of CYP isozymes can cause drug-drug interactions with severe pharmacological and toxicological consequences. Computational methods for the fast and reliable prediction of the inhibition of CYP isozymes by small molecules are therefore of high interest and relevance to pharmaceutical companies and a host of other industries, including the cosmetics and agrochemical industries. Today, a large number of machine learning models for predicting the inhibition of the major CYP isozymes by small molecules are available. With this work we aim to go beyond the coverage of existing models, by combining data from several major public and proprietary sources. More specifically, we used up to 18815 compounds with measured bioactivities to train random forest classification models for the individual CYP isozymes. A major advantage of the new data collection over existing ones is the better representation of the minority class, the CYP inhibitors. With the new data collection we achieved inhibitor-to-non-inhibitor ratios in the order of 1:1 (CYP1A2) to 1:3 (CYP2D6). We show that our models reach competitive performance on external data, with Matthews correlation coefficients (MCCs) ranging from 0.62 (CYP2C19) to 0.70 (CYP2D6), and areas under the receiver operating characteristic curve (AUCs) between 0.89 (CYP2C19) and 0.92 (CYPs 2D6 and 3A4). Importantly, the models show a high level of robustness, reflected in a good predictivity also for compounds that are structurally dissimilar to the compounds represented in the training data. The best models presented in this work are freely accessible for academic research via a web service.
Collapse
Affiliation(s)
- Wojciech Plonka
- Universität Hamburg, Center for Bioinformatics (ZBH), Hamburg, Bundesstr. 43, 20146, Germany; FQS Poland (Fujitsu Group), Parkowa 11, 30-538 Cracow, Poland
| | - Conrad Stork
- Universität Hamburg, Center for Bioinformatics (ZBH), Hamburg, Bundesstr. 43, 20146, Germany
| | - Martin Šícho
- CZ-OPENSCREEN: National Infrastructure for Chemical Biology, Department of Informatics and Chemistry, Faculty of Chemical Technology, University of Chemistry and Technology Prague, Technická 5, 166 28, Prague, Czech Republic
| | - Johannes Kirchmair
- Universität Hamburg, Center for Bioinformatics (ZBH), Hamburg, Bundesstr. 43, 20146, Germany; Department of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, Faculty of Life Sciences, University of Vienna, Althanstr. 14, 1090 Vienna, Austria.
| |
Collapse
|
9
|
Muratov EN, Bajorath J, Sheridan RP, Tetko IV, Filimonov D, Poroikov V, Oprea TI, Baskin II, Varnek A, Roitberg A, Isayev O, Curtarolo S, Fourches D, Cohen Y, Aspuru-Guzik A, Winkler DA, Agrafiotis D, Cherkasov A, Tropsha A. QSAR without borders. Chem Soc Rev 2020; 49:3525-3564. [PMID: 32356548 PMCID: PMC8008490 DOI: 10.1039/d0cs00098a] [Citation(s) in RCA: 312] [Impact Index Per Article: 78.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Prediction of chemical bioactivity and physical properties has been one of the most important applications of statistical and more recently, machine learning and artificial intelligence methods in chemical sciences. This field of research, broadly known as quantitative structure-activity relationships (QSAR) modeling, has developed many important algorithms and has found a broad range of applications in physical organic and medicinal chemistry in the past 55+ years. This Perspective summarizes recent technological advances in QSAR modeling but it also highlights the applicability of algorithms, modeling methods, and validation practices developed in QSAR to a wide range of research areas outside of traditional QSAR boundaries including synthesis planning, nanotechnology, materials science, biomaterials, and clinical informatics. As modern research methods generate rapidly increasing amounts of data, the knowledge of robust data-driven modelling methods professed within the QSAR field can become essential for scientists working both within and outside of chemical research. We hope that this contribution highlighting the generalizable components of QSAR modeling will serve to address this challenge.
Collapse
Affiliation(s)
- Eugene N Muratov
- UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Kato H. Computational prediction of cytochrome P450 inhibition and induction. Drug Metab Pharmacokinet 2019; 35:30-44. [PMID: 31902468 DOI: 10.1016/j.dmpk.2019.11.006] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Revised: 10/27/2019] [Accepted: 11/17/2019] [Indexed: 12/14/2022]
Abstract
Cytochrome P450 (CYP) enzymes play an important role in the phase I metabolism of many xenobiotics. Most drug-drug interactions (DDIs) associated with CYP are caused by either CYP inhibition or induction. The early detection of potential DDIs is highly desirable in the pharmaceutical industry because DDIs can cause serious adverse events, which can lead to poor patient health and drug development failures. Recently, many computational studies predicting CYP inhibition and induction have been reported. The current computational modeling approaches for CYP metabolism are classified as ligand- and structure-based; various techniques, such as quantitative structure-activity relationships, machine learning, docking, and molecular dynamic simulation, are involved in both the approaches. Recently, combining these two approaches have resulted in improvements in the prediction accuracy of DDIs. In this review, we present important, recent developments in the computational prediction of the inhibition of four clinically crucial CYP isoforms (CYP1A2, 2C9, 2D6, and 3A4) and three nuclear receptors (aryl hydrocarbon receptor, constitutive androstane receptor, and pregnane X receptor) involved in the induction of CYP1A2, 2B6, and 3A4, respectively.
Collapse
Affiliation(s)
- Harutoshi Kato
- DMPK Research Laboratories, Mitsubishi Tanabe Pharma Corporation, Aoba-ku, Yokohama-shi, 227-0033, Japan.
| |
Collapse
|
11
|
Worachartcheewan A, Songtawee N, Siriwong S, Prachayasittikul S, Nantasenamat C, Prachayasittikul V. Rational Design of Colchicine Derivatives as anti-HIV Agents via QSAR and Molecular Docking. Med Chem 2019; 15:328-340. [PMID: 30251609 DOI: 10.2174/1573406414666180924163756] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2018] [Revised: 08/24/2018] [Accepted: 08/25/2018] [Indexed: 02/08/2023]
Abstract
BACKGROUND Human immunodeficiency virus (HIV) is an infective agent that causes an acquired immunodeficiency syndrome (AIDS). Therefore, the rational design of inhibitors for preventing the progression of the disease is required. OBJECTIVE This study aims to construct quantitative structure-activity relationship (QSAR) models, molecular docking and newly rational design of colchicine and derivatives with anti-HIV activity. METHODS A data set of 24 colchicine and derivatives with anti-HIV activity were employed to develop the QSAR models using machine learning methods (e.g. multiple linear regression (MLR), artificial neural network (ANN) and support vector machine (SVM)), and to study a molecular docking. RESULTS The significant descriptors relating to the anti-HIV activity included JGI2, Mor24u, Gm and R8p+ descriptors. The predictive performance of the models gave acceptable statistical qualities as observed by correlation coefficient (Q2) and root mean square error (RMSE) of leave-one out cross-validation (LOO-CV) and external sets. Particularly, the ANN method outperformed MLR and SVM methods that displayed LOO-CV 2 Q and RMSELOO-CV of 0.7548 and 0.5735 for LOOCV set, and Ext 2 Q of 0.8553 and RMSEExt of 0.6999 for external validation. In addition, the molecular docking of virus-entry molecule (gp120 envelope glycoprotein) revealed the key interacting residues of the protein (cellular receptor, CD4) and the site-moiety preferences of colchicine derivatives as HIV entry inhibitors for binding to HIV structure. Furthermore, newly rational design of colchicine derivatives using informative QSAR and molecular docking was proposed. CONCLUSION These findings serve as a guideline for the rational drug design as well as potential development of novel anti-HIV agents.
Collapse
Affiliation(s)
- Apilak Worachartcheewan
- Department of Community Medical Technology, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.,Department of Clinical Chemistry, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.,Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Napat Songtawee
- Department of Clinical Chemistry, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Suphakit Siriwong
- Department of Community Medical Technology, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Supaluk Prachayasittikul
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Virapong Prachayasittikul
- Department of Clinical Microbiology and Applied Technology, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| |
Collapse
|
12
|
Mazzolari A, Afzal AM, Pedretti A, Testa B, Vistoli G, Bender A. Prediction of UGT-mediated Metabolism Using the Manually Curated MetaQSAR Database. ACS Med Chem Lett 2019; 10:633-638. [PMID: 30996809 DOI: 10.1021/acsmedchemlett.8b00603] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Accepted: 02/12/2019] [Indexed: 11/30/2022] Open
Abstract
Even though glucuronidations are the most frequent metabolic reactions of conjugation, both in quantitative and qualitative terms, they have rather seldom been investigated using computational approaches. To fill this gap, we have used the manually collected MetaQSAR metabolic reaction database to generate two models for the prediction of UGT-mediated metabolism, both based on molecular descriptors and implementing the Random Forest algorithm. The first model predicts the occurrence of the reaction and was internally validated with a Matthew correlation coefficient (MCC) of 0.76 and an area under the ROC curve (AUC) of 0.94, and further externally validated using a test set composed of 120 additional xenobiotics (MCC of 0.70 and AUC of 0.90). The second model distinguishes between O- and N-glucuronidations and was optimized by the random undersampling procedure to improve the predictive accuracy during the internal validation, with the recall measure of the minority class increasing from 0.55 to 0.78.
Collapse
Affiliation(s)
- Angelica Mazzolari
- Dipartimento di Scienze Farmaceutiche, Facoltà di Scienze del Farmaco, Università degli Studi di Milano, Via Mangiagalli, I-20133 Milano, Italy
| | - Avid M. Afzal
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, CB2 1EW Cambridge, U.K
| | - Alessandro Pedretti
- Dipartimento di Scienze Farmaceutiche, Facoltà di Scienze del Farmaco, Università degli Studi di Milano, Via Mangiagalli, I-20133 Milano, Italy
| | | | - Giulio Vistoli
- Dipartimento di Scienze Farmaceutiche, Facoltà di Scienze del Farmaco, Università degli Studi di Milano, Via Mangiagalli, I-20133 Milano, Italy
| | - Andreas Bender
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, CB2 1EW Cambridge, U.K
| |
Collapse
|
13
|
Giblin KA, Hughes SJ, Boyd H, Hansson P, Bender A. Prospectively Validated Proteochemometric Models for the Prediction of Small-Molecule Binding to Bromodomain Proteins. J Chem Inf Model 2018; 58:1870-1888. [DOI: 10.1021/acs.jcim.8b00400] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Kathryn A. Giblin
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K
| | - Samantha J. Hughes
- Computational Chemistry, Oncology, IMED Biotech Unit, AstraZeneca, Cambridge CB10 1XL, U.K
| | - Helen Boyd
- Discovery Biology, Discovery Sciences, IMED Biotech Unit, AstraZeneca, Gothenburg 431 50 SE, Sweden
| | - Pia Hansson
- Discovery Biology, Discovery Sciences, IMED Biotech Unit, AstraZeneca, Gothenburg 431 50 SE, Sweden
| | - Andreas Bender
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K
| |
Collapse
|
14
|
Abstract
High-throughput and high-content screening campaigns have resulted in the creation of large chemogenomic matrices. These matrices form the training data which is used to build ligand-target interaction models for pharmacological and chemical biology research. While academic, government, and industrial efforts continuously add to the ligand-target data pairs available for modeling, major research efforts are devoted to improving machine learning techniques to cope with the sparseness, heterogeneity, and size of available datasets as well as inherent noise and bias. This "race of arms" has led to the creation of algorithms to generate highly complex models with high prediction performance at the cost of training efficiency as well as interpretability.In contrast, recent studies have challenged the necessity for "big data" in chemogenomic modeling and found that models built on larger numbers of examples do not necessarily result in better predictive abilities. Automated adaptive selection of the training data (ligand-target instances) used for model creation can result in considerably smaller training sets that retain prediction performance on par with training using hundreds of thousands of data points. In this chapter, we describe the protocols used for one such iterative chemogenomic selection technique, including model construction and update as well as possible techniques for evaluations of constructed models and analysis of the iterative model construction.
Collapse
Affiliation(s)
- Daniel Reker
- Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - J B Brown
- Life Science Informatics Research Unit, Laboratory of Molecular Biosciences, Kyoto University Graduate School of Medicine, Kyoto, Japan
| |
Collapse
|
15
|
Lee JH, Basith S, Cui M, Kim B, Choi S. In silico prediction of multiple-category classification model for cytochrome P450 inhibitors and non-inhibitors using machine-learning method. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2017; 28:863-874. [PMID: 29183231 DOI: 10.1080/1062936x.2017.1399925] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
The cytochrome P450 (CYP) enzyme superfamily is involved in phase I metabolism which chemically modifies a variety of substrates via oxidative reactions to make them more water-soluble and easier to eliminate. Inhibition of these enzymes leads to undesirable effects, including toxic drug accumulations and adverse drug-drug interactions. Hence, it is necessary to develop in silico models that can predict the inhibition potential of compounds for different CYP isoforms. This study focused on five major CYP isoforms, including CYP1A2, 2C9, 2C19, 2D6 and 3A4, that are responsible for more than 90% of the metabolism of clinical drugs. The main aim of this study is to develop a multiple-category classification model (MCM) for the major CYP isoforms using a Laplacian-modified naïve Bayesian method. The dataset composed of more than 4500 compounds was collected from the PubChem Bioassay database. VolSurf+ descriptors and FCFP_8 fingerprint were used as input features to build classification models. The results demonstrated that the developed MCM using Laplacian-modified naïve Bayesian method was successful in classifying inhibitors and non-inhibitors for each CYP isoform. Moreover, the accuracy, sensitivity and specificity values for both training and test sets were above 80% and also yielded satisfactory area under the receiver operating characteristic curve and Matthews correlation coefficient values.
Collapse
Affiliation(s)
- J H Lee
- a National Leading Research Laboratory (NLRL) of Molecular Modeling & Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences , Ewha Womans University , Seoul , Republic of Korea
| | - S Basith
- a National Leading Research Laboratory (NLRL) of Molecular Modeling & Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences , Ewha Womans University , Seoul , Republic of Korea
| | - M Cui
- a National Leading Research Laboratory (NLRL) of Molecular Modeling & Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences , Ewha Womans University , Seoul , Republic of Korea
| | - B Kim
- a National Leading Research Laboratory (NLRL) of Molecular Modeling & Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences , Ewha Womans University , Seoul , Republic of Korea
| | - S Choi
- a National Leading Research Laboratory (NLRL) of Molecular Modeling & Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences , Ewha Womans University , Seoul , Republic of Korea
| |
Collapse
|
16
|
Cerisier N, Regad L, Triki D, Petitjean M, Flatters D, Camproux AC. Statistical Profiling of One Promiscuous Protein Binding Site: Illustrated by Urokinase Catalytic Domain. Mol Inform 2017; 36. [PMID: 28696518 DOI: 10.1002/minf.201700040] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Accepted: 06/26/2017] [Indexed: 12/21/2022]
Abstract
While recent literature focuses on drug promiscuity, the characterization of promiscuous binding sites (ability to bind several ligands) remains to be explored. Here, we present a proteochemometric modeling approach to analyze diverse ligands and corresponding multiple binding sub-pockets associated with one promiscuous binding site to characterize protein-ligand recognition. We analyze both geometrical and physicochemical profile correspondences. This approach was applied to examine the well-studied druggable urokinase catalytic domain inhibitor binding site, which results in a large number of complex structures bound to various ligands. This approach emphasizes the importance of jointly characterizing pocket and ligand spaces to explore the impact of ligand diversity on sub-pocket properties and to establish their main profile correspondences. This work supports an interest in mining available 3D holo structures associated with a promiscuous binding site to explore its main protein-ligand recognition tendency.
Collapse
Affiliation(s)
- Natacha Cerisier
- INSERM, UMRS-973, MTi,35, rue Hélène Brion, 75205, PARIS CEDEX 13.,University Paris Diderot, Sorbonne Paris Cité, UMRS-973, MTi
| | - Leslie Regad
- INSERM, UMRS-973, MTi,35, rue Hélène Brion, 75205, PARIS CEDEX 13.,University Paris Diderot, Sorbonne Paris Cité, UMRS-973, MTi
| | - Dhoha Triki
- INSERM, UMRS-973, MTi,35, rue Hélène Brion, 75205, PARIS CEDEX 13.,University Paris Diderot, Sorbonne Paris Cité, UMRS-973, MTi
| | - Michel Petitjean
- INSERM, UMRS-973, MTi,35, rue Hélène Brion, 75205, PARIS CEDEX 13.,University Paris Diderot, Sorbonne Paris Cité, UMRS-973, MTi
| | - Delphine Flatters
- INSERM, UMRS-973, MTi,35, rue Hélène Brion, 75205, PARIS CEDEX 13.,University Paris Diderot, Sorbonne Paris Cité, UMRS-973, MTi
| | - Anne-Claude Camproux
- INSERM, UMRS-973, MTi,35, rue Hélène Brion, 75205, PARIS CEDEX 13.,University Paris Diderot, Sorbonne Paris Cité, UMRS-973, MTi
| |
Collapse
|
17
|
Issa NT, Wathieu H, Ojo A, Byers SW, Dakshanamurthy S. Drug Metabolism in Preclinical Drug Development: A Survey of the Discovery Process, Toxicology, and Computational Tools. Curr Drug Metab 2017; 18:556-565. [PMID: 28302026 PMCID: PMC5892202 DOI: 10.2174/1389200218666170316093301] [Citation(s) in RCA: 59] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2016] [Revised: 12/16/2016] [Accepted: 01/17/2017] [Indexed: 11/22/2022]
Abstract
BACKGROUND While establishing efficacy in translational models and humans through clinically-relevant endpoints for disease is of great interest, assessing the potential toxicity of a putative therapeutic drug is critical. Toxicological assessments in the pre-clinical discovery phase help to avoid future failure in the clinical phases of drug development. Many in vitro assays exist to aid in modular toxicological assessment, such as hepatotoxicity and genotoxicity. While these methods have provided tremendous insight into human toxicity by investigational new drugs, they are expensive, require substantial resources, and do not account for pharmacogenomics as well as critical ADME properties. Computational tools can fill this niche in toxicology if in silico models are accurate in relating drug molecular properties to toxicological endpoints as well as reliable in predicting important drug-target interactions that mediate known adverse events or adverse outcome pathways (AOPs). METHODS We undertook an unstructured search of multiple bibliographic databases for peer-reviewed literature regarding computational methods in predictive toxicology for in silico drug discovery. As this review paper is meant to serve as a survey of available methods for the interested reader, no focused criteria were applied. Literature chosen was based on the writers' expertise and intent in communicating important aspects of in silico toxicology to the interested reader. CONCLUSION This review provides a purview of computational methods of pre-clinical toxicologic assessments for novel small molecule drugs that may be of use for novice and experienced investigators as well as academic and commercial drug discovery entities.
Collapse
Affiliation(s)
- Naiem T. Issa
- Georgetown-Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Washington DC, 20057 USA
| | - Henri Wathieu
- Georgetown-Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Washington DC, 20057 USA
| | - Abiola Ojo
- College of Pharmacy, Howard University, Washington, DC 20059, USA
| | - Stephen W. Byers
- Georgetown-Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Washington DC, 20057 USA
- Department of Biochemistry & Molecular Biology, Georgetown University, Washington DC, 20057, USA
| | - Sivanesan Dakshanamurthy
- Georgetown-Lombardi Comprehensive Cancer Center and Department of Oncology, Georgetown University Medical Center, Washington DC, 20057 USA
- Department of Biochemistry & Molecular Biology, Georgetown University, Washington DC, 20057, USA
| |
Collapse
|
18
|
Dixit VA, Deshpande S. Advances in Computational Prediction of Regioselective and Isoform-Specific Drug Metabolism Catalyzed by CYP450s. ChemistrySelect 2016. [DOI: 10.1002/slct.201601051] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Affiliation(s)
- Vaibhav A. Dixit
- Department of Pharmaceutical Chemistry; School of Pharmacy and Technology Management (SPTM), Shri Vile Parle Kelavani Mandal's (SVKM's) Narsee Monjee Institute of Management Studies (NMIMS), Mukesh Patel Technology Park, Babulde, Bank of Tapi River; Mumbai-Agra Road Shirpur, Dist. Dhule−425405 India
| | - Shirish Deshpande
- Department of Pharmaceutical Chemistry; School of Pharmacy and Technology Management (SPTM), Shri Vile Parle Kelavani Mandal's (SVKM's) Narsee Monjee Institute of Management Studies (NMIMS), Mukesh Patel Technology Park, Babulde, Bank of Tapi River; Mumbai-Agra Road Shirpur, Dist. Dhule−425405 India
| |
Collapse
|
19
|
Rasti B, Namazi M, Karimi-Jafari MH, Ghasemi JB. Proteochemometric Modeling of the Interaction Space of Carbonic Anhydrase and its Inhibitors: An Assessment of Structure-based and Sequence-based Descriptors. Mol Inform 2016; 36. [PMID: 27860295 DOI: 10.1002/minf.201600102] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2015] [Accepted: 10/26/2016] [Indexed: 11/08/2022]
Abstract
Due to its physiological and clinical roles, carbonic anhydrase (CA) is one of the most interesting case studies. There are different classes of CAinhibitors including sulfonamides, polyamines, coumarins and dithiocarbamates (DTCs). However, many of them hardly act as a selective inhibitor against a specific isoform. Therefore, finding highly selective inhibitors for different isoforms of CA is still an ongoing project. Proteochemometrics modeling (PCM) is able to model the bioactivity of multiple compounds against different isoforms of a protein. Therefore, it would be extremely applicable when investigating the selectivity of different ligands towards different receptors. Given the facts, we applied PCM to investigate the interaction space and structural properties that lead to the selective inhibition of CA isoforms by some dithiocarbamates. Our models have provided interesting structural information that can be considered to design compounds capable of inhibiting different isoforms of CA in an improved selective manner. Validity and predictivity of the models were confirmed by both internal and external validation methods; while Y-scrambling approach was applied to assess the robustness of the models. To prove the reliability and the applicability of our findings, we showed how ligands-receptors selectivity can be affected by removing any of these critical findings from the modeling process.
Collapse
Affiliation(s)
- Behnam Rasti
- Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Mohsen Namazi
- Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - M H Karimi-Jafari
- Department of Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Jahan B Ghasemi
- Department of Analytical Chemistry, School of Chemistry, College of Science, University of Tehran, Tehran, Iran
| |
Collapse
|
20
|
Spjuth O, Rydberg P, Willighagen EL, Evelo CT, Jeliazkova N. XMetDB: an open access database for xenobiotic metabolism. J Cheminform 2016; 8:47. [PMID: 27651835 PMCID: PMC5025591 DOI: 10.1186/s13321-016-0161-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2016] [Accepted: 09/04/2016] [Indexed: 12/17/2022] Open
Abstract
Xenobiotic metabolism is an active research topic but the limited amount of openly available high-quality biotransformation data constrains predictive modeling. Current database often default to commonly available information: which enzyme metabolizes a compound, but neither experimental conditions nor the atoms that undergo metabolization are captured. We present XMetDB, an open access database for drugs and other xenobiotics and their respective metabolites. The database contains chemical structures of xenobiotic biotransformations with substrate atoms annotated as reaction centra, the resulting product formed, and the catalyzing enzyme, type of experiment, and literature references. Associated with the database is a web interface for the submission and retrieval of experimental metabolite data for drugs and other xenobiotics in various formats, and a web API for programmatic access is also available. The database is open for data deposition, and a curation scheme is in place for quality control. An extensive guide on how to enter experimental data into is available from the XMetDB wiki. XMetDB formalizes how biotransformation data should be reported, and the openly available systematically labeled data is a big step forward towards better models for predictive metabolism.
Collapse
Affiliation(s)
- Ola Spjuth
- Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, 75124 Uppsala, Sweden
| | - Patrik Rydberg
- Department of Drug Design and Pharmacology, University of Copenhagen, Universitetsparken 2, 2100 Copenhagen, Denmark
| | - Egon L Willighagen
- Department of Bioinformatics - BiGCaT, Maastricht University, P.O. Box 616, UNS50 Box 19, 6200 MD Maastricht, The Netherlands
| | - Chris T Evelo
- Department of Bioinformatics - BiGCaT, Maastricht University, P.O. Box 616, UNS50 Box 19, 6200 MD Maastricht, The Netherlands
| | | |
Collapse
|
21
|
Rasti B, Karimi-Jafari MH, Ghasemi JB. Quantitative Characterization of the Interaction Space of the Mammalian Carbonic Anhydrase Isoforms I, II, VII, IX, XII, and XIV and their Inhibitors, Using the Proteochemometric Approach. Chem Biol Drug Des 2016; 88:341-53. [PMID: 26990115 DOI: 10.1111/cbdd.12759] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2015] [Revised: 01/12/2016] [Accepted: 02/29/2016] [Indexed: 12/23/2022]
Affiliation(s)
- Behnam Rasti
- Department of Bioinformatics; Institute of Biochemistry and Biophysics; University of Tehran; PO Box 13145-1365 Tehran Iran
| | - Mohammad H. Karimi-Jafari
- Department of Bioinformatics; Institute of Biochemistry and Biophysics; University of Tehran; PO Box 13145-1365 Tehran Iran
| | - Jahan B. Ghasemi
- Department of Analytical Chemistry; School of Chemistry; College of Science; University of Tehran; PO Box 13145-1365 Tehran Iran
| |
Collapse
|
22
|
Qiu T, Qiu J, Feng J, Wu D, Yang Y, Tang K, Cao Z, Zhu R. The recent progress in proteochemometric modelling: focusing on target descriptors, cross-term descriptors and application scope. Brief Bioinform 2016; 18:125-136. [PMID: 26873661 DOI: 10.1093/bib/bbw004] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2015] [Revised: 12/09/2015] [Indexed: 12/17/2022] Open
Abstract
As an extension of the conventional quantitative structure activity relationship models, proteochemometric (PCM) modelling is a computational method that can predict the bioactivity relations between multiple ligands and multiple targets. Traditional PCM modelling includes three essential elements: descriptors (including target descriptors, ligand descriptors and cross-term descriptors), bioactivity data and appropriate learning functions that link the descriptors to the bioactivity data. Since its appearance, PCM modelling has developed rapidly over the past decade by taking advantage of the progress of different descriptors and machine learning techniques, along with the increasing amounts of available bioactivity data. Specifically, the new emerging target descriptors and cross-term descriptors not only significantly increased the performance of PCM modelling but also expanded its application scope from traditional protein-ligand interaction to more abundant interactions, including protein-peptide, protein-DNA and even protein-protein interactions. In this review, target descriptors and cross-term descriptors, as well as the corresponding application scope, are intensively summarized. Additionally, we look forward to seeing PCM modelling extend into new application scopes, such as Target-Catalyst-Ligand systems, with the further development of descriptors, machine learning techniques and increasing amounts of available bioactivity data.
Collapse
|
23
|
Worachartcheewan A, Prachayasittikul V, Toropova AP, Toropov AA, Nantasenamat C. Large-scale structure-activity relationship study of hepatitis C virus NS5B polymerase inhibition using SMILES-based descriptors. Mol Divers 2015; 19:955-64. [DOI: 10.1007/s11030-015-9614-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
24
|
Olsen L, Oostenbrink C, Jørgensen FS. Prediction of cytochrome P450 mediated metabolism. Adv Drug Deliv Rev 2015; 86:61-71. [PMID: 25958010 DOI: 10.1016/j.addr.2015.04.020] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2014] [Revised: 03/30/2015] [Accepted: 04/27/2015] [Indexed: 10/23/2022]
Abstract
Cytochrome P450 enzymes (CYPs) form one of the most important enzyme families involved in the metabolism of xenobiotics. CYPs comprise many isoforms, which catalyze a wide variety of reactions, and potentially, a large number of different metabolites can be formed. However, it is often hard to rationalize what metabolites these enzymes generate. In recent years, many different in silico approaches have been developed to predict binding or regioselective product formation for the different CYP isoforms. These comprise ligand-based methods that are trained on experimental CYP data and structure-based methods that consider how the substrate is oriented in the active site or/and how reactive the part of the substrate that is accessible to the heme group is. We will review key aspects for various approaches that are available to predict binding and site of metabolism (SOM), what outcome can be expected from the predictions, and how they could potentially be improved.
Collapse
|
25
|
Recent progresses in the exploration of machine learning methods as in-silico ADME prediction tools. Adv Drug Deliv Rev 2015; 86:83-100. [PMID: 26037068 DOI: 10.1016/j.addr.2015.03.014] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Revised: 03/18/2015] [Accepted: 03/22/2015] [Indexed: 02/05/2023]
Abstract
In-silico methods have been explored as potential tools for assessing ADME and ADME regulatory properties particularly in early drug discovery stages. Machine learning methods, with their ability in classifying diverse structures and complex mechanisms, are well suited for predicting ADME and ADME regulatory properties. Recent efforts have been directed at the broadening of application scopes and the improvement of predictive performance with particular focuses on the coverage of ADME properties, and exploration of more diversified training data, appropriate molecular features, and consensus modeling. Moreover, several online machine learning ADME prediction servers have emerged. Here we review these progresses and discuss the performances, application prospects and challenges of exploring machine learning methods as useful tools in predicting ADME and ADME regulatory properties.
Collapse
|
26
|
Pan X, Chao L, Qu S, Huang S, Yang L, Mei H. An improved large-scale prediction model of CYP1A2 inhibitors by using combined fragment descriptors. RSC Adv 2015. [DOI: 10.1039/c5ra17196b] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Combined fragment descriptors are used to develop a predictive SVM model for virtual screening of CYP1A2 inhibitors.
Collapse
Affiliation(s)
- Xianchao Pan
- Key Laboratory of Biorheological Science and Technology
- Ministry of Education
- Chongqing University
- Chongqing 400044
- China
| | - Li Chao
- College of Bioengineering
- Chongqing University
- Chongqing 400044
- China
| | - Sujun Qu
- College of Bioengineering
- Chongqing University
- Chongqing 400044
- China
| | - Shuheng Huang
- College of Bioengineering
- Chongqing University
- Chongqing 400044
- China
| | - Li Yang
- Key Laboratory of Biorheological Science and Technology
- Ministry of Education
- Chongqing University
- Chongqing 400044
- China
| | - Hu Mei
- Key Laboratory of Biorheological Science and Technology
- Ministry of Education
- Chongqing University
- Chongqing 400044
- China
| |
Collapse
|
27
|
Cortés-Ciriano I, Ain QU, Subramanian V, Lenselink EB, Méndez-Lucio O, IJzerman AP, Wohlfahrt G, Prusis P, Malliavin TE, van Westen GJP, Bender A. Polypharmacology modelling using proteochemometrics (PCM): recent methodological developments, applications to target families, and future prospects. MEDCHEMCOMM 2015. [DOI: 10.1039/c4md00216d] [Citation(s) in RCA: 80] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Proteochemometric (PCM) modelling is a computational method to model the bioactivity of multiple ligands against multiple related protein targets simultaneously.
Collapse
Affiliation(s)
- Isidro Cortés-Ciriano
- Unité de Bioinformatique Structurale
- Institut Pasteur and CNRS UMR 3825
- Structural Biology and Chemistry Department
- 75 724 Paris
- France
| | - Qurrat Ul Ain
- Unilever Centre for Molecular Informatics
- Department of Chemistry
- CB2 1EW Cambridge
- UK
| | | | - Eelke B. Lenselink
- Division of Medicinal Chemistry
- Leiden Academic Centre for Drug Research
- Leiden
- The Netherlands
| | - Oscar Méndez-Lucio
- Unilever Centre for Molecular Informatics
- Department of Chemistry
- CB2 1EW Cambridge
- UK
| | - Adriaan P. IJzerman
- Division of Medicinal Chemistry
- Leiden Academic Centre for Drug Research
- Leiden
- The Netherlands
| | - Gerd Wohlfahrt
- Computer-Aided Drug Design
- Orion Pharma
- FIN-02101 Espoo
- Finland
| | - Peteris Prusis
- Computer-Aided Drug Design
- Orion Pharma
- FIN-02101 Espoo
- Finland
| | - Thérèse E. Malliavin
- Unité de Bioinformatique Structurale
- Institut Pasteur and CNRS UMR 3825
- Structural Biology and Chemistry Department
- 75 724 Paris
- France
| | - Gerard J. P. van Westen
- European Molecular Biology Laboratory
- European Bioinformatics Institute
- Wellcome Trust Genome Campus
- Hinxton
- UK
| | - Andreas Bender
- Unilever Centre for Molecular Informatics
- Department of Chemistry
- CB2 1EW Cambridge
- UK
| |
Collapse
|
28
|
Alvarsson J, Eklund M, Andersson C, Carlsson L, Spjuth O, Wikberg JES. Benchmarking study of parameter variation when using signature fingerprints together with support vector machines. J Chem Inf Model 2014; 54:3211-7. [PMID: 25318024 DOI: 10.1021/ci500344v] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
QSAR modeling using molecular signatures and support vector machines with a radial basis function is increasingly used for virtual screening in the drug discovery field. This method has three free parameters: C, γ, and signature height. C is a penalty parameter that limits overfitting, γ controls the width of the radial basis function kernel, and the signature height determines how much of the molecule is described by each atom signature. Determination of optimal values for these parameters is time-consuming. Good default values could therefore save considerable computational cost. The goal of this project was to investigate whether such default values could be found by using seven public QSAR data sets spanning a wide range of end points and using both a bit version and a count version of the molecular signatures. On the basis of the experiments performed, we recommend a parameter set of heights 0 to 2 for the count version of the signature fingerprints and heights 0 to 3 for the bit version. These are in combination with a support vector machine using C in the range of 1 to 100 and γ in the range of 0.001 to 0.1. When data sets are small or longer run times are not a problem, then there is reason to consider the addition of height 3 to the count fingerprint and a wider grid search. However, marked improvements should not be expected.
Collapse
Affiliation(s)
- Jonathan Alvarsson
- Department of Pharmaceutical Biosciences, Uppsala University , SE-751 24 Uppsala, Sweden
| | | | | | | | | | | |
Collapse
|
29
|
Nabu S, Nantasenamat C, Owasirikul W, Lawung R, Isarankura-Na-Ayudhya C, Lapins M, Wikberg JES, Prachayasittikul V. Proteochemometric model for predicting the inhibition of penicillin-binding proteins. J Comput Aided Mol Des 2014; 29:127-41. [DOI: 10.1007/s10822-014-9809-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2014] [Accepted: 10/21/2014] [Indexed: 12/17/2022]
|
30
|
Nantasenamat C, Simeon S, Owasirikul W, Songtawee N, Lapins M, Prachayasittikul V, Wikberg JES. Illuminating the origins of spectral properties of green fluorescent proteins via proteochemometric and molecular modeling. J Comput Chem 2014; 35:1951-66. [DOI: 10.1002/jcc.23708] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2014] [Revised: 04/28/2014] [Accepted: 07/28/2014] [Indexed: 01/06/2023]
Affiliation(s)
- Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics; Faculty of Medical Technology, Mahidol University; Bangkok 10700 Thailand
- Department of Clinical Microbiology and Applied Technology; Faculty of Medical Technology, Mahidol University; Bangkok 10700 Thailand
| | - Saw Simeon
- Center of Data Mining and Biomedical Informatics; Faculty of Medical Technology, Mahidol University; Bangkok 10700 Thailand
| | - Wiwat Owasirikul
- Center of Data Mining and Biomedical Informatics; Faculty of Medical Technology, Mahidol University; Bangkok 10700 Thailand
- Department of Radiological Technology; Faculty of Medical Technology, Mahidol University; Bangkok 10700 Thailand
| | - Napat Songtawee
- Center of Data Mining and Biomedical Informatics; Faculty of Medical Technology, Mahidol University; Bangkok 10700 Thailand
| | - Maris Lapins
- Department of Pharmaceutical Biosciences; Uppsala University; Uppsala Sweden
| | - Virapong Prachayasittikul
- Department of Clinical Microbiology and Applied Technology; Faculty of Medical Technology, Mahidol University; Bangkok 10700 Thailand
| | - Jarl E. S. Wikberg
- Department of Pharmaceutical Biosciences; Uppsala University; Uppsala Sweden
| |
Collapse
|
31
|
Nantasenamat C, Monnor T, Worachartcheewan A, Mandi P, Isarankura-Na-Ayudhya C, Prachayasittikul V. Predictive QSAR modeling of aldose reductase inhibitors using Monte Carlo feature selection. Eur J Med Chem 2014; 76:352-9. [PMID: 24589490 DOI: 10.1016/j.ejmech.2014.02.043] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2013] [Revised: 02/12/2014] [Accepted: 02/15/2014] [Indexed: 12/21/2022]
Abstract
This study explores the chemical space and quantitative structure-activity relationship (QSAR) of a set of 60 sulfonylpyridazinones with aldose reductase inhibitory activity. The physicochemical properties of the investigated compounds were described by a total of 3230 descriptors comprising of 6 quantum chemical descriptors and 3224 molecular descriptors. A subset of 5 descriptors was selected from the aforementioned pool by means of Monte Carlo (MC) feature selection coupled to multiple linear regression (MLR). Predictive QSAR models were then constructed by MLR, support vector machine and artificial neural network, which afforded good predictive performance as deduced from internal and external validation. The investigated models are capable of accounting for the origins of aldose reductase inhibitory activity and could be utilized in predicting this property in screening for novel and robust compounds.
Collapse
Affiliation(s)
- Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand; Department of Clinical Microbiology and Applied Technology, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.
| | - Teerawat Monnor
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Apilak Worachartcheewan
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Prasit Mandi
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand; Department of Clinical Microbiology and Applied Technology, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | | | - Virapong Prachayasittikul
- Department of Clinical Microbiology and Applied Technology, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| |
Collapse
|
32
|
Medina-Franco JL, Méndez-Lucio O, Martinez-Mayorga K. The Interplay Between Molecular Modeling and Chemoinformatics to Characterize Protein–Ligand and Protein–Protein Interactions Landscapes for Drug Discovery. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2014; 96:1-37. [DOI: 10.1016/bs.apcsb.2014.06.001] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|