1
|
Abou Hajal A, Bryce RA, Amor BB, Atatreh N, Ghattas MA. Boosting the Accuracy and Chemical Space Coverage of the Detection of Small Colloidal Aggregating Molecules Using the BAD Molecule Filter. J Chem Inf Model 2024. [PMID: 38920403 DOI: 10.1021/acs.jcim.4c00363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/27/2024]
Abstract
The ability to conduct effective high throughput screening (HTS) campaigns in drug discovery is often hampered by the detection of false positives in these assays due to small colloidally aggregating molecules (SCAMs). SCAMs can produce artifactual hits in HTS by nonspecific inhibition of the protein target. In this work, we present a new computational prediction tool for detecting SCAMs based on their 2D chemical structure. The tool, called the boosted aggregation detection (BAD) molecule filter, employs decision tree ensemble methods, namely, the CatBoost classifier and the light gradient-boosting machine, to significantly improve the detection of SCAMs. In developing the filter, we explore models trained on individual data sets, a consensus approach using these models, and, third, a merged data set approach, each tailored for specific drug discovery needs. The individual data set method emerged as most effective, achieving 93% sensitivity and 90% specificity, outperforming existing state-of-the-art models by 20 and 5%, respectively. The consensus models offer broader chemical space coverage, exceeding 90% for all testing sets. This feature is an important aspect particularly for early stage medicinal chemistry projects, and provides information on applicability domain. Meanwhile, the merged data set models demonstrated robust performance, with a notable sensitivity of 79% in the comprehensive 10-fold cross-validation test set. A SHAP analysis of model features indicates the importance of hydrophobicity and molecular complexity as primary factors influencing the aggregation propensity. The BAD molecule filter is readily accessible for the public usage on https://molmodlab-aau.com/Tools.html. This filter provides a new, more robust tool for aggregate prediction in the early stages of drug discovery to optimize hit rates and reduce associated testing and validation overheads.
Collapse
Affiliation(s)
- Abdallah Abou Hajal
- College of Pharmacy, Al Ain University, Abu Dhabi 112612, United Arab Emirates
- AAU Health and Biomedical Research Center, Al Ain University, Abu Dhabi 112612, United Arab Emirates
| | - Richard A Bryce
- Division of Pharmacy and Optometry, School of Health Sciences, University of Manchester, Oxford Road, Manchester M13 9PL, U.K
| | - Boulbaba Ben Amor
- Core42, Inception/G42, Abu Dhabi 2282, United Arab Emirates
- IMT Nord Europe, Villeneuve D'Ascq 59650 France
| | - Noor Atatreh
- College of Pharmacy, Al Ain University, Abu Dhabi 112612, United Arab Emirates
- AAU Health and Biomedical Research Center, Al Ain University, Abu Dhabi 112612, United Arab Emirates
| | - Mohammad A Ghattas
- College of Pharmacy, Al Ain University, Abu Dhabi 112612, United Arab Emirates
- AAU Health and Biomedical Research Center, Al Ain University, Abu Dhabi 112612, United Arab Emirates
| |
Collapse
|
2
|
Melo-Filho CC, Su G, Liu K, Muratov EN, Tropsha A, Liu J. Modeling interactions between Heparan sulfate and proteins based on the Heparan sulfate microarray analysis. Glycobiology 2024; 34:cwae039. [PMID: 38836441 PMCID: PMC11180703 DOI: 10.1093/glycob/cwae039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Revised: 04/30/2024] [Accepted: 05/29/2024] [Indexed: 06/06/2024] Open
Abstract
Heparan sulfate (HS), a sulfated polysaccharide abundant in the extracellular matrix, plays pivotal roles in various physiological and pathological processes by interacting with proteins. Investigating the binding selectivity of HS oligosaccharides to target proteins is essential, but the exhaustive inclusion of all possible oligosaccharides in microarray experiments is impractical. To address this challenge, we present a hybrid pipeline that integrates microarray and in silico techniques to design oligosaccharides with desired protein affinity. Using fibroblast growth factor 2 (FGF2) as a model protein, we assembled an in-house dataset of HS oligosaccharides on microarrays and developed two structural representations: a standard representation with all atoms explicit and a simplified representation with disaccharide units as "quasi-atoms." Predictive Quantitative Structure-Activity Relationship (QSAR) models for FGF2 affinity were developed using the Random Forest (RF) algorithm. The resulting models, considering the applicability domain, demonstrated high predictivity, with a correct classification rate of 0.81-0.80 and improved positive predictive values (PPV) up to 0.95. Virtual screening of 40 new oligosaccharides using the simplified model identified 15 computational hits, 11 of which were experimentally validated for high FGF2 affinity. This hybrid approach marks a significant step toward the targeted design of oligosaccharides with desired protein interactions, providing a foundation for broader applications in glycobiology.
Collapse
Affiliation(s)
- Cleber C Melo-Filho
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, 301 Beard Hall, University of North Carolina, Chapel Hill, NC 27599, United States
| | - Guowei Su
- Glycan Therapeutics, 617 Hutton Street, Raleigh, NC 27606, United States
| | - Kevin Liu
- Glycan Therapeutics, 617 Hutton Street, Raleigh, NC 27606, United States
| | - Eugene N Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, 301 Beard Hall, University of North Carolina, Chapel Hill, NC 27599, United States
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, 301 Beard Hall, University of North Carolina, Chapel Hill, NC 27599, United States
| | - Jian Liu
- Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, 1044 Genetic Medicine Bldg., University of North Carolina, Chapel Hill, NC 27599, United States
| |
Collapse
|
3
|
Rath M, Wellnitz J, Martin HJ, Melo-Filho C, Hochuli JE, Silva GM, Beasley JM, Travis M, Sessions ZL, Popov KI, Zakharov AV, Cherkasov A, Alves V, Muratov EN, Tropsha A. Pharmacokinetics Profiler (PhaKinPro): Model Development, Validation, and Implementation as a Web Tool for Triaging Compounds with Undesired Pharmacokinetics Profiles. J Med Chem 2024; 67:6508-6518. [PMID: 38568752 DOI: 10.1021/acs.jmedchem.3c02446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/05/2024]
Abstract
Computational models that predict pharmacokinetic properties are critical to deprioritize drug candidates that emerge as hits in high-throughput screening campaigns. We collected, curated, and integrated a database of compounds tested in 12 major end points comprising over 10,000 unique molecules. We then employed these data to build and validate binary quantitative structure-activity relationship (QSAR) models. All trained models achieved a correct classification rate above 0.60 and a positive predictive value above 0.50. To illustrate their utility in drug discovery, we used these models to predict the pharmacokinetic properties for drugs in the NCATS Inxight Drugs database. In addition, we employed the developed models to predict the pharmacokinetic properties of all compounds in the DrugBank. All models described in this paper have been integrated and made publicly available via the PhaKinPro Web-portal that can be accessed at https://phakinpro.mml.unc.edu/.
Collapse
Affiliation(s)
- Marielle Rath
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - James Wellnitz
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Holli-Joi Martin
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Cleber Melo-Filho
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Joshua E Hochuli
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Guilherme Martins Silva
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Jon-Michael Beasley
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Maxfield Travis
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Zoe L Sessions
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Konstantin I Popov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Alexey V Zakharov
- National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Artem Cherkasov
- Vancouver Prostate Centre, University of British Columbia, Vancouver, British Columbia V6H3Z6, Canada
| | - Vinicius Alves
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Eugene N Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| |
Collapse
|
4
|
Fernandes PO, Dias ALT, Dos Santos Júnior VS, Sá Magalhães Serafim M, Sousa YV, Monteiro GC, Coutinho ID, Valli M, Verzola MMSA, Ottoni FM, Pádua RMD, Oda FB, Dos Santos AG, Andricopulo AD, da Silva Bolzani V, Mota BEF, Alves RJ, de Oliveira RB, Kronenberger T, Maltarollo VG. Machine Learning-Based Virtual Screening of Antibacterial Agents against Methicillin-Susceptible and Resistant Staphylococcus aureus. J Chem Inf Model 2024; 64:1932-1944. [PMID: 38437501 DOI: 10.1021/acs.jcim.4c00087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2024]
Abstract
The application of computer-aided drug discovery (CADD) approaches has enabled the discovery of new antimicrobial therapeutic agents in the past. The high prevalence of methicillin-resistantStaphylococcus aureus(MRSA) strains promoted this pathogen to a high-priority pathogen for drug development. In this sense, modern CADD techniques can be valuable tools for the search for new antimicrobial agents. We employed a combination of a series of machine learning (ML) techniques to select and evaluate potential compounds with antibacterial activity against methicillin-susceptible S. aureus (MSSA) and MRSA strains. In the present study, we describe the antibacterial activity of six compounds against MSSA and MRSA reference (American Type Culture Collection (ATCC)) strains as well as two clinical strains of MRSA. These compounds showed minimal inhibitory concentrations (MIC) in the range from 12.5 to 200 μM against the different bacterial strains evaluated. Our results constitute relevant proven ML-workflow models to distinctively screen for novel MRSA antibiotics.
Collapse
Affiliation(s)
- Philipe Oliveira Fernandes
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31.270-901, Brazil
| | - Anna Letícia Teotonio Dias
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31.270-901, Brazil
| | - Valtair Severino Dos Santos Júnior
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31.270-901, Brazil
| | - Mateus Sá Magalhães Serafim
- Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31.270-901, Brazil
| | - Yamara Viana Sousa
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31.270-901, Brazil
| | - Gustavo Claro Monteiro
- Departamento de Química Orgânica, Instituto de Química, Universidade Estadual Paulista (UNESP), Araraquara, São Paulo 14.800-900, Brazil
| | - Isabel Duarte Coutinho
- Departamento de Química Orgânica, Instituto de Química, Universidade Estadual Paulista (UNESP), Araraquara, São Paulo 14.800-900, Brazil
| | - Marilia Valli
- Departamento de Física e Ciência Interdisciplinar, Instituto de Física, Universidade de São Paulo (USP), São Carlos, São Paulo 13.563-120, Brazil
| | - Marina Mol Sena Andrade Verzola
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31.270-901, Brazil
| | - Flaviano Melo Ottoni
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31.270-901, Brazil
| | - Rodrigo Maia de Pádua
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31.270-901, Brazil
| | - Fernando Bombarda Oda
- Departamento de Fármacos e Medicamentos, Faculdade de Ciências Farmacêuticas, Universidade Estadual Paulista (UNESP), Araraquara 14.800-903, Brazil
| | - André Gonzaga Dos Santos
- Departamento de Fármacos e Medicamentos, Faculdade de Ciências Farmacêuticas, Universidade Estadual Paulista (UNESP), Araraquara 14.800-903, Brazil
| | - Adriano Defini Andricopulo
- Departamento de Física e Ciência Interdisciplinar, Instituto de Física, Universidade de São Paulo (USP), São Carlos, São Paulo 13.563-120, Brazil
| | - Vanderlan da Silva Bolzani
- Departamento de Química Orgânica, Instituto de Química, Universidade Estadual Paulista (UNESP), Araraquara, São Paulo 14.800-900, Brazil
| | - Bruno Eduardo Fernandes Mota
- Departamento de Análises Clínicas e Toxicológicas, Faculdade de Farmácia, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31.270-901, Brazil
| | - Ricardo José Alves
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31.270-901, Brazil
| | - Renata Barbosa de Oliveira
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31.270-901, Brazil
| | - Thales Kronenberger
- Institute of Pharmacy, Pharmaceutical/Medicinal Chemistry and Tübingen Center for Academic Drug Discovery, Eberhard Karls University Tübingen, 72076 Tübingen, Germany
- School of Pharmacy, Faculty of Health Sciences, University of Eastern Finland, 70211 Kuopio, Finland
| | - Vinícius Gonçalves Maltarollo
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31.270-901, Brazil
| |
Collapse
|
5
|
Alves VM, Yasgar A, Wellnitz J, Rai G, Rath M, Braga RC, Capuzzi SJ, Simeonov A, Muratov EN, Zakharov AV, Tropsha A. Lies and Liabilities: Computational Assessment of High-Throughput Screening Hits to Identify Artifact Compounds. J Med Chem 2023; 66:12828-12839. [PMID: 37677128 DOI: 10.1021/acs.jmedchem.3c00482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]
Abstract
Hits from high-throughput screening (HTS) of chemical libraries are often false positives due to their interference with assay detection technology. In response, we generated the largest publicly available library of chemical liabilities and developed "Liability Predictor," a free web tool to predict HTS artifacts. More specifically, we generated, curated, and integrated HTS data sets for thiol reactivity, redox activity, and luciferase (firefly and nano) activity and developed and validated quantitative structure-interference relationship (QSIR) models to predict these nuisance behaviors. The resulting models showed 58-78% external balanced accuracy for 256 external compounds per assay. QSIR models developed and validated herein identify nuisance compounds among experimental hits more reliably than do popular PAINS filters. Both the models and the curated data sets were implemented in "Liability Predictor," publicly available at https://liability.mml.unc.edu/. "Liability Predictor" may be used as part of chemical library design or for triaging HTS hits.
Collapse
Affiliation(s)
- Vinicius M Alves
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Adam Yasgar
- National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - James Wellnitz
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Ganesha Rai
- National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Marielle Rath
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | | | - Stephen J Capuzzi
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Anton Simeonov
- National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Eugene N Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
- Department of Pharmaceutical Sciences, Federal University of Paraiba, Joao Pessoa, PB 58059, Brazil
| | - Alexey V Zakharov
- National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| |
Collapse
|
6
|
Shan M, Jiang C, Qin L, Cheng G. A Review of Computational Methods in Predicting hERG Channel Blockers. ChemistrySelect 2022. [DOI: 10.1002/slct.202201221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Mengyi Shan
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Chen Jiang
- QuanMin RenZheng (HangZhou) Technology Co. Ltd. China
| | - Lu‐Ping Qin
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Gang Cheng
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| |
Collapse
|
7
|
Hochuli J, Jain S, Melo-Filho C, Sessions ZL, Bobrowski T, Choe J, Zheng J, Eastman R, Talley DC, Rai G, Simeonov A, Tropsha A, Muratov EN, Baljinnyam B, Zakharov AV. Allosteric Binders of ACE2 Are Promising Anti-SARS-CoV-2 Agents. ACS Pharmacol Transl Sci 2022; 5:468-478. [PMID: 35821746 PMCID: PMC9236207 DOI: 10.1021/acsptsci.2c00049] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The COVID-19 pandemic has had enormous health, economic, and social consequences. Vaccines have been successful in reducing rates of infection and hospitalization, but there is still a need for acute treatment of the disease. We investigate whether compounds that bind the human angiotensin-converting enzyme 2 (ACE2) protein can decrease SARS-CoV-2 replication without impacting ACE2's natural enzymatic function. Initial screening of a diversity library resulted in hit compounds active in an ACE2-binding assay, which showed little inhibition of ACE2 enzymatic activity (116 actives, success rate ∼4%), suggesting they were allosteric binders. Subsequent application of in silico techniques boosted success rates to ∼14% and resulted in 73 novel confirmed ACE2 binders with K d values as low as 6 nM. A subsequent SARS-CoV-2 assay revealed that five of these compounds inhibit the viral life cycle in human cells. Further effort is required to completely elucidate the antiviral mechanism of these ACE2-binders, but they present a valuable starting point for both the development of acute treatments for COVID-19 and research into the host-directed therapy.
Collapse
Affiliation(s)
- Joshua
E. Hochuli
- Molecular
Modeling Laboratory, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
- Curriculum
in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Sankalp Jain
- National
Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Cleber Melo-Filho
- Molecular
Modeling Laboratory, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Zoe L. Sessions
- Molecular
Modeling Laboratory, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Tesia Bobrowski
- Molecular
Modeling Laboratory, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Jun Choe
- National
Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Johnny Zheng
- National
Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Richard Eastman
- National
Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Daniel C. Talley
- National
Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Ganesha Rai
- National
Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Anton Simeonov
- National
Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Alexander Tropsha
- Molecular
Modeling Laboratory, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Eugene N. Muratov
- Molecular
Modeling Laboratory, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
| | - Bolormaa Baljinnyam
- National
Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Alexey V. Zakharov
- National
Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| |
Collapse
|
8
|
Tinkov OV, Grigorev VY, Grigoreva LD, Osipov VN, Kolotaev AV, Khachatryan DS. QSAR analysis and experimental evaluation of new quinazoline-containing hydroxamic acids as histone deacetylase 6 inhibitors. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2022; 33:513-532. [PMID: 35786151 DOI: 10.1080/1062936x.2022.2092210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 06/14/2022] [Indexed: 06/15/2023]
Abstract
Histone deacetylase inhibitors represent the most important class of drugs for the treatment of human cancer and other diseases due to their influence on cell growth, differentiation, and apoptosis. Among the well-known eighteen histone deacetylases, histone deacetylase 6 (HDAC6), which is involved in oncogenesis, cell survival, and cancer cell metastasis, is of great importance. Using the CDK and alvaDesc molecular descriptors and the Random Forest and EXtreme Gradient Boosting methods, we propose a number of adequate QSAR classification models, which are integrated into a consensus model and are freely available on the OCHEM web platform (https://ochem.eu). The consensus QSAR model is used for virtual screening of a series of seven new compounds, the derivatives of N-((hydroxyamino)-oxoalkyl)-2-(quinazoline-4-ilamino)-benzamides, the synthesis schemes of which are also presented in this work. In vitro evaluation of the inhibitory activity (IC50) of this series of compounds against HDAC6 allowed us to confirm the results of virtual screening and to reveal promising compounds V-2 and V-4, IC50 of which is 3.25 nM and 0.04 nM, respectively. The subsequent in silico evaluation of the main ADMET properties of active compounds V-2 and V-4 allowed us to find that they have acceptable pharmacokinetic parameters and level of acute toxicity.
Collapse
Affiliation(s)
- O V Tinkov
- Department of Pharmacology and Pharmaceutical Chemistry, Medical Faculty, Shevchenko Transnistria State University, Tiraspol, Moldova
| | - V Y Grigorev
- Molecular Design, Institute of Physiologically Active Compounds of the Russian Academy of SciencesDepartment of Computer-aided, Chernogolovka, Russia
| | - L D Grigoreva
- Department of Fundamental Physicochemical Engineering, Moscow State University, Moscow, Russia
| | - V N Osipov
- Department of Chemical Synthesis, Blokhin National Medical Research Center of Oncology, Ministry of Health of the Russian Federation, Moscow, Russia
| | - A V Kolotaev
- Laboratory of Natural Compounds, National Research Centre "Kurchatov Institute", Moscow, Russia
- Laboratory of Natural Compounds, Institute of Chemical Reagents and High Purity Chemical Substances of the National Research Centre "Kurchatov Institute", Moscow, Russia
| | - D S Khachatryan
- Laboratory of Natural Compounds, National Research Centre "Kurchatov Institute", Moscow, Russia
- Laboratory of Natural Compounds, Institute of Chemical Reagents and High Purity Chemical Substances of the National Research Centre "Kurchatov Institute", Moscow, Russia
| |
Collapse
|
9
|
Hochuli JE, Jain S, Melo-filho C, Sessions ZL, Bobrowski T, Choe J, Zheng J, Eastman R, Talley DC, Rai G, Simeonov A, Tropsha A, Muratov EN, Baljinnyam B, Zakharov AV. Allosteric binders of ACE2 are promising anti-SARS-CoV-2 agents.. [PMID: 35313579 PMCID: PMC8936107 DOI: 10.1101/2022.03.15.484484] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
AbstractThe COVID-19 pandemic has had enormous health, economic, and social consequences. Vaccines have been successful in reducing rates of infection and hospitalization, but there is still a need for an acute treatment for the disease. We investigate whether compounds that bind the human ACE2 protein can interrupt SARS-CoV-2 replication without damaging ACE2’s natural enzymatic function. Initial compounds were screened for binding to ACE2 but little interruption of ACE2 enzymatic activity. This set of compounds was extended by application of quantitative structure-activity analysis, which resulted in 512 virtual hits for further confirmatory screening. A subsequent SARS-CoV-2 replication assay revealed that five of these compounds inhibit SARS-CoV-2 replication in human cells. Further effort is required to completely determine the antiviral mechanism of these compounds, but they serve as a strong starting point for both development of acute treatments for COVID-19 and research into the mechanism of infection.Abstract FigureTOC Graphic: Overall study design.
Collapse
|
10
|
Molecular insights on ABL kinase activation using tree-based machine learning models and molecular docking. Mol Divers 2021; 25:1301-1314. [PMID: 34191245 PMCID: PMC8241884 DOI: 10.1007/s11030-021-10261-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 06/18/2021] [Indexed: 12/14/2022]
Abstract
Abelson kinase (c-Abl) is a non-receptor tyrosine kinase involved in several biological processes essential for cell differentiation, migration, proliferation, and survival. This enzyme's activation might be an alternative strategy for treating diseases such as neutropenia induced by chemotherapy, prostate, and breast cancer. Recently, a series of compounds that promote the activation of c-Abl has been identified, opening a promising ground for c-Abl drug development. Structure-based drug design (SBDD) and ligand-based drug design (LBDD) methodologies have significantly impacted recent drug development initiatives. Here, we combined SBDD and LBDD approaches to characterize critical chemical properties and interactions of identified c-Abl's activators. We used molecular docking simulations combined with tree-based machine learning models—decision tree, AdaBoost, and random forest to understand the c-Abl activators' structural features required for binding to myristoyl pocket, and consequently, to promote enzyme and cellular activation. We obtained predictive and robust models with Matthews correlation coefficient values higher than 0.4 for all endpoints and identified characteristics that led to constructing a structure–activity relationship model (SAR).
Collapse
|
11
|
Soares Rodrigues GC, Maia MDS, Silva Cavalcanti AB, Costa Barros RP, Scotti L, Cespedes-Acuña CL, Muratov EN, Scotti MT. Computer-assisted discovery of compounds with insecticidal activity against Musca domestica and Mythimna separata. Food Chem Toxicol 2020; 147:111899. [PMID: 33279675 DOI: 10.1016/j.fct.2020.111899] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 11/25/2020] [Accepted: 11/30/2020] [Indexed: 11/18/2022]
Abstract
Pesticides are used to control and combat insects and pests in the agricultural sector, households, and public health programs. The frequent and disorderly use of these pesticides may lead to variety of undesired effects. Therefore, natural products have many advantages over to synthetic compounds to be used as insecticides. The goal of this study was to find natural products with insecticidal potential against Musca domestica and Mythimna separata. To achieve this goal, we developed predictive QSAR models using MuDRA, PLS, and RF approaches and performed virtual screening of 117 natural products. As a result of QSAR modeling, we formulated the recommendations regarding physico-chemical characteristics for promising compounds active against Musca domestica and Mythimna separata. Homology models were successfully built for both species and molecular docking of QSAR hits vs known insecticides allowed us to prioritize twenty-two compounds against Musca domestica and six against Mythimna separata. Our results suggest that pimarane diterpenes, abietanes diterpenes, dimeric diterpenes and scopadulane diterpenes obtained from aerial parts of species of the genus Calceolaria (Calceolariaceae: Scrophulariaceae) can be considered as potential insecticidal.
Collapse
Affiliation(s)
- Gabriela Cristina Soares Rodrigues
- Laboratory of Cheminformatics. Postgraduate Program in Natural and Synthetic Bioactive Products (PgPNSB), Health Sciences Center, Federal University of Paraíba, Castelo Branco 58051900, João Pessoa, PB, Brazil
| | - Mayara Dos Santos Maia
- Laboratory of Cheminformatics. Postgraduate Program in Natural and Synthetic Bioactive Products (PgPNSB), Health Sciences Center, Federal University of Paraíba, Castelo Branco 58051900, João Pessoa, PB, Brazil
| | - Andreza Barbosa Silva Cavalcanti
- Laboratory of Cheminformatics. Postgraduate Program in Natural and Synthetic Bioactive Products (PgPNSB), Health Sciences Center, Federal University of Paraíba, Castelo Branco 58051900, João Pessoa, PB, Brazil
| | - Renata Priscila Costa Barros
- Laboratory of Cheminformatics. Postgraduate Program in Natural and Synthetic Bioactive Products (PgPNSB), Health Sciences Center, Federal University of Paraíba, Castelo Branco 58051900, João Pessoa, PB, Brazil
| | - Luciana Scotti
- Laboratory of Cheminformatics. Postgraduate Program in Natural and Synthetic Bioactive Products (PgPNSB), Health Sciences Center, Federal University of Paraíba, Castelo Branco 58051900, João Pessoa, PB, Brazil
| | - Carlos L Cespedes-Acuña
- Plant biochemistry and phytochemical ecotoxicology lab. Departamento de Ciencias Basicas, Facultad de Ciencias, Universidad del Bio Bio, Chillan, Chile
| | - Eugene N Muratov
- Laboratory of Cheminformatics. Postgraduate Program in Natural and Synthetic Bioactive Products (PgPNSB), Health Sciences Center, Federal University of Paraíba, Castelo Branco 58051900, João Pessoa, PB, Brazil; Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Marcus Tullius Scotti
- Laboratory of Cheminformatics. Postgraduate Program in Natural and Synthetic Bioactive Products (PgPNSB), Health Sciences Center, Federal University of Paraíba, Castelo Branco 58051900, João Pessoa, PB, Brazil.
| |
Collapse
|
12
|
QSAR models for the fumigant activity prediction of essential oils. J Mol Graph Model 2020; 101:107751. [DOI: 10.1016/j.jmgm.2020.107751] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Revised: 08/20/2020] [Accepted: 09/04/2020] [Indexed: 12/23/2022]
|
13
|
Li Y, Tian Y, Xi Y, Qin Z, Yan A. Quantitative Structure-Activity Relationship Study for HIV-1 LEDGF/p75 Inhibitors. Curr Comput Aided Drug Des 2020; 16:654-666. [DOI: 10.2174/1573409915666190919153959] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Revised: 05/08/2019] [Accepted: 08/26/2019] [Indexed: 12/28/2022]
Abstract
Background:
HIV-1 Integrase (IN) is an important target for the development of the
new anti-AIDS drugs. HIV-1 LEDGF/p75 inhibitors, which block the integrase and LEDGF/p75
interaction, have been validated for reduction in HIV-1 viral replicative capacity.
Methods:
In this work, computational Quantitative Structure-Activity Relationship (QSAR) models
were developed for predicting the bioactivity of HIV-1 integrase LEDGF/p75 inhibitors. We collected
190 inhibitors and their bioactivities in this study and divided the inhibitors into nine scaffolds
by the method of T-distributed Stochastic Neighbor Embedding (TSNE). These 190 inhibitors
were split into a training set and a test set according to the result of a Kohonen’s self-organizing
map (SOM) or randomly. Multiple Linear Regression (MLR) models, support vector machine
(SVM) models and two consensus models were built based on the training sets by 20 selected
CORINA Symphony descriptors.
Results:
All the models showed a good prediction of pIC50. The correlation coefficients of all the
models were more than 0.7 on the test set. For the training set of consensus Model C1, which performed
better than other models, the correlation coefficient(r) achieved 0.909 on the training set,
and 0.804 on the test set.
Conclusion:
The selected molecular descriptors show that hydrogen bond acceptor, atom charges
and electronegativities (especially π atom) were important in predicting the activity of HIV-1 integrase
LEDGF/p75-IN inhibitors.
Collapse
Affiliation(s)
- Yang Li
- Institute of Science and Technology, Shandong University of Traditional Chinese Medicine, Ji'nan, Shandong, 250355, China
| | - Yujia Tian
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, P.O. Box 53, Beijing University of Chemical Technology, 15 BeiSanHuan East Road, Beijing 100029, China
| | - Yao Xi
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, P.O. Box 53, Beijing University of Chemical Technology, 15 BeiSanHuan East Road, Beijing 100029, China
| | - Zijian Qin
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, P.O. Box 53, Beijing University of Chemical Technology, 15 BeiSanHuan East Road, Beijing 100029, China
| | - Aixia Yan
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, P.O. Box 53, Beijing University of Chemical Technology, 15 BeiSanHuan East Road, Beijing 100029, China
| |
Collapse
|
14
|
Alves VM, Bobrowski T, Melo-Filho CC, Korn D, Auerbach S, Schmitt C, Muratov EN, Tropsha A. QSAR Modeling of SARS-CoV M pro Inhibitors Identifies Sufugolix, Cenicriviroc, Proglumetacin, and other Drugs as Candidates for Repurposing against SARS-CoV-2. Mol Inform 2020; 40:e2000113. [PMID: 33405340 DOI: 10.1002/minf.202000113] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2020] [Accepted: 07/28/2020] [Indexed: 12/22/2022]
Abstract
The main protease (Mpro) of the SARS-CoV-2 has been proposed as one of the major drug targets for COVID-19. We have identified the experimental data on the inhibitory activity of compounds tested against the closely related (96 % sequence identity, 100 % active site conservation) Mpro of SARS-CoV. We developed QSAR models of these inhibitors and employed these models for virtual screening of all drugs in the DrugBank database. Similarity searching and molecular docking were explored in parallel, but docking failed to correctly discriminate between experimentally active and inactive compounds, so it was not relied upon for prospective virtual screening. Forty-two compounds were identified by our models as consensus computational hits. Subsequent to our computational studies, NCATS reported the results of experimental screening of their drug collection in SARS-CoV-2 cytopathic effect assay (https://opendata.ncats.nih.gov/covid19/). Coincidentally, NCATS tested 11 of our 42 hits, and three of them, cenicriviroc (AC50 of 8.9 μM), proglumetacin (tested twice independently, with AC50 of 8.9 μM and 12.5 μM), and sufugolix (AC50 12.6 μM), were shown to be active. These observations support the value of our modeling approaches and models for guiding the experimental investigations of putative anti-COVID-19 drug candidates. All data and models used in this study are publicly available via Supplementary Materials, GitHub (https://github.com/alvesvm/sars-cov-mpro), and Chembench web portal (https://chembench.mml.unc.edu/).
Collapse
Affiliation(s)
- Vinicius M Alves
- Office of Data Science, National Toxicology Program, NIEHS, Morrisville, NC, 27560, USA
| | - Tesia Bobrowski
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Beard Hall, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Cleber C Melo-Filho
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Beard Hall, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Daniel Korn
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Beard Hall, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA.,Department of Computer Science, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Scott Auerbach
- Toxinformatics Group, National Toxicology Program, NIEHS, Morrisville, NC, 27560, USA
| | - Charles Schmitt
- Office of Data Science, National Toxicology Program, NIEHS, Morrisville, NC, 27560, USA
| | - Eugene N Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Beard Hall, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA.,Department of Pharmaceutical Sciences, Federal University of Paraiba, Joao Pessoa, PB, Brazil
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, Beard Hall, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| |
Collapse
|
15
|
Borba JVB, Braga RC, Alves VM, Muratov EN, Kleinstreuer N, Tropsha A, Andrade CH. Pred-Skin: A Web Portal for Accurate Prediction of Human Skin Sensitizers. Chem Res Toxicol 2020; 34:258-267. [PMID: 32673477 DOI: 10.1021/acs.chemrestox.0c00186] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Safety assessment is an essential component of the regulatory acceptance of industrial chemicals. Previously, we have developed a model to predict the skin sensitization potential of chemicals for two assays, the human patch test and murine local lymph node assay, and implemented this model in a web portal. Here, we report on the substantially revised and expanded freely available web tool, Pred-Skin version 3.0. This up-to-date version of Pred-Skin incorporates multiple quantitative structure-activity relationship (QSAR) models developed with in vitro, in chemico, and mice and human in vivo data, integrated into a consensus naïve Bayes model that predicts human effects. Individual QSAR models were generated using skin sensitization data derived from human repeat insult patch tests, human maximization tests, and mouse local lymph node assays. In addition, data for three validated alternative methods, the direct peptide reactivity assay, KeratinoSens, and the human cell line activation test, were employed as well. Models were developed using open-source tools and rigorously validated according to the best practices of QSAR modeling. Predictions obtained from these models were then used to build a naïve Bayes model for predicting human skin sensitization with the following external prediction accuracy: correct classification rate (89%), sensitivity (94%), positive predicted value (91%), specificity (84%), and negative predicted value (89%). As an additional assessment of model performance, we identified 11 cosmetic ingredients known to cause skin sensitization but were not included in our training set, and nine of them were accurately predicted as sensitizers by our models. Pred-Skin can be used as a reliable alternative to animal tests for predicting human skin sensitization.
Collapse
Affiliation(s)
- Joyce V B Borba
- Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Universidade Federal de Goiás, Goiânia, Goiás 74605-170, Brazil.,Laboratory for Molecular Modeling, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | | | - Vinicius M Alves
- Laboratory for Molecular Modeling, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Eugene N Muratov
- Laboratory for Molecular Modeling, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States.,Department of Pharmaceutical Sciences, Federal University of Paraíba, João Pessoa, Paraíba 58059, Brazil
| | - Nicole Kleinstreuer
- National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, National Institute of Environmental Health Sciences, Research Triangle Park, Durham, North Carolina 27709, United States
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Carolina Horta Andrade
- Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Universidade Federal de Goiás, Goiânia, Goiás 74605-170, Brazil
| |
Collapse
|
16
|
Li X, Kleinstreuer NC, Fourches D. Hierarchical Quantitative Structure–Activity Relationship Modeling Approach for Integrating Binary, Multiclass, and Regression Models of Acute Oral Systemic Toxicity. Chem Res Toxicol 2020; 33:353-366. [DOI: 10.1021/acs.chemrestox.9b00259] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Affiliation(s)
- Xinhao Li
- Department of Chemistry, Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina 27695, United States
| | - Nicole C. Kleinstreuer
- Division of Intramural Research/Biostatistics and Computational Biology Branch, NIEHS, Research Triangle
Park, Durham, North Carolina 27709, United States
- National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, NIEHS, Research Triangle Park, Durham, North Carolina 27709, United States
| | - Denis Fourches
- Department of Chemistry, Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina 27695, United States
| |
Collapse
|
17
|
|
18
|
Alves VM, Hwang D, Muratov E, Sokolsky-Papkov M, Varlamova E, Vinod N, Lim C, Andrade CH, Tropsha A, Kabanov A. Cheminformatics-driven discovery of polymeric micelle formulations for poorly soluble drugs. SCIENCE ADVANCES 2019; 5:eaav9784. [PMID: 31249867 PMCID: PMC6594770 DOI: 10.1126/sciadv.aav9784] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2019] [Accepted: 05/16/2019] [Indexed: 05/29/2023]
Abstract
Many drug candidates fail therapeutic development because of poor aqueous solubility. We have conceived a computer-aided strategy to enable polymeric micelle-based delivery of poorly soluble drugs. We built models predicting both drug loading efficiency (LE) and loading capacity (LC) using novel descriptors of drug-polymer complexes. These models were employed for virtual screening of drug libraries, and eight drugs predicted to have either high LE and high LC or low LE and low LC were selected. Three putative positives, as well as three putative negative hits, were confirmed experimentally (implying 75% prediction accuracy). Fortuitously, simvastatin, a putative negative hit, was found to have the desired micelle solubility. Podophyllotoxin and simvastatin (LE of 95% and 87% and LC of 43% and 41%, respectively) were among the top five polymeric micelle-soluble compounds ever studied experimentally. The success of the strategy described herein suggests its broad utility for designing drug delivery systems.
Collapse
Affiliation(s)
- Vinicius M. Alves
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
- Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, GO 74605-170, Brazil
| | - Duhyeong Hwang
- Center for Nanotechnology in Drug Delivery, Division of Pharmacoengineering and Molecular Pharmaceutics, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Eugene Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
- Department of Pharmaceutical Sciences, Federal University of Paraíba, Joao Pessoa, PB 58059, Brazil
| | - Marina Sokolsky-Papkov
- Center for Nanotechnology in Drug Delivery, Division of Pharmacoengineering and Molecular Pharmaceutics, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Ekaterina Varlamova
- Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, GO 74605-170, Brazil
| | - Natasha Vinod
- Center for Nanotechnology in Drug Delivery, Division of Pharmacoengineering and Molecular Pharmaceutics, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
- UNC/NC State Joint Department of Biomedical Engineering, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Chaemin Lim
- Center for Nanotechnology in Drug Delivery, Division of Pharmacoengineering and Molecular Pharmaceutics, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Carolina H. Andrade
- Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, GO 74605-170, Brazil
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Alexander Kabanov
- Center for Nanotechnology in Drug Delivery, Division of Pharmacoengineering and Molecular Pharmaceutics, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA
- Laboratory of Chemical Design of Bionanomaterials, Faculty of Chemistry, M.V. Lomonosov Moscow State University, Moscow 119992, Russia
| |
Collapse
|
19
|
Alves VM, Golbraikh A, Capuzzi SJ, Liu K, Lam WI, Korn DR, Pozefsky D, Andrade CH, Muratov EN, Tropsha A. Multi-Descriptor Read Across (MuDRA): A Simple and Transparent Approach for Developing Accurate Quantitative Structure-Activity Relationship Models. J Chem Inf Model 2018; 58:1214-1223. [PMID: 29809005 DOI: 10.1021/acs.jcim.8b00124] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Multiple approaches to quantitative structure-activity relationship (QSAR) modeling using various statistical or machine learning techniques and different types of chemical descriptors have been developed over the years. Oftentimes models are used in consensus to make more accurate predictions at the expense of model interpretation. We propose a simple, fast, and reliable method termed Multi-Descriptor Read Across (MuDRA) for developing both accurate and interpretable models. The method is conceptually related to the well-known kNN approach but uses different types of chemical descriptors simultaneously for similarity assessment. To benchmark the new method, we have built MuDRA models for six different end points (Ames mutagenicity, aquatic toxicity, hepatotoxicity, hERG liability, skin sensitization, and endocrine disruption) and compared the results with those generated with conventional consensus QSAR modeling. We find that models built with MuDRA show consistently high external accuracy similar to that of conventional QSAR models. However, MuDRA models excel in terms of transparency, interpretability, and computational efficiency. We posit that due to its methodological simplicity and reliable predictive accuracy, MuDRA provides a powerful alternative to a much more complex consensus QSAR modeling. MuDRA is implemented and freely available at the Chembench web portal ( https://chembench.mml.unc.edu/mudra ).
Collapse
Affiliation(s)
- Vinicius M Alves
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy , University of North Carolina , Chapel Hill , North Carolina 27599 , United States.,Laboratory for Molecular Modeling and Design, Department of Pharmacy , Federal University of Goias , Goiania , GO 74605-170 , Brazil
| | - Alexander Golbraikh
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy , University of North Carolina , Chapel Hill , North Carolina 27599 , United States
| | - Stephen J Capuzzi
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy , University of North Carolina , Chapel Hill , North Carolina 27599 , United States
| | - Kammy Liu
- Department of Computer Science , University of North Carolina , Chapel Hill , North Carolina 27599 , United States
| | - Wai In Lam
- Department of Computer Science , University of North Carolina , Chapel Hill , North Carolina 27599 , United States
| | - Daniel Robert Korn
- Department of Computer Science , University of North Carolina , Chapel Hill , North Carolina 27599 , United States
| | - Diane Pozefsky
- Department of Computer Science , University of North Carolina , Chapel Hill , North Carolina 27599 , United States
| | - Carolina Horta Andrade
- Laboratory for Molecular Modeling and Design, Department of Pharmacy , Federal University of Goias , Goiania , GO 74605-170 , Brazil
| | - Eugene N Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy , University of North Carolina , Chapel Hill , North Carolina 27599 , United States.,Department of Chemical Technology , Odessa National Polytechnic University , Odessa , 65000 , Ukraine
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy , University of North Carolina , Chapel Hill , North Carolina 27599 , United States
| |
Collapse
|
20
|
Zhu XW, Li SJ. In Silico Prediction of Drug-Induced Liver Injury Based on Adverse Drug Reaction Reports. Toxicol Sci 2018; 158:391-400. [PMID: 28521054 DOI: 10.1093/toxsci/kfx099] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Drug-induced liver injury (DILI) is a major cause of drug attrition. Currently existing Quantitative Structure-Activity Relationship models have limited predictive capabilities for DILI. Furthermore, their practical applications were limited by lack of new hepatotoxicity data. In this study, we first collected and curated a novel set of 122 DILI-positive and 932 DILI-negative drugs from online adverse drug reports using proportional reporting ratios as the signal detection method. Second, three strategies (under-sampling the majority class, synthetic minority over-sampling technique, and adjusting decision threshold approach) were employed to develop predictive classification models to cope with the unbalanced dataset. Random forest (RF) models using CDK, MACCS, and Mold2 descriptors based on the under-sampling and over-sampling strategies afforded correct classification ratio (CCR) of ∼0.77 and 0.78, respectively. Recursive RF models based on the last strategy tremendously reduced modeling descriptors (at most 95.4% for Mold2) while apparently improved the predictability with a consensus CCR of 0.84 (sensitivity of 0.88 and specificity of 0.79). Structural analysis showed that pyrimidine derivatives, purine derivatives, and halogenated hydrocarbon were critical for drugs' hepatotoxicity. The reporting frequency of many drugs was gender-dependent (eg, antiviral and anti-cancer drugs for males and antibacterial drugs for females) as well as age-dependent (eg, antiviral and anti-cancer drugs for the middle age group of 20-29, 30-39, and 40-49). Approximately 84% of total cases were reported during the first 6 months of administration. The curated hepatotoxicity dataset along with the predictive classification models presented here should provide insight into future studies of DILI.
Collapse
Affiliation(s)
- Xiang-Wei Zhu
- Department of Environmental Science, College of Resource and Environment, Qingdao Engineering Research Center for Rural Environment, Qingdao Agricultural University, Qingdao 266109, China
| | - Shao-Jing Li
- Department of Computer Science and Technology, College of Science and Information, Qingdao Agricultural University, Qingdao 266109, China
| |
Collapse
|
21
|
Chauhan S, Kumar A. Consensus QSAR modelling of SIRT1 activators using simplex representation of molecular structure. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2018; 29:277-294. [PMID: 29390919 DOI: 10.1080/1062936x.2018.1426626] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/2017] [Accepted: 01/08/2018] [Indexed: 06/07/2023]
Abstract
Hierarchical QSAR technology (HiT QSAR) was used for consensus QSAR modelling of 65 SIRT1 activators. Simplex representation of molecular structure (SiRMS) has been used for descriptor generation. The predictive QSAR models were developed using the partial least squares (PLS) method. The QSAR models were built up according to OECD principles. One hundred rounds of Y-scrambling were performed for each selected model to exclude chance correlations. A successful consensus model (r2 = 0.830, [Formula: see text] = 0.754) was obtained from the five best QSAR models. Leverage, ellipsoid and local tree domain of applicability (DA) approaches have been used for evaluation of the quality of predictions. Molecular fragments responsible for an increase and decrease of the activation properties have been determined by mechanistic interpretation of the developed QSAR model.
Collapse
Affiliation(s)
- S Chauhan
- a Department of Pharmaceutical Sciences , Guru Jambheshwar University of Science and Technology , Hisar , India
| | - A Kumar
- a Department of Pharmaceutical Sciences , Guru Jambheshwar University of Science and Technology , Hisar , India
| |
Collapse
|
22
|
Braga RC, Alves VM, Muratov EN, Strickland J, Kleinstreuer N, Trospsha A, Andrade CH. Pred-Skin: A Fast and Reliable Web Application to Assess Skin Sensitization Effect of Chemicals. J Chem Inf Model 2017; 57:1013-1017. [DOI: 10.1021/acs.jcim.7b00194] [Citation(s) in RCA: 53] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Rodolpho C. Braga
- Laboratory
for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO 74605-170, Brazil
| | - Vinicius M. Alves
- Laboratory
for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO 74605-170, Brazil
- Laboratory
for Molecular Modeling, Division of Chemical Biology and Medicinal
Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel
Hill, North Carolina 27599, United States
| | - Eugene N. Muratov
- Laboratory
for Molecular Modeling, Division of Chemical Biology and Medicinal
Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel
Hill, North Carolina 27599, United States
- Department
of Chemical Technology, Odessa National Polytechnic University, Odessa, 65000, Ukraine
| | - Judy Strickland
- Integrated Laboratory Systems, Inc., P.O. Box 13501, Research Triangle Park, North Carolina 27709, United States
| | - Nicole Kleinstreuer
- National
Toxicology Program Interagency Center for the Evaluation of Alternative
Toxicological Methods, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, United States
| | - Alexander Trospsha
- Laboratory
for Molecular Modeling, Division of Chemical Biology and Medicinal
Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel
Hill, North Carolina 27599, United States
| | - Carolina Horta Andrade
- Laboratory
for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO 74605-170, Brazil
| |
Collapse
|
23
|
Capuzzi SJ, Kim ISJ, Lam WI, Thornton TE, Muratov EN, Pozefsky D, Tropsha A. Chembench: A Publicly Accessible, Integrated Cheminformatics Portal. J Chem Inf Model 2017; 57:105-108. [PMID: 28045544 DOI: 10.1021/acs.jcim.6b00462] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The enormous increase in the amount of publicly available chemical genomics data and the growing emphasis on data sharing and open science mandates that cheminformaticians also make their models publicly available for broad use by the scientific community. Chembench is one of the first publicly accessible, integrated cheminformatics Web portals. It has been extensively used by researchers from different fields for curation, visualization, analysis, and modeling of chemogenomics data. Since its launch in 2008, Chembench has been accessed more than 1 million times by more than 5000 users from a total of 98 countries. We report on the recent updates and improvements that increase the simplicity of use, computational efficiency, accuracy, and accessibility of a broad range of tools and services for computer-assisted drug design and computational toxicology available on Chembench. Chembench remains freely accessible at https://chembench.mml.unc.edu.
Collapse
Affiliation(s)
- Stephen J Capuzzi
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, and ‡Department of Computer Science, University of North Carolina , Chapel Hill, North Carolina 27599, United States
| | - Ian Sang-June Kim
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, and ‡Department of Computer Science, University of North Carolina , Chapel Hill, North Carolina 27599, United States
| | - Wai In Lam
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, and ‡Department of Computer Science, University of North Carolina , Chapel Hill, North Carolina 27599, United States
| | - Thomas E Thornton
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, and ‡Department of Computer Science, University of North Carolina , Chapel Hill, North Carolina 27599, United States
| | - Eugene N Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, and ‡Department of Computer Science, University of North Carolina , Chapel Hill, North Carolina 27599, United States
| | - Diane Pozefsky
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, and ‡Department of Computer Science, University of North Carolina , Chapel Hill, North Carolina 27599, United States
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, and ‡Department of Computer Science, University of North Carolina , Chapel Hill, North Carolina 27599, United States
| |
Collapse
|
24
|
Alves VM, Capuzzi SJ, Muratov E, Braga RC, Thornton T, Fourches D, Strickland J, Kleinstreuer N, Andrade CH, Tropsha A. QSAR models of human data can enrich or replace LLNA testing for human skin sensitization. GREEN CHEMISTRY : AN INTERNATIONAL JOURNAL AND GREEN CHEMISTRY RESOURCE : GC 2016; 18:6501-6515. [PMID: 28630595 PMCID: PMC5473635 DOI: 10.1039/c6gc01836j] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Skin sensitization is a major environmental and occupational health hazard. Although many chemicals have been evaluated in humans, there have been no efforts to model these data to date. We have compiled, curated, analyzed, and compared the available human and LLNA data. Using these data, we have developed reliable computational models and applied them for virtual screening of chemical libraries to identify putative skin sensitizers. The overall concordance between murine LLNA and human skin sensitization responses for a set of 135 unique chemicals was low (R = 28-43%), although several chemical classes had high concordance. We have succeeded to develop predictive QSAR models of all available human data with the external correct classification rate of 71%. A consensus model integrating concordant QSAR predictions and LLNA results afforded a higher CCR of 82% but at the expense of the reduced external dataset coverage (52%). We used the developed QSAR models for virtual screening of CosIng database and identified 1061 putative skin sensitizers; for seventeen of these compounds, we found published evidence of their skin sensitization effects. Models reported herein provide more accurate alternative to LLNA testing for human skin sensitization assessment across diverse chemical data. In addition, they can also be used to guide the structural optimization of toxic compounds to reduce their skin sensitization potential.
Collapse
Affiliation(s)
- Vinicius M. Alves
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
- Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goias, Goiania, GO, 74605-170, Brazil
| | - Stephen J. Capuzzi
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Eugene Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
- Department of Chemical Technology, Odessa National Polytechnic University, Odessa, 65000, Ukraine
| | - Rodolpho C. Braga
- Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goias, Goiania, GO, 74605-170, Brazil
| | - Thomas Thornton
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Denis Fourches
- Department of Chemistry, Bioinformatics Research Center, North Carolina State University, Raleigh, NC, 27695, USA
| | - Judy Strickland
- Integrated Laboratory Systems, Inc., P.O. Box 13501, Research Triangle Park, NC, 27709, USA
| | - Nicole Kleinstreuer
- National Institutes of Environmental Health Sciences, Research Triangle Park, NC, 27709, USA
| | - Carolina H. Andrade
- Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goias, Goiania, GO, 74605-170, Brazil
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| |
Collapse
|
25
|
Alves V, Muratov E, Capuzzi S, Politi R, Low Y, Braga R, Zakharov AV, Sedykh A, Mokshyna E, Farag S, Andrade C, Kuz'min V, Fourches D, Tropsha A. Alarms about structural alerts. GREEN CHEMISTRY : AN INTERNATIONAL JOURNAL AND GREEN CHEMISTRY RESOURCE : GC 2016; 18:4348-4360. [PMID: 28503093 PMCID: PMC5423727 DOI: 10.1039/c6gc01492e] [Citation(s) in RCA: 62] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Structural alerts are widely accepted in chemical toxicology and regulatory decision support as a simple and transparent means to flag potential chemical hazards or group compounds into categories for read-across. However, there has been a growing concern that alerts disproportionally flag too many chemicals as toxic, which questions their reliability as toxicity markers. Conversely, the rigorously developed and properly validated statistical QSAR models can accurately and reliably predict the toxicity of a chemical; however, their use in regulatory toxicology has been hampered by the lack of transparency and interpretability. We demonstrate that contrary to the common perception of QSAR models as "black boxes" they can be used to identify statistically significant chemical substructures (QSAR-based alerts) that influence toxicity. We show through several case studies, however, that the mere presence of structural alerts in a chemical, irrespective of the derivation method (expert-based or QSAR-based), should be perceived only as hypotheses of possible toxicological effect. We propose a new approach that synergistically integrates structural alerts and rigorously validated QSAR models for a more transparent and accurate safety assessment of new chemicals.
Collapse
Affiliation(s)
- Vinicius Alves
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
- Laboratory for Molecular Modeling and Design, Department of Pharmacy, Federal University of Goias, Goiania, GO, 74605-170, Brazil
| | - Eugene Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
- Department of Chemical Technology, Odessa National Polytechnic University, Odessa, 65000, Ukraine
| | - Stephen Capuzzi
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Regina Politi
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Yen Low
- Netflix, San Francisco, CA 94123, USA
| | - Rodolpho Braga
- Laboratory for Molecular Modeling and Design, Department of Pharmacy, Federal University of Goias, Goiania, GO, 74605-170, Brazil
| | - Alexey V. Zakharov
- National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, Rockville, MD 20850, USA
| | | | - Elena Mokshyna
- Laboratory of Theoretical Chemistry, A.V. Bogatsky Physical-Chemical Institute NAS of Ukraine, Odessa, 65080, Ukraine
| | - Sherif Farag
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Carolina Andrade
- Laboratory for Molecular Modeling and Design, Department of Pharmacy, Federal University of Goias, Goiania, GO, 74605-170, Brazil
| | - Victor Kuz'min
- Laboratory of Theoretical Chemistry, A.V. Bogatsky Physical-Chemical Institute NAS of Ukraine, Odessa, 65080, Ukraine
| | - Denis Fourches
- Department of Chemistry and Bioinformatics Research Center, North Carolina State University, Raleigh, NC, 27695, USA
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| |
Collapse
|
26
|
Fourches D, Muratov E, Tropsha A. Trust, but Verify II: A Practical Guide to Chemogenomics Data Curation. J Chem Inf Model 2016; 56:1243-52. [PMID: 27280890 PMCID: PMC5657146 DOI: 10.1021/acs.jcim.6b00129] [Citation(s) in RCA: 187] [Impact Index Per Article: 23.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
There is a growing public concern about the lack of reproducibility of experimental data published in peer-reviewed scientific literature. Herein, we review the most recent alerts regarding experimental data quality and discuss initiatives taken thus far to address this problem, especially in the area of chemical genomics. Going beyond just acknowledging the issue, we propose a chemical and biological data curation workflow that relies on existing cheminformatics approaches to flag, and when appropriate, correct possibly erroneous entries in large chemogenomics data sets. We posit that the adherence to the best practices for data curation is important for both experimental scientists who generate primary data and deposit them in chemical genomics databases and computational researchers who rely on these data for model development.
Collapse
Affiliation(s)
- Denis Fourches
- Department of Chemistry, Bioinformatics Research Center, North Carolina State University, Raleigh, NC, 27695, USA
| | - Eugene Muratov
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| |
Collapse
|
27
|
Cherkasov A, Muratov EN, Fourches D, Varnek A, Baskin II, Cronin M, Dearden J, Gramatica P, Martin YC, Todeschini R, Consonni V, Kuz'min VE, Cramer R, Benigni R, Yang C, Rathman J, Terfloth L, Gasteiger J, Richard A, Tropsha A. QSAR modeling: where have you been? Where are you going to? J Med Chem 2014; 57:4977-5010. [PMID: 24351051 PMCID: PMC4074254 DOI: 10.1021/jm4004285] [Citation(s) in RCA: 1033] [Impact Index Per Article: 103.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Quantitative structure-activity relationship modeling is one of the major computational tools employed in medicinal chemistry. However, throughout its entire history it has drawn both praise and criticism concerning its reliability, limitations, successes, and failures. In this paper, we discuss (i) the development and evolution of QSAR; (ii) the current trends, unsolved problems, and pressing challenges; and (iii) several novel and emerging applications of QSAR modeling. Throughout this discussion, we provide guidelines for QSAR development, validation, and application, which are summarized in best practices for building rigorously validated and externally predictive QSAR models. We hope that this Perspective will help communications between computational and experimental chemists toward collaborative development and use of QSAR models. We also believe that the guidelines presented here will help journal editors and reviewers apply more stringent scientific standards to manuscripts reporting new QSAR studies, as well as encourage the use of high quality, validated QSARs for regulatory decision making.
Collapse
Affiliation(s)
- Artem Cherkasov
- Vancouver Prostate Centre, University of British Columbia, Vancouver, BC, V6H3Z6, Canada
| | - Eugene N. Muratov
- Laboratory for Molecular Modeling, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
- Department of Molecular Structure and Cheminformatics, A.V. Bogatsky Physical-Chemical Institute National Academy of Sciences of Ukraine, Odessa, 65080, Ukraine
| | - Denis Fourches
- Laboratory for Molecular Modeling, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Alexandre Varnek
- Department of Chemistry, L. Pasteur University of Strasbourg, Strasbourg, 67000, France
| | - Igor I. Baskin
- Department of Physics, Lomonosov Moscow State University, Moscow, 119991, Russia
| | - Mark Cronin
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, Liverpool L33AF, UK
| | - John Dearden
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, Liverpool L33AF, UK
| | - Paola Gramatica
- Department of Structural and Functional Biology, University of Insubria, Varese, 21100, Italy
| | | | - Roberto Todeschini
- Milano Chemometrics and QSAR Research Group, University of Milano-Bicocca, Milan, 20126, Italy
| | - Viviana Consonni
- Milano Chemometrics and QSAR Research Group, University of Milano-Bicocca, Milan, 20126, Italy
| | - Victor E. Kuz'min
- Department of Molecular Structure and Cheminformatics, A.V. Bogatsky Physical-Chemical Institute National Academy of Sciences of Ukraine, Odessa, 65080, Ukraine
| | | | - Romualdo Benigni
- Environment and Health Department, Istituto Superiore di Sanita’, Rome, 00161, Italy
| | | | - James Rathman
- Altamira LLC, Columbus OH 43235, USA
- Department of Chemical and Biomolecular Engineering, the Ohio State University, Columbus, OH 43215, USA
| | | | | | - Ann Richard
- National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, NC, 27519, USA
| | - Alexander Tropsha
- Laboratory for Molecular Modeling, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC, 27599, USA
| |
Collapse
|
28
|
Vassiliev PM, Spasov AA, Kosolapov VA, Kucheryavenko AF, Gurova NA, Anisimova VA. Consensus Drug Design Using IT Microcosm. CHALLENGES AND ADVANCES IN COMPUTATIONAL CHEMISTRY AND PHYSICS 2014. [DOI: 10.1007/978-94-017-9257-8_12] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
|
29
|
Slavov SH, Pearce BA, Buzatu DA, Wilkes JG, Beger RD. Complementary PLS and KNN algorithms for improved 3D-QSDAR consensus modeling of AhR binding. J Cheminform 2013; 5:47. [PMID: 24257141 PMCID: PMC3843526 DOI: 10.1186/1758-2946-5-47] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2013] [Accepted: 11/15/2013] [Indexed: 11/10/2022] Open
Abstract
Multiple validation techniques (Y-scrambling, complete training/test set randomization, determination of the dependence of R2test on the number of randomization cycles, etc.) aimed to improve the reliability of the modeling process were utilized and their effect on the statistical parameters of the models was evaluated. A consensus partial least squares (PLS)-similarity based k-nearest neighbors (KNN) model utilizing 3D-SDAR (three dimensional spectral data-activity relationship) fingerprint descriptors for prediction of the log(1/EC50) values of a dataset of 94 aryl hydrocarbon receptor binders was developed. This consensus model was constructed from a PLS model utilizing 10 ppm x 10 ppm x 0.5 Å bins and 7 latent variables (R2test of 0.617), and a KNN model using 2 ppm x 2 ppm x 0.5 Å bins and 6 neighbors (R2test of 0.622). Compared to individual models, improvement in predictive performance of approximately 10.5% (R2test of 0.685) was observed. Further experiments indicated that this improvement is likely an outcome of the complementarity of the information contained in 3D-SDAR matrices of different granularity. For similarly sized data sets of Aryl hydrocarbon (AhR) binders the consensus KNN and PLS models compare favorably to earlier reports. The ability of 3D-QSDAR (three dimensional quantitative spectral data-activity relationship) to provide structural interpretation was illustrated by a projection of the most frequently occurring bins on the standard coordinate space, thus allowing identification of structural features related to toxicity.
Collapse
Affiliation(s)
| | | | | | | | - Richard D Beger
- Division of Systems Biology, National Center for Toxicological Research, US Food and Drug Administration, 3900 NCTR Road, Jefferson, AR 72079, USA.
| |
Collapse
|
30
|
Lei B, Li J, Yao X. A Novel Strategy of Structural Similarity Based Consensus Modeling. Mol Inform 2013; 32:599-608. [PMID: 27481768 DOI: 10.1002/minf.201200170] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2012] [Accepted: 04/17/2013] [Indexed: 12/16/2022]
Abstract
A novel strategy of "structural similarity based consensus modeling" (SSCM) based on "model distance and guided model selection" (MD-QGMS) submodel set was proposed. The SSCM strategy is built upon a hypothesis, that is, similar compounds are most probably predicted more accurately by a same submodel among a model population, which can be concluded from the fact that models employing a different set of descriptors can predict compounds with specific structures more accurately. It is proved that the proposed SSCM strategy can remarkably improve the external prediction ability of QSAR models by employing two different datasets. In future, the proposed SSCM strategy may provide a new direction to develop more accurate predictive models.
Collapse
Affiliation(s)
- Beilei Lei
- College of Life Sciences, Northwest A & F University, Yangling 712100, 22 Xinong Road, P. R. China tel: +86-029-87092262.
| | - Jiazhong Li
- School of Pharmacy, Lanzhou University, Lanzhou 730000, P. R. China
| | - Xiaojun Yao
- State Key Laboratory of Applied Organic Chemistry, Department of Chemistry, Lanzhou University, Lanzhou 730000, P. R. China
| |
Collapse
|
31
|
|
32
|
Muratov EN, Varlamova EV, Artemenko AG, Polishchuk PG, Kuz'min VE. Existing and Developing Approaches for QSAR Analysis of Mixtures. Mol Inform 2012; 31:202-21. [PMID: 27477092 DOI: 10.1002/minf.201100129] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2011] [Accepted: 02/04/2012] [Indexed: 11/10/2022]
Abstract
This review is devoted to the critical analysis of advantages and disadvantages of existing mixture descriptors and their usage in various QSAR/QSPR tasks. We describe good practices for the QSAR modeling of mixtures, data sources for mixtures, a discussion of various mixture descriptors and their application, recommendations about proper external validation specific for mixture QSAR modeling, and future perspectives of this field. The biggest problem in QSAR of mixtures is the lack of reliable data about the mixtures' properties. Various mixture descriptors are used for the modeling of different endpoints. However, these descriptors have certain disadvantages, such as applicability only to 1 : 1 binary mixtures, and additive nature. The field of QSAR of mixtures is still under development, and existing efforts could be considered as a foundation for future approaches and studies. The usage of non-additive mixture descriptors, which are sensitive to interaction effects, in combination with best practices of QSAR model development (e.g., thorough data collection and curation, rigorous external validation, etc.) will significantly improve the quality of QSAR studies of mixtures.
Collapse
Affiliation(s)
- Eugene N Muratov
- Laboratory of Theoretical Chemistry, Department of Molecular Structure, A. V. Bogatsky Physical Chemical Institute, National Academy of Sciences of Ukraine, Lustdorfskaya Doroga 86, Odessa 65080, Ukraine tel: +380487662394, fax: +380487662394. , .,Laboratory for Molecular Modeling, Division of Medicinal Chemistry and Natural Products, Eshelman School of Pharmacy, University of North Carolina, Beard Hall 301, CB#7568, Chapel Hill, NC, 27599, USA tel: +19199663459, fax: +19199660204. ,
| | - Ekaterina V Varlamova
- Laboratory of Theoretical Chemistry, Department of Molecular Structure, A. V. Bogatsky Physical Chemical Institute, National Academy of Sciences of Ukraine, Lustdorfskaya Doroga 86, Odessa 65080, Ukraine tel: +380487662394, fax: +380487662394
| | - Anatoly G Artemenko
- Laboratory of Theoretical Chemistry, Department of Molecular Structure, A. V. Bogatsky Physical Chemical Institute, National Academy of Sciences of Ukraine, Lustdorfskaya Doroga 86, Odessa 65080, Ukraine tel: +380487662394, fax: +380487662394
| | - Pavel G Polishchuk
- Laboratory of Theoretical Chemistry, Department of Molecular Structure, A. V. Bogatsky Physical Chemical Institute, National Academy of Sciences of Ukraine, Lustdorfskaya Doroga 86, Odessa 65080, Ukraine tel: +380487662394, fax: +380487662394
| | - Victor E Kuz'min
- Laboratory of Theoretical Chemistry, Department of Molecular Structure, A. V. Bogatsky Physical Chemical Institute, National Academy of Sciences of Ukraine, Lustdorfskaya Doroga 86, Odessa 65080, Ukraine tel: +380487662394, fax: +380487662394
| |
Collapse
|
33
|
Liew CY, Lim YC, Yap CW. Mixed learning algorithms and features ensemble in hepatotoxicity prediction. J Comput Aided Mol Des 2011; 25:855-71. [DOI: 10.1007/s10822-011-9468-3] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2011] [Accepted: 08/23/2011] [Indexed: 12/22/2022]
|
34
|
QSAR analysis of [(biphenyloxy)propyl]isoxazoles: agents against coxsackievirus B3. Future Med Chem 2011; 3:15-27. [PMID: 21428823 DOI: 10.4155/fmc.10.278] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND Antiviral drugs are urgently needed for the treatment of acute and chronic diseases caused by enteroviruses such as coxsackievirus B3 (CVB3). The main goal of this study is quantitative structure-activity relationship (QSAR) analysis of anti-CVB3 activity (clinical CVB3 isolate 97927 [log IC50, µM]) and investigation of the selectivity of 25 ([biphenyloxy]propyl)isoxazoles, followed by computer-aided design and virtual screening of novel active compounds. DISCUSSION The 2D QSAR obtained models are quite satisfactory (R(2) = 0.84-0.99, Q(2) = 0.76-0.92, R(2)(ext) = 0.62-0.79). Compounds with high antiviral activity and selectivity have to contain 5-trifluoromethyl-[1,2,4]oxadiazole or 2,4-difluorophenyl fragments. Insertion of 2,5-dimethylbenzene, napthyl and especially biphenyl substituents into investigated compounds substantially decreases both their antiviral activity and selectivity. Several compounds were proposed as a result of design and virtual screening. A high level of activity of 2-methoxy-1-phenyl-1H-imidazo[4,5-c]pyridine (sm428) was confirmed experimentally. CONCLUSION Simplex representation of molecular structure allows successful QSAR analysis of anti-CVB3 activity of ([biphenyloxy]propyl)isoxazole derivatives. Two possible ways of battling CVB3 are considered as a future perspective.
Collapse
|
35
|
Artemenko A, Muratov EN, Kuz’min V, Muratov N, Varlamova E, Kuz'mina A, Gorb LG, Golius A, Hill F, Leszczynski J, Tropsha A. QSAR analysis of the toxicity of nitroaromatics in Tetrahymena pyriformis: structural factors and possible modes of action. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2011; 22:575-601. [PMID: 21714735 PMCID: PMC3442116 DOI: 10.1080/1062936x.2011.569950] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
The Hierarchical Technology for Quantitative Structure-Activity Relationships (HiT QSAR) was applied to 95 diverse nitroaromatic compounds (including some widely known explosives) tested for their toxicity (50% inhibition growth concentration, IGC₅₀) against the ciliate Tetrahymena pyriformis. The dataset was divided into subsets according to putative mechanisms of toxicity. The Classification and Regression Trees (CART) approach implemented within HiT QSAR has been used for prediction of mechanism of toxicity for new compounds. The resulting models were shown to have ~80% accuracy for external datasets indicating that the mechanistic dataset division was sensible. The Partial Least Squares (PLS) statistical approach was then used to develop 2D QSAR models. Validated PLS models were explored to: (1) elucidate the effects of different substituents in nitroaromatic compounds on toxicity; (2) differentiate compounds by probable mechanisms of toxicity based on their structural descriptors; and (3) analyse the role of various physical-chemical factors responsible for compounds' toxicity. Models were interpreted in terms of molecular fragments promoting or interfering with toxicity. It was also shown that mutual influence of substituents in benzene ring plays the determining role in toxicity variation. Although chemical mechanism based models were statistically significant and externally predictive (r²(ext) = 0.64 for the external set of 63 nitroaromatics identified after all calculations have been completed), they were also shown to have limited coverage (57% for modelling and 76% for external set).
Collapse
Affiliation(s)
- A.G. Artemenko
- A.V. Bogatsky Physical-Chemical Institute National Academy of Sciences of Ukraine, Lustdorfskaya Doroga 86, Odessa 65080, Ukraine
- Interdisciplinary Nanotoxicity Center, Jackson State University, 1400 J.R. Lynch Str., Jackson, Mississippi, 39217 USA
| | - E. N. Muratov
- A.V. Bogatsky Physical-Chemical Institute National Academy of Sciences of Ukraine, Lustdorfskaya Doroga 86, Odessa 65080, Ukraine
- Interdisciplinary Nanotoxicity Center, Jackson State University, 1400 J.R. Lynch Str., Jackson, Mississippi, 39217 USA
- Division of Medicinal Chemistry and Natural Products, School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, 27599, USA
| | - V.E. Kuz’min
- A.V. Bogatsky Physical-Chemical Institute National Academy of Sciences of Ukraine, Lustdorfskaya Doroga 86, Odessa 65080, Ukraine
- Interdisciplinary Nanotoxicity Center, Jackson State University, 1400 J.R. Lynch Str., Jackson, Mississippi, 39217 USA
| | - N.N. Muratov
- Odessa National Polytechnic University, 1 Shevchenko Ave., Odessa, 65000, Ukraine
| | - E.V. Varlamova
- A.V. Bogatsky Physical-Chemical Institute National Academy of Sciences of Ukraine, Lustdorfskaya Doroga 86, Odessa 65080, Ukraine
| | - A.V. Kuz'mina
- Odessa National Medicinal University, 2 Ol'gievskaya Str, Odessa, 65000, Ukraine
| | - L. G. Gorb
- Badger Technical Services, LLC, Vicksburg, Mississippi, USA
| | - A. Golius
- Kharkiv National V.N. Karazin University, Department of Radophysics, Karkiv, 61077, Ukraine
| | - F.C. Hill
- US Army ERDC, 3532 Manor Dr, Vicksburg, Mississippi, 39180, USA
| | - J. Leszczynski
- Interdisciplinary Nanotoxicity Center, Jackson State University, 1400 J.R. Lynch Str., Jackson, Mississippi, 39217 USA
| | - A. Tropsha
- Division of Medicinal Chemistry and Natural Products, School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, 27599, USA
| |
Collapse
|
36
|
Fourches D, Muratov E, Tropsha A. Trust, but verify: on the importance of chemical structure curation in cheminformatics and QSAR modeling research. J Chem Inf Model 2010; 50:1189-204. [PMID: 20572635 DOI: 10.1021/ci100176x] [Citation(s) in RCA: 470] [Impact Index Per Article: 33.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Denis Fourches
- Laboratory for Molecular Modeling, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | | | | |
Collapse
|
37
|
Per aspera ad astra: application of Simplex QSAR approach in antiviral research. Future Med Chem 2010; 2:1205-26. [DOI: 10.4155/fmc.10.194] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
This review explores the application of the Simplex representation of molecular structure (SiRMS) QSAR approach in antiviral research. We provide an introduction to and description of SiRMS, its application in antiviral research and future directions of development of the Simplex approach and the whole QSAR field. In the Simplex approach every molecule is represented as a system of different simplexes (tetratomic fragments with fixed composition, structure, chirality and symmetry). The main advantages of SiRMS are consideration of the different physical–chemical properties of atoms, high adequacy and good interpretability of models obtained and clear procedures for molecular design. The reliability of developed QSAR models as predictive virtual screening tools and their ability to serve as the basis of directed drug design was validated by subsequent synthetic and biological experiments. The SiRMS approach is realized as the complex of the computer program ‘HiT QSAR’, which is available on request.
Collapse
|