51
|
Galati S, Di Stefano M, Bertini S, Granchi C, Giordano A, Gado F, Macchia M, Tuccinardi T, Poli G. Identification of New GSK3β Inhibitors through a Consensus Machine Learning-Based Virtual Screening. Int J Mol Sci 2023; 24:17233. [PMID: 38139062 PMCID: PMC10743990 DOI: 10.3390/ijms242417233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 12/05/2023] [Accepted: 12/06/2023] [Indexed: 12/24/2023] Open
Abstract
Glycogen synthase kinase-3 beta (GSK3β) is a serine/threonine kinase that plays key roles in glycogen metabolism, Wnt/β-catenin signaling cascade, synaptic modulation, and multiple autophagy-related signaling pathways. GSK3β is an attractive target for drug discovery since its aberrant activity is involved in the development of neurodegenerative diseases such as Alzheimer's and Parkinson's disease. In the present study, multiple machine learning models aimed at identifying novel GSK3β inhibitors were developed and evaluated for their predictive reliability. The most powerful models were combined in a consensus approach, which was used to screen about 2 million commercial compounds. Our consensus machine learning-based virtual screening led to the identification of compounds G1 and G4, which showed inhibitory activity against GSK3β in the low-micromolar and sub-micromolar range, respectively. These results demonstrated the reliability of our virtual screening approach. Moreover, docking and molecular dynamics simulation studies were employed for predicting reliable binding modes for G1 and G4, which represent two valuable starting points for future hit-to-lead and lead optimization studies.
Collapse
Affiliation(s)
- Salvatore Galati
- Department of Pharmacy, University of Pisa, 56126 Pisa, Italy; (S.G.); (M.D.S.); (S.B.); (C.G.); (M.M.); (G.P.)
| | - Miriana Di Stefano
- Department of Pharmacy, University of Pisa, 56126 Pisa, Italy; (S.G.); (M.D.S.); (S.B.); (C.G.); (M.M.); (G.P.)
- Department of Life Sciences, University of Siena, 53100 Siena, Italy
| | - Simone Bertini
- Department of Pharmacy, University of Pisa, 56126 Pisa, Italy; (S.G.); (M.D.S.); (S.B.); (C.G.); (M.M.); (G.P.)
| | - Carlotta Granchi
- Department of Pharmacy, University of Pisa, 56126 Pisa, Italy; (S.G.); (M.D.S.); (S.B.); (C.G.); (M.M.); (G.P.)
| | - Antonio Giordano
- Sbarro Institute for Cancer Research and Molecular Medicine Center for Biotechnology, College of Science and Technology, Temple University, Philadelphia, PA 19122, USA;
- Department of Medical Biotechnologies, University of Siena, 53100 Siena, Italy
| | - Francesca Gado
- Department of Pharmaceutical Sciences, University of Milan, 20133 Milan, Italy;
| | - Marco Macchia
- Department of Pharmacy, University of Pisa, 56126 Pisa, Italy; (S.G.); (M.D.S.); (S.B.); (C.G.); (M.M.); (G.P.)
| | - Tiziano Tuccinardi
- Department of Pharmacy, University of Pisa, 56126 Pisa, Italy; (S.G.); (M.D.S.); (S.B.); (C.G.); (M.M.); (G.P.)
| | - Giulio Poli
- Department of Pharmacy, University of Pisa, 56126 Pisa, Italy; (S.G.); (M.D.S.); (S.B.); (C.G.); (M.M.); (G.P.)
| |
Collapse
|
52
|
Wang Y, Wang G, Zhao Y, Wang C, Chen C, Ding Y, Lin J, You J, Gao S, Pang X. A deep learning model for predicting multidrug-resistant organism infection in critically ill patients. J Intensive Care 2023; 11:49. [PMID: 37941079 PMCID: PMC10633993 DOI: 10.1186/s40560-023-00695-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Accepted: 10/12/2023] [Indexed: 11/10/2023] Open
Abstract
BACKGROUND This study aimed to apply the backpropagation neural network (BPNN) to develop a model for predicting multidrug-resistant organism (MDRO) infection in critically ill patients. METHODS This study collected patient information admitted to the intensive care unit (ICU) of the Affiliated Hospital of Qingdao University from August 2021 to January 2022. All patients enrolled were divided randomly into a training set (80%) and a test set (20%). The least absolute shrinkage and selection operator and stepwise regression analysis were used to determine the independent risk factors for MDRO infection. A BPNN model was constructed based on these factors. Then, we externally validated this model in patients from May 2022 to July 2022 over the same center. The model performance was evaluated by the calibration curve, the area under the curve (AUC), sensitivity, specificity, and accuracy. RESULTS In the primary cohort, 688 patients were enrolled, including 109 (15.84%) MDRO infection patients. Risk factors for MDRO infection, as determined by the primary cohort, included length of hospitalization, length of ICU stay, long-term bed rest, antibiotics use before ICU, acute physiology and chronic health evaluation II, invasive operation before ICU, quantity of antibiotics, chronic lung disease, and hypoproteinemia. There were 238 patients in the validation set, including 31 (13.03%) MDRO infection patients. This BPNN model yielded good calibration. The AUC of the training set, the test set and the validation set were 0.889 (95% CI 0.852-0.925), 0.919 (95% CI 0.856-0.983), and 0.811 (95% CI 0.731-0.891), respectively. CONCLUSIONS This study confirmed nine independent risk factors for MDRO infection. The BPNN model performed well and was potentially used to predict MDRO infection in ICU patients.
Collapse
Affiliation(s)
- Yaxi Wang
- Department of Critical Care Medicine, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao, 266000, China
| | - Gang Wang
- Department of Critical Care Medicine, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao, 266000, China
| | - Yuxiao Zhao
- Department of Critical Care Medicine, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao, 266000, China
| | - Cheng Wang
- Department of Critical Care Medicine, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao, 266000, China
| | - Chen Chen
- School of Nursing, Qingdao University, No. 38 Dengzhou Road, Qingdao, 266021, China
| | - Yaoyao Ding
- Department of Critical Care Medicine, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao, 266000, China
| | - Jing Lin
- Department of Critical Care Medicine, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao, 266000, China
| | - Jingjing You
- Department of Critical Care Medicine, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao, 266000, China
| | - Silong Gao
- Department of Critical Care Medicine, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao, 266000, China.
| | - Xufeng Pang
- Department of Hospital-Acquired Infection Control, The Affiliated Hospital of Qingdao University, No. 16 Jiangsu Road, Qingdao, 266000, China.
| |
Collapse
|
53
|
Rodríguez-Belenguer P, March-Vila E, Pastor M, Mangas-Sanjuan V, Soria-Olivas E. Usage of model combination in computational toxicology. Toxicol Lett 2023; 389:34-44. [PMID: 37890682 DOI: 10.1016/j.toxlet.2023.10.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 10/17/2023] [Accepted: 10/24/2023] [Indexed: 10/29/2023]
Abstract
New Approach Methodologies (NAMs) have ushered in a new era in the field of toxicology, aiming to replace animal testing. However, despite these advancements, they are not exempt from the inherent complexities associated with the study's endpoint. In this review, we have identified three major groups of complexities: mechanistic, chemical space, and methodological. The mechanistic complexity arises from interconnected biological processes within a network that are challenging to model in a single step. In the second group, chemical space complexity exhibits significant dissimilarity between compounds in the training and test series. The third group encompasses algorithmic and molecular descriptor limitations and typical class imbalance problems. To address these complexities, this work provides a guide to the usage of a combination of predictive Quantitative Structure-Activity Relationship (QSAR) models, known as metamodels. This combination of low-level models (LLMs) enables a more precise approach to the problem by focusing on different sub-mechanisms or sub-processes. For mechanistic complexity, multiple Molecular Initiating Events (MIEs) or levels of information are combined to form a mechanistic-based metamodel. Regarding the complexity arising from chemical space, two types of approaches were reviewed to construct a fragment-based chemical space metamodel: those with and without structure sharing. Metamodels with structure sharing utilize unsupervised strategies to identify data patterns and build low-level models for each cluster, which are then combined. For situations without structure sharing due to pharmaceutical industry intellectual property, the use of prediction sharing, and federated learning approaches have been reviewed. Lastly, to tackle methodological complexity, various algorithms are combined to overcome their limitations, diverse descriptors are employed to enhance problem definition and balanced dataset combinations are used to address class imbalance issues (methodological-based metamodels). Remarkably, metamodels consistently outperformed classical QSAR models across all cases, highlighting the importance of alternatives to classical QSAR models when faced with such complexities.
Collapse
Affiliation(s)
- Pablo Rodríguez-Belenguer
- Research Programme on Biomedical Informatics (GRIB), Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Hospital del Mar Medical Research Institute, 08003 Barcelona, Spain; Department of Pharmacy and Pharmaceutical Technology and Parasitology, Universitat de València, 46100 Valencia, Spain
| | - Eric March-Vila
- Research Programme on Biomedical Informatics (GRIB), Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Hospital del Mar Medical Research Institute, 08003 Barcelona, Spain
| | - Manuel Pastor
- Research Programme on Biomedical Informatics (GRIB), Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Hospital del Mar Medical Research Institute, 08003 Barcelona, Spain
| | - Victor Mangas-Sanjuan
- Department of Pharmacy and Pharmaceutical Technology and Parasitology, Universitat de València, 46100 Valencia, Spain; Interuniversity Research Institute for Molecular Recognition and Technological Development, Universitat Politècnica de València, 46100 Valencia, Spain
| | - Emilio Soria-Olivas
- IDAL, Intelligent Data Analysis Laboratory, ETSE, Universitat de València, 46100 Valencia, Spain.
| |
Collapse
|
54
|
Haddad S, Oktay L, Erol I, Şahin K, Durdagi S. Utilizing Heteroatom Types and Numbers from Extensive Ligand Libraries to Develop Novel hERG Blocker QSAR Models Using Machine Learning-Based Classifiers. ACS OMEGA 2023; 8:40864-40877. [PMID: 37929100 PMCID: PMC10620895 DOI: 10.1021/acsomega.3c06074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 09/13/2023] [Indexed: 11/07/2023]
Abstract
The human ether-à-go-go-related gene (hERG) channel plays a crucial role in membrane repolarization. Any disruptions in its function can lead to severe cardiovascular disorders such as long QT syndrome (LQTS), which increases the risk of serious cardiovascular problems such as tachyarrhythmia and sudden cardiac death. Drug-induced LQTS is a significant concern and has resulted in drug withdrawals from the market in the past. The main objective of this study is to pinpoint crucial heteroatoms present in ligands that initiate interactions leading to the effective blocking of the hERG channel. To achieve this aim, ligand-based quantitative structure-activity relationships (QSAR) models were constructed using extensive ligand libraries, considering the heteroatom types and numbers, and their associated hERG channel blockage pIC50 values. Machine learning-assisted QSAR models were developed to analyze the key structural components influencing compound activity. Among the various methods, the KPLS method proved to be the most efficient, allowing the construction of models based on eight distinct fingerprints. The study delved into investigating the influence of heteroatoms on the activity of hERG blockers, revealing their significant role. Furthermore, by quantifying the effect of heteroatom types and numbers on ligand activity at the hERG channel, six compound pairs were selected for molecular docking. Subsequent molecular dynamics simulations and per residue MM/GBSA calculations were performed to comprehensively analyze the interactions of the selected pair compounds.
Collapse
Affiliation(s)
- Safa Haddad
- Computational
Biology and Molecular Simulations Laboratory, Department of Biophysics,
School of Medicine, Bahçeşehir
University, Istanbul 34353, Turkey
- Computational
Drug Design Center (HITMER), Bahçeşehir
University, Istanbul 34353, Turkey
| | - Lalehan Oktay
- Computational
Biology and Molecular Simulations Laboratory, Department of Biophysics,
School of Medicine, Bahçeşehir
University, Istanbul 34353, Turkey
- Computational
Drug Design Center (HITMER), Bahçeşehir
University, Istanbul 34353, Turkey
| | - Ismail Erol
- Computational
Biology and Molecular Simulations Laboratory, Department of Biophysics,
School of Medicine, Bahçeşehir
University, Istanbul 34353, Turkey
- Computational
Drug Design Center (HITMER), Bahçeşehir
University, Istanbul 34353, Turkey
| | - Kader Şahin
- Department
of Analytical Chemistry, School of Pharmacy, Bahçeşehir University, Istanbul 34734, Turkey
| | - Serdar Durdagi
- Computational
Biology and Molecular Simulations Laboratory, Department of Biophysics,
School of Medicine, Bahçeşehir
University, Istanbul 34353, Turkey
- Computational
Drug Design Center (HITMER), Bahçeşehir
University, Istanbul 34353, Turkey
- Molecular
Therapy Lab, Department of Pharmaceutical Chemistry, School of Pharmacy, Bahçeşehir University, Istanbul 34353, Turkey
| |
Collapse
|
55
|
Liu W, Hopkins AM, Yan P, Du S, Luyt LG, Li Y, Hou J. Can machine learning 'transform' peptides/peptidomimetics into small molecules? A case study with ghrelin receptor ligands. Mol Divers 2023; 27:2239-2255. [PMID: 36331785 DOI: 10.1007/s11030-022-10555-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 10/19/2022] [Indexed: 11/06/2022]
Abstract
There has been considerable interest in transforming peptides into small molecules as peptide-based molecules often present poorer bioavailability and lower metabolic stability. Our studies looked into building machine learning (ML) models to investigate if ML is able to identify the 'bioactive' features of peptides and use the features to accurately discriminate between binding and non-binding small molecules. The ghrelin receptor (GR), a receptor that is implicated in various diseases, was used as an example to demonstrate whether ML models derived from a peptide library can be used to predict small molecule binders. ML models based on three different algorithms, namely random forest, support vector machine, and extreme gradient boosting, were built based on a carefully curated dataset of peptide/peptidomimetic and small molecule GR ligands. The results indicated that ML models trained with a dataset exclusively composed of peptides/peptidomimetics provide limited predictive power for small molecules, but that ML models trained with a diverse dataset composed of an array of both peptides/peptidomimetics and small molecules displayed exceptional results in terms of accuracy and false rates. The diversified models can accurately differentiate the binding small molecules from non-binding small molecules using an external validation set with new small molecules that we synthesized previously. Structural features that are the most critical contributors to binding activity were extracted and are remarkably consistent with the crystallography and mutagenesis studies.
Collapse
Affiliation(s)
- Wenjie Liu
- Department of Chemistry, Lakehead University and Thunder Bay Regional Health Research Institute, 980 Oliver Road, Thunder Bay, ON, P7B 6V4, Canada
| | - Austin M Hopkins
- Department of Chemistry, Lakehead University and Thunder Bay Regional Health Research Institute, 980 Oliver Road, Thunder Bay, ON, P7B 6V4, Canada
| | - Peizhi Yan
- Department of Electrical and Computer Engineering, The University of British Columbia, Vancouver, BC, Canada
| | - Shan Du
- Department of Computer Science, Mathematics, Physics and Statistics, The University of British Columbia, Okanagan, Kelowna, BC, Canada
| | - Leonard G Luyt
- Department of Chemistry, University of Western Ontario, London, ON, Canada
- London Regional Cancer Program, Lawson Health Research Institute, London, ON, Canada
| | - Yifeng Li
- Department of Computer Science, Brock University, Saint Catharines, ON, Canada
| | - Jinqiang Hou
- Department of Chemistry, Lakehead University and Thunder Bay Regional Health Research Institute, 980 Oliver Road, Thunder Bay, ON, P7B 6V4, Canada.
| |
Collapse
|
56
|
Luo L, Wu A, Shu X, Liu L, Feng Z, Zeng Q, Wang Z, Hu T, Cao Y, Tu Y, Li Z. Hub gene identification and molecular subtype construction for Helicobacter pylori in gastric cancer via machine learning methods and NMF algorithm. Aging (Albany NY) 2023; 15:11782-11810. [PMID: 37768204 PMCID: PMC10683617 DOI: 10.18632/aging.205053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 07/19/2023] [Indexed: 09/29/2023]
Abstract
Helicobacter pylori (HP) is a gram-negative and spiral-shaped bacterium colonizing the human stomach and has been recognized as the risk factor of gastritis, peptic ulcer disease, and gastric cancer (GC). Moreover, it was recently identified as a class I carcinogen, which affects the occurrence and progression of GC via inducing various oncogenic pathways. Therefore, identifying the HP-related key genes is crucial for understanding the oncogenic mechanisms and improving the outcomes of GC patients. We retrieved the list of HP-related gene sets from the Molecular Signatures Database. Based on the HP-related genes, unsupervised non-negative matrix factorization (NMF) clustering method was conducted to stratify TCGA-STAD, GSE15459, GSE84433 samples into two clusters with distinct clinical outcomes and immune infiltration characterization. Subsequently, two machine learning (ML) strategies, including support vector machine-recursive feature elimination (SVM-RFE) and random forest (RF), were employed to determine twelve hub HP-related genes. Beyond that, receiver operating characteristic and Kaplan-Meier curves further confirmed the diagnostic value and prognostic significance of hub genes. Finally, expression of HP-related hub genes was tested by qRT-PCR array and immunohistochemical images. Additionally, functional pathway enrichment analysis indicated that these hub genes were implicated in the genesis and progression of GC by activating or inhibiting the classical cancer-associated pathways, such as epithelial-mesenchymal transition, cell cycle, apoptosis, RAS/MAPK, etc. In the present study, we constructed a novel HP-related tumor classification in different datasets, and screened out twelve hub genes via performing the ML algorithms, which may contribute to the molecular diagnosis and personalized therapy of GC.
Collapse
Affiliation(s)
- Lianghua Luo
- Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
- Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Ahao Wu
- Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
- Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Xufeng Shu
- Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
- Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Li Liu
- Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
- Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Zongfeng Feng
- Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
- Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Qingwen Zeng
- Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
- Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Zhonghao Wang
- Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
- Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Tengcheng Hu
- Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
- Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Yi Cao
- Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Yi Tu
- Department of Pathology, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| | - Zhengrong Li
- Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
| |
Collapse
|
57
|
Khondkaryan L, Tevosyan A, Navasardyan H, Khachatrian H, Tadevosyan G, Apresyan L, Chilingaryan G, Navoyan Z, Stopper H, Babayan N. Datasets Construction and Development of QSAR Models for Predicting Micronucleus In Vitro and In Vivo Assay Outcomes. TOXICS 2023; 11:785. [PMID: 37755795 PMCID: PMC10537630 DOI: 10.3390/toxics11090785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 09/07/2023] [Accepted: 09/11/2023] [Indexed: 09/28/2023]
Abstract
In silico (quantitative) structure-activity relationship modeling is an approach that provides a fast and cost-effective alternative to assess the genotoxic potential of chemicals. However, one of the limiting factors for model development is the availability of consolidated experimental datasets. In the present study, we collected experimental data on micronuclei in vitro and in vivo, utilizing databases and conducting a PubMed search, aided by text mining using the BioBERT large language model. Chemotype enrichment analysis on the updated datasets was performed to identify enriched substructures. Additionally, chemotypes common for both endpoints were found. Five machine learning models in combination with molecular descriptors, twelve fingerprints and two data balancing techniques were applied to construct individual models. The best-performing individual models were selected for the ensemble construction. The curated final dataset consists of 981 chemicals for micronuclei in vitro and 1309 for mouse micronuclei in vivo, respectively. Out of 18 chemotypes enriched in micronuclei in vitro, only 7 were found to be relevant for in vivo prediction. The ensemble model exhibited high accuracy and sensitivity when applied to an external test set of in vitro data. A good balanced predictive performance was also achieved for the micronucleus in vivo endpoint.
Collapse
Affiliation(s)
- Lusine Khondkaryan
- Institute of Molecular Biology, NAS RA, Yerevan 0014, Armenia; (L.K.); (G.T.); (L.A.)
- Toxometris.ai, Yerevan 0009, Armenia; (A.T.); (H.N.); (Z.N.)
| | - Ani Tevosyan
- Toxometris.ai, Yerevan 0009, Armenia; (A.T.); (H.N.); (Z.N.)
- YerevaNN, Yerevan 0025, Armenia; (H.K.); (G.C.)
| | | | - Hrant Khachatrian
- YerevaNN, Yerevan 0025, Armenia; (H.K.); (G.C.)
- Department of Informatics and Applied Mathematics, Yerevan State University, Yerevan 0025, Armenia
| | - Gohar Tadevosyan
- Institute of Molecular Biology, NAS RA, Yerevan 0014, Armenia; (L.K.); (G.T.); (L.A.)
- Toxometris.ai, Yerevan 0009, Armenia; (A.T.); (H.N.); (Z.N.)
| | - Lilit Apresyan
- Institute of Molecular Biology, NAS RA, Yerevan 0014, Armenia; (L.K.); (G.T.); (L.A.)
- Toxometris.ai, Yerevan 0009, Armenia; (A.T.); (H.N.); (Z.N.)
| | | | - Zaven Navoyan
- Toxometris.ai, Yerevan 0009, Armenia; (A.T.); (H.N.); (Z.N.)
| | - Helga Stopper
- Institute of Pharmacology and Toxicology, University of Würzburg, 97078 Würzburg, Germany;
| | - Nelly Babayan
- Institute of Molecular Biology, NAS RA, Yerevan 0014, Armenia; (L.K.); (G.T.); (L.A.)
- Toxometris.ai, Yerevan 0009, Armenia; (A.T.); (H.N.); (Z.N.)
| |
Collapse
|
58
|
Hosseini-Gerami L, Hernansaiz Ballesteros R, Liu A, Broughton H, Collier DA, Bender A. MAVEN: compound mechanism of action analysis and visualisation using transcriptomics and compound structure data in R/Shiny. BMC Bioinformatics 2023; 24:344. [PMID: 37715141 PMCID: PMC10502988 DOI: 10.1186/s12859-023-05416-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 07/18/2023] [Indexed: 09/17/2023] Open
Abstract
BACKGROUND Understanding the Mechanism of Action (MoA) of a compound is an often challenging but equally crucial aspect of drug discovery that can help improve both its efficacy and safety. Computational methods to aid MoA elucidation usually either aim to predict direct drug targets, or attempt to understand modulated downstream pathways or signalling proteins. Such methods usually require extensive coding experience and results are often optimised for further computational processing, making them difficult for wet-lab scientists to perform, interpret and draw hypotheses from. RESULTS To address this issue, we in this work present MAVEN (Mechanism of Action Visualisation and Enrichment), an R/Shiny app which allows for GUI-based prediction of drug targets based on chemical structure, combined with causal reasoning based on causal protein-protein interactions and transcriptomic perturbation signatures. The app computes a systems-level view of the mechanism of action of the input compound. This is visualised as a sub-network linking predicted or known targets to modulated transcription factors via inferred signalling proteins. The tool includes a selection of MSigDB gene set collections to perform pathway enrichment on the resulting network, and also allows for custom gene sets to be uploaded by the researcher. MAVEN is hence a user-friendly, flexible tool for researchers without extensive bioinformatics or cheminformatics knowledge to generate interpretable hypotheses of compound Mechanism of Action. CONCLUSIONS MAVEN is available as a fully open-source tool at https://github.com/laylagerami/MAVEN with options to install in a Docker or Singularity container. Full documentation, including a tutorial on example data, is available at https://laylagerami.github.io/MAVEN .
Collapse
Affiliation(s)
- Layla Hosseini-Gerami
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK.
- Ignota Labs, London, UK.
| | - Rosa Hernansaiz Ballesteros
- Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg University, Heidelberg, Germany
| | - Anika Liu
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK
| | - Howard Broughton
- Eli Lilly and Company Centre de Investigacion, Alcobendas, Spain
| | - David Andrew Collier
- Eli Lilly and Company, Bracknell, UK
- King's College London, and Genetics and Genomics Consulting, Surrey, UK
| | - Andreas Bender
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK.
| |
Collapse
|
59
|
Rampogu S, Shaik MR, Khan M, Khan M, Oh TH, Shaik B. CBPDdb: a curated database of compounds derived from Coumarin-Benzothiazole-Pyrazole. Database (Oxford) 2023; 2023:baad062. [PMID: 37702993 PMCID: PMC10498939 DOI: 10.1093/database/baad062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 08/01/2023] [Accepted: 08/26/2023] [Indexed: 09/14/2023]
Abstract
The present article describes the building of a small-molecule web server, CBPDdb, employing R-shiny. For the generation of the web server, three compounds were chosen, namely coumarin, benzothiazole and pyrazole, and their derivatives were curated from the literature. The two-dimensional (2D) structures were drawn using ChemDraw, and the .sdf file was created employing Discovery Studio Visualizer v2017. These compounds were read on the R-shiny app using ChemmineR, and the dataframe consisting of a total of 1146 compounds was generated and manipulated employing the dplyr package. The web server is provided with JSME 2D sketcher. The descriptors of the compounds are obtained using propOB with a filter. The users can download the filtered data in the .csv and .sdf formats, and the entire dataset of a compound can be downloaded in .sdf format. This web server facilitates the researchers to screen plausible inhibitors for different diseases. Additionally, the method used in building the web server can be adapted for developing other small-molecule databases (web servers) in RStudio. Database URL: https://srampogu.shinyapps.io/CBPDdb_Revised/.
Collapse
Affiliation(s)
| | - Mohammed Rafi Shaik
- Department of Chemistry, College of Science, King Saud University, P.O. Box 2455, Riyadh 11451, Saudi Arabia
| | - Merajuddin Khan
- Department of Chemistry, College of Science, King Saud University, P.O. Box 2455, Riyadh 11451, Saudi Arabia
| | - Mujeeb Khan
- Department of Chemistry, College of Science, King Saud University, P.O. Box 2455, Riyadh 11451, Saudi Arabia
| | - Tae Hwan Oh
- School of Chemical Engineering, Yeungnam University, Gyeongsan 38541, Republic of Korea
| | - Baji Shaik
- School of Chemical Engineering, Yeungnam University, Gyeongsan 38541, Republic of Korea
| |
Collapse
|
60
|
Han R, Yoon H, Kim G, Lee H, Lee Y. Revolutionizing Medicinal Chemistry: The Application of Artificial Intelligence (AI) in Early Drug Discovery. Pharmaceuticals (Basel) 2023; 16:1259. [PMID: 37765069 PMCID: PMC10537003 DOI: 10.3390/ph16091259] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 08/24/2023] [Accepted: 09/04/2023] [Indexed: 09/29/2023] Open
Abstract
Artificial intelligence (AI) has permeated various sectors, including the pharmaceutical industry and research, where it has been utilized to efficiently identify new chemical entities with desirable properties. The application of AI algorithms to drug discovery presents both remarkable opportunities and challenges. This review article focuses on the transformative role of AI in medicinal chemistry. We delve into the applications of machine learning and deep learning techniques in drug screening and design, discussing their potential to expedite the early drug discovery process. In particular, we provide a comprehensive overview of the use of AI algorithms in predicting protein structures, drug-target interactions, and molecular properties such as drug toxicity. While AI has accelerated the drug discovery process, data quality issues and technological constraints remain challenges. Nonetheless, new relationships and methods have been unveiled, demonstrating AI's expanding potential in predicting and understanding drug interactions and properties. For its full potential to be realized, interdisciplinary collaboration is essential. This review underscores AI's growing influence on the future trajectory of medicinal chemistry and stresses the importance of ongoing synergies between computational and domain experts.
Collapse
Affiliation(s)
| | | | | | | | - Yoonji Lee
- College of Pharmacy, Chung-Ang University, Seoul 06974, Republic of Korea
| |
Collapse
|
61
|
Wang Z, Zhou L, Hao W, Liu Y, Xiao X, Shan X, Zhang C, Wei B. Comparative antioxidant activity and untargeted metabolomic analyses of cherry extracts of two Chinese cherry species based on UPLC-QTOF/MS and machine learning algorithms. Food Res Int 2023; 171:113059. [PMID: 37330825 DOI: 10.1016/j.foodres.2023.113059] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 05/03/2023] [Accepted: 05/26/2023] [Indexed: 06/19/2023]
Abstract
P. pseudocerasus and P. tomentosa are the two native Chinese cherry species of high economic and ornamental worths. Little is known about the metabolic information of P. pseudocerasus and P. tomentosa. Effective means are lacking for distinguishing these two similar species. In this study, the differences in total phenolic content (TPC), total flavonoid content (TFC), and in vitro antioxidant activities in 21 batches of two species of cherries were compared. A comparative UPLC-QTOF/MS-based metabolomics coupled with three machine learning algorithms was established for differentiating the cherry species. The results demonstrated that P. tomentosa had higher TPC and TFC with average content differences of 12.07 times and 39.30 times, respectively, and depicted better antioxidant activity. Total of 104 differential compounds were identified by UPLC-QTOF/MS metabolomics. The major differential compounds were flavonoids, organooxygen compounds, and cinnamic acids and derivatives. Correlation analysis revealed differences in flavonoids content such as procyanidin B1 or isomer and (Epi)catechin. They could be responsible for differences in antioxidant activities between the two species. Among three machine learning algorithms, the prediction accuracy of support vector machine (SVM) was 85.7%, and those of random forest (RF) and back propagation neural network (BPNN) were 100%. BPNN exhibited better classification performance and higher prediction rate for all testing set samples than those of RF. The study herein found that P. tomentosa had higher nutritional value and biological functions, and thus considered for usage in health products. Machine models based on untargeted metabolomics can be effective tools for distinguishing these two species.
Collapse
Affiliation(s)
- Ziwei Wang
- Central Laboratory, School of Pharmacy, China Medical University, No.77 Puhe Road, Shenyang 110122, China
| | - Lin Zhou
- Department of Food, School of Public Health, Shenyang Medical College, Shenyang 110034, China
| | - Wenqian Hao
- Central Laboratory, School of Pharmacy, China Medical University, No.77 Puhe Road, Shenyang 110122, China
| | - Yu Liu
- Central Laboratory, School of Pharmacy, China Medical University, No.77 Puhe Road, Shenyang 110122, China
| | - Xia Xiao
- Central Laboratory, School of Pharmacy, China Medical University, No.77 Puhe Road, Shenyang 110122, China
| | - Xiao Shan
- Central Laboratory, School of Pharmacy, China Medical University, No.77 Puhe Road, Shenyang 110122, China
| | - Chenning Zhang
- Department of Pharmacy, Xiangyang No. 1 People's Hospital, Hubei University of Medicine, Xiangyang 441000, China.
| | - Binbin Wei
- Central Laboratory, School of Pharmacy, China Medical University, No.77 Puhe Road, Shenyang 110122, China.
| |
Collapse
|
62
|
Gorostiola González M, van den Broek RL, Braun TGM, Chatzopoulou M, Jespers W, IJzerman AP, Heitman LH, van Westen GJP. 3DDPDs: describing protein dynamics for proteochemometric bioactivity prediction. A case for (mutant) G protein-coupled receptors. J Cheminform 2023; 15:74. [PMID: 37641107 PMCID: PMC10463931 DOI: 10.1186/s13321-023-00745-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 08/10/2023] [Indexed: 08/31/2023] Open
Abstract
Proteochemometric (PCM) modelling is a powerful computational drug discovery tool used in bioactivity prediction of potential drug candidates relying on both chemical and protein information. In PCM features are computed to describe small molecules and proteins, which directly impact the quality of the predictive models. State-of-the-art protein descriptors, however, are calculated from the protein sequence and neglect the dynamic nature of proteins. This dynamic nature can be computationally simulated with molecular dynamics (MD). Here, novel 3D dynamic protein descriptors (3DDPDs) were designed to be applied in bioactivity prediction tasks with PCM models. As a test case, publicly available G protein-coupled receptor (GPCR) MD data from GPCRmd was used. GPCRs are membrane-bound proteins, which are activated by hormones and neurotransmitters, and constitute an important target family for drug discovery. GPCRs exist in different conformational states that allow the transmission of diverse signals and that can be modified by ligand interactions, among other factors. To translate the MD-encoded protein dynamics two types of 3DDPDs were considered: one-hot encoded residue-specific (rs) and embedding-like protein-specific (ps) 3DDPDs. The descriptors were developed by calculating distributions of trajectory coordinates and partial charges, applying dimensionality reduction, and subsequently condensing them into vectors per residue or protein, respectively. 3DDPDs were benchmarked on several PCM tasks against state-of-the-art non-dynamic protein descriptors. Our rs- and ps3DDPDs outperformed non-dynamic descriptors in regression tasks using a temporal split and showed comparable performance with a random split and in all classification tasks. Combinations of non-dynamic descriptors with 3DDPDs did not result in increased performance. Finally, the power of 3DDPDs to capture dynamic fluctuations in mutant GPCRs was explored. The results presented here show the potential of including protein dynamic information on machine learning tasks, specifically bioactivity prediction, and open opportunities for applications in drug discovery, including oncology.
Collapse
Affiliation(s)
- Marina Gorostiola González
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands
- ONCODE Institute, Leiden, The Netherlands
| | - Remco L van den Broek
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands
| | - Thomas G M Braun
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands
| | - Magdalini Chatzopoulou
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands
| | - Willem Jespers
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands
| | - Adriaan P IJzerman
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands
| | - Laura H Heitman
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands
- ONCODE Institute, Leiden, The Netherlands
| | - Gerard J P van Westen
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, Leiden, The Netherlands.
| |
Collapse
|
63
|
Kong X, Lin K, Wu G, Tao X, Zhai X, Lv L, Dong D, Zhu Y, Yang S. Machine Learning Techniques Applied to the Study of Drug Transporters. Molecules 2023; 28:5936. [PMID: 37630188 PMCID: PMC10459831 DOI: 10.3390/molecules28165936] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 07/27/2023] [Accepted: 08/02/2023] [Indexed: 08/27/2023] Open
Abstract
With the advancement of computer technology, machine learning-based artificial intelligence technology has been increasingly integrated and applied in the fields of medicine, biology, and pharmacy, thereby facilitating their development. Transporters have important roles in influencing drug resistance, drug-drug interactions, and tissue-specific drug targeting. The investigation of drug transporter substrates and inhibitors is a crucial aspect of pharmaceutical development. However, long duration and high expenses pose significant challenges in the investigation of drug transporters. In this review, we discuss the present situation and challenges encountered in applying machine learning techniques to investigate drug transporters. The transporters involved include ABC transporters (P-gp, BCRP, MRPs, and BSEP) and SLC transporters (OAT, OATP, OCT, MATE1,2-K, and NET). The aim is to offer a point of reference for and assistance with the progression of drug transporter research, as well as the advancement of more efficient computer technology. Machine learning methods are valuable and attractive for helping with the study of drug transporter substrates and inhibitors, but continuous efforts are still needed to develop more accurate and reliable predictive models and to apply them in the screening process of drug development to improve efficiency and success rates.
Collapse
Affiliation(s)
- Xiaorui Kong
- Department of Pharmacy, First Affiliated Hospital of Dalian Medical University, Dalian 116011, China; (X.K.); (K.L.); (X.T.); (X.Z.); (L.L.); (D.D.)
| | - Kexin Lin
- Department of Pharmacy, First Affiliated Hospital of Dalian Medical University, Dalian 116011, China; (X.K.); (K.L.); (X.T.); (X.Z.); (L.L.); (D.D.)
| | - Gaolei Wu
- Department of Pharmacy, Dalian Women and Children’s Medical Group, Dalian 116024, China;
| | - Xufeng Tao
- Department of Pharmacy, First Affiliated Hospital of Dalian Medical University, Dalian 116011, China; (X.K.); (K.L.); (X.T.); (X.Z.); (L.L.); (D.D.)
| | - Xiaohan Zhai
- Department of Pharmacy, First Affiliated Hospital of Dalian Medical University, Dalian 116011, China; (X.K.); (K.L.); (X.T.); (X.Z.); (L.L.); (D.D.)
| | - Linlin Lv
- Department of Pharmacy, First Affiliated Hospital of Dalian Medical University, Dalian 116011, China; (X.K.); (K.L.); (X.T.); (X.Z.); (L.L.); (D.D.)
| | - Deshi Dong
- Department of Pharmacy, First Affiliated Hospital of Dalian Medical University, Dalian 116011, China; (X.K.); (K.L.); (X.T.); (X.Z.); (L.L.); (D.D.)
| | - Yanna Zhu
- Department of Pharmacy, First Affiliated Hospital of Dalian Medical University, Dalian 116011, China; (X.K.); (K.L.); (X.T.); (X.Z.); (L.L.); (D.D.)
| | - Shilei Yang
- Department of Pharmacy, First Affiliated Hospital of Dalian Medical University, Dalian 116011, China; (X.K.); (K.L.); (X.T.); (X.Z.); (L.L.); (D.D.)
| |
Collapse
|
64
|
Elkashlan M, Ahmad RM, Hajar M, Al Jasmi F, Corchado JM, Nasarudin NA, Mohamad MS. A review of SARS-CoV-2 drug repurposing: databases and machine learning models. Front Pharmacol 2023; 14:1182465. [PMID: 37601065 PMCID: PMC10436567 DOI: 10.3389/fphar.2023.1182465] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 07/06/2023] [Indexed: 08/22/2023] Open
Abstract
The emergence of Severe Acute Respiratory Syndrome Corona Virus 2 (SARS-CoV-2) posed a serious worldwide threat and emphasized the urgency to find efficient solutions to combat the spread of the virus. Drug repurposing has attracted more attention than traditional approaches due to its potential for a time- and cost-effective discovery of new applications for the existing FDA-approved drugs. Given the reported success of machine learning (ML) in virtual drug screening, it is warranted as a promising approach to identify potential SARS-CoV-2 inhibitors. The implementation of ML in drug repurposing requires the presence of reliable digital databases for the extraction of the data of interest. Numerous databases archive research data from studies so that it can be used for different purposes. This article reviews two aspects: the frequently used databases in ML-based drug repurposing studies for SARS-CoV-2, and the recent ML models that have been developed for the prospective prediction of potential inhibitors against the new virus. Both types of ML models, Deep Learning models and conventional ML models, are reviewed in terms of introduction, methodology, and its recent applications in the prospective predictions of SARS-CoV-2 inhibitors. Furthermore, the features and limitations of the databases are provided to guide researchers in choosing suitable databases according to their research interests.
Collapse
Affiliation(s)
- Marim Elkashlan
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Al Ain, United Arab Emirates
| | - Rahaf M Ahmad
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Al Ain, United Arab Emirates
| | - Malak Hajar
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Al Ain, United Arab Emirates
| | - Fatma Al Jasmi
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Al Ain, United Arab Emirates
- Division of Metabolic Genetics, Department of Pediatrics, Tawam Hospital, Al Ain, United Arab Emirates
| | - Juan Manuel Corchado
- Departamento de Informática y Automática, Facultad de Ciencias, Grupo de Investigación BISITE, Instituto de Investigación Biomédica de Salamanca, University of Salamanca, Salamanca, Spain
| | - Nurul Athirah Nasarudin
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Al Ain, United Arab Emirates
| | - Mohd Saberi Mohamad
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Al Ain, United Arab Emirates
| |
Collapse
|
65
|
Du Y, Hua Z, Liu C, Lv R, Jia W, Su M. ATR-FTIR combined with machine learning for the fast non-targeted screening of new psychoactive substances. Forensic Sci Int 2023; 349:111761. [PMID: 37327724 DOI: 10.1016/j.forsciint.2023.111761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Revised: 05/15/2023] [Accepted: 06/06/2023] [Indexed: 06/18/2023]
Abstract
Due to the diversity and fast evolution of new psychoactive substances (NPS), both public health and safety are threatened around the world. Attenuated total reflection-Fourier transform infrared spectroscopy (ATR-FTIR), which serves as a simple and rapid technique for targeted NPS screening, is challenging with the rapid structural modifications of NPS. To achieve the fast non-targeted screening of NPS, six machine learning (ML) models were constructed to classify eight categories of NPS, including synthetic cannabinoids, synthetic cathinones, phenethylamines, fentanyl analogues, tryptamines, phencyclidine types, benzodiazepines, and "other substances" based on the 1099 IR spectra data items of 362 types of NPS collected by one desktop ATR-FTIR and two portable FTIR spectrometers. All these six ML classification models, including k-nearest neighbor (KNN), support vector machine (SVM), random forest (RF), extra trees (ET), voting, and artificial neural networks (ANNs) were trained through cross validation, and f1-scores of 0.87-1.00 were achieved. In addition, hierarchical cluster analysis (HCA) was performed on 100 synthetic cannabinoids with the most complex structural variation to investigate the structure-spectral property relationship, which leads to a summary of eight synthetic cannabinoid sub-categories with different "linked groups". ML models were also constructed to classify eight synthetic cannabinoid sub-categories. For the first time, this study developed six ML models, which were suitable for both desktop and portable spectrometers, to classify eight categories of NPS and eight synthetic cannabinoids sub-categories. These models can be applied for the fast, accurate, cost-effective, and on-site non-targeted screening of newly emerging NPS with no reference data available.
Collapse
Affiliation(s)
- Yu Du
- China Pharmaceutical University, Nanjing 210009, Jiangsu, PR China
| | - Zhendong Hua
- Key Laboratory of Drug Monitoring and Control, Drug Intelligence and Forensic Center, Ministry of Public Security, PR China; National Anti-Drug Laboratory of China, Beijing 100193, PR China
| | - Cuimei Liu
- Key Laboratory of Drug Monitoring and Control, Drug Intelligence and Forensic Center, Ministry of Public Security, PR China; National Anti-Drug Laboratory of China, Beijing 100193, PR China.
| | - Rulin Lv
- College of Forensic Science, People's Public Security University of China, Beijing, PR China
| | - Wei Jia
- Key Laboratory of Drug Monitoring and Control, Drug Intelligence and Forensic Center, Ministry of Public Security, PR China; National Anti-Drug Laboratory of China, Beijing 100193, PR China
| | - Mengxiang Su
- China Pharmaceutical University, Nanjing 210009, Jiangsu, PR China.
| |
Collapse
|
66
|
Zhao J, Shi X, Wang Z, Xiong S, Lin Y, Wei X, Li Y, Tang X. Hepatotoxicity assessment investigations on PFASs targeting L-FABP using binding affinity data and machine learning-based QSAR model. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2023; 262:115310. [PMID: 37523843 DOI: 10.1016/j.ecoenv.2023.115310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 07/23/2023] [Accepted: 07/27/2023] [Indexed: 08/02/2023]
Abstract
Per- and polyfluoroalkyl substances (PFASs) are persistent organic pollutants that have been detected in various environmental media and human serum, but their safety assessment remains challenging. PFASs may accumulate in liver tissues and cause hepatotoxicity by binding to liver fatty acid binding protein (L-FABP). Therefore, evaluating the binding affinity of PFASs to L-FABP is crucial in assessing the potential hepatotoxic effects. In this study, two binding sites of L-FABP were evaluated, results suggested that the outer site possessed high affinity to polyfluoroalkyl sulfates and the inner site preferred perfluoroalkyl sulfonamides, overall, the inner site of L-FABP was more sensitive to PFASs. The binding affinity data of PFASs to L-FABP were used as training set to develop a machine learning model-based quantitative structure-activity relationship (QSAR) for efficient prediction of potentially hazardous PFASs. Further Bayesian Kernel Machine Regression (BKMR) model disclosed flexibility as the determinant molecular property on PFASs-induced hepatotoxicity. It can influence affinity of PFASs to target protein through affecting binding conformations directly (individual effect) as well as integrating with other molecular properties (joint effect). Our present work provided more understanding on hepatotoxicity of PFASs, which could be significative in hepatotoxicity gradation, administration guidance, and safer alternatives development of PFASs.
Collapse
Affiliation(s)
- Jiayi Zhao
- Department of Medical Chemistry, School of Pharmacy, Qingdao University, Qingdao 266071, China; Department of Occupational and Environmental Health, School of Public Health, Qingdao University, Qingdao 266071, China
| | - Xiaoyue Shi
- Department of Occupational and Environmental Health, School of Public Health, Qingdao University, Qingdao 266071, China
| | - Zhiqin Wang
- Department of Occupational and Environmental Health, School of Public Health, Qingdao University, Qingdao 266071, China
| | - Sijie Xiong
- Department of Medical Chemistry, School of Pharmacy, Qingdao University, Qingdao 266071, China
| | - Yongfeng Lin
- Department of Occupational and Environmental Health, School of Public Health, Qingdao University, Qingdao 266071, China
| | - Xiaoran Wei
- Department of Occupational and Environmental Health, School of Public Health, Qingdao University, Qingdao 266071, China
| | - Yanwei Li
- Environment Research Institute, Shandong University, Qingdao 266237, China
| | - Xiaowen Tang
- Department of Medical Chemistry, School of Pharmacy, Qingdao University, Qingdao 266071, China.
| |
Collapse
|
67
|
Zhu Q, Gao S, Xiao B, He Z, Hu S. Plasmer: an Accurate and Sensitive Bacterial Plasmid Prediction Tool Based on Machine Learning of Shared k-mers and Genomic Features. Microbiol Spectr 2023; 11:e0464522. [PMID: 37191574 PMCID: PMC10269668 DOI: 10.1128/spectrum.04645-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 04/26/2023] [Indexed: 05/17/2023] Open
Abstract
Identification of plasmids in bacterial genomes is critical for many factors, including horizontal gene transfer, antibiotic resistance genes, host-microbe interactions, cloning vectors, and industrial production. There are several in silico methods to predict plasmid sequences in assembled genomes. However, existing methods have evident shortcomings, such as unbalance in sensitivity and specificity, dependency on species-specific models, and performance reduction in sequences shorter than 10 kb, which has limited their scope of applicability. In this work, we proposed Plasmer, a novel plasmid predictor based on machine-learning of shared k-mers and genomic features. Unlike existing k-mer or genomic-feature based methods, Plasmer employs the random forest algorithm to make predictions using the percent of shared k-mers with plasmid and chromosome databases combined with other genomic features, including alignment E value and replicon distribution scores (RDS). Plasmer can predict on multiple species and has achieved an average the area under the curve (AUC) of 0.996 with accuracy of 98.4%. Compared to existing methods, tests of both sliding sequences and simulated and de novo assemblies have consistently shown that Plasmer has outperforming accuracy and stable performance across long and short contigs above 500 bp, demonstrating its applicability for fragmented assemblies. Plasmer also has excellent and balanced performance on both sensitivity and specificity (both >0.95 above 500 bp) with the highest F1-score, which has eliminated the bias on sensitivity or specificity that was common in existing methods. Plasmer also provides taxonomy classification to help identify the origin of plasmids. IMPORTANCE In this study, we proposed a novel plasmid prediction tool named Plasmer. Technically, unlike existing k-mer or genomic features-based methods, Plasmer is the first tool to combine the advantages of the percent of shared k-mers and the alignment score of genomic features. This has given Plasmer (i) evident improvement in performance compared to other methods, with the best F1-score and accuracy on sliding sequences, simulated contigs, and de novo assemblies; (ii) applicability for contigs above 500 bp with highest accuracy, enabling plasmid prediction in fragmented short-read assemblies; (iii) excellent and balanced performance between sensitivity and specificity (both >0.95 above 500 bp) with the highest F1-score, which eliminated the bias on sensitivity or specificity that commonly existed in other methods; and (iv) no dependency of species-specific training models. We believe that Plasmer provides a more reliable alternative for plasmid prediction in bacterial genome assemblies.
Collapse
Affiliation(s)
- Qianhui Zhu
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Shenghan Gao
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Binghan Xiao
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- Sino-Danish College, University of Chinese Academy of Sciences, Beijing, China
| | - Zilong He
- School of Engineering Medicine, Beihang University, Beijing, China
- Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, Interdisciplinary Innovation Institute of Medicine and Engineering, Beihang University, Beijing, China
| | - Songnian Hu
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- Sino-Danish College, University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
68
|
Bo T, Lin Y, Han J, Hao Z, Liu J. Machine learning-assisted data filtering and QSAR models for prediction of chemical acute toxicity on rat and mouse. JOURNAL OF HAZARDOUS MATERIALS 2023; 452:131344. [PMID: 37027914 DOI: 10.1016/j.jhazmat.2023.131344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 03/20/2023] [Accepted: 03/31/2023] [Indexed: 05/03/2023]
Abstract
Machine learning (ML) methods provide a new opportunity to build quantitative structure-activity relationship (QSAR) models for predicting chemicals' toxicity based on large toxicity data sets, but they are limited in insufficient model robustness due to poor data set quality for chemicals with certain structures. To address this issue and improve model robustness, we built a large data set on rat oral acute toxicity for thousands of chemicals, then used ML to filter chemicals favorable for regression models (CFRM). In comparison to chemicals not favorable for regression models (CNRM), CFRM accounted for 67% of chemicals in the original data set, and had a higher structural similarity and a smaller toxicity distribution in 2-4 log10 (mg/kg). The performance of established regression models for CFRM was greatly improved, with root-mean-square deviations (RMSE) in the range of 0.45-0.48 log10 (mg/kg). Classification models were built for CNRM using all chemicals in the original data set, and the area under receiver operating characteristic (AUROC) reached 0.75-0.76. The proposed strategy was successfully applied to a mouse oral acute data set, yielding RMSE and AUROC in the range of 0.36-0.38 log10 (mg/kg) and 0.79, respectively.
Collapse
Affiliation(s)
- Tao Bo
- School of Environment, Hangzhou Institute for Advanced Study, UCAS, Hangzhou 310024, China; State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, P.O. Box 2871, Beijing 100085, China
| | - Yaohui Lin
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, P.O. Box 2871, Beijing 100085, China; Key Laboratory for Analytical Science of Food Safety and Biology of MOE, Fujian Provincial Key Lab of Analysis and Detection for Food Safety, College of Chemistry, Fuzhou University, Fuzhou, Fujian 350116, China
| | - Jinglong Han
- State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology Shenzhen, Shenzhen 518055, China
| | - Zhineng Hao
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, P.O. Box 2871, Beijing 100085, China.
| | - Jingfu Liu
- School of Environment, Hangzhou Institute for Advanced Study, UCAS, Hangzhou 310024, China; State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, P.O. Box 2871, Beijing 100085, China.
| |
Collapse
|
69
|
Lunghini F, Fava A, Pisapia V, Sacco F, Iaconis D, Beccari AR. ProfhEX: AI-based platform for small molecules liability profiling. J Cheminform 2023; 15:60. [PMID: 37296454 DOI: 10.1186/s13321-023-00728-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 05/28/2023] [Indexed: 06/12/2023] Open
Abstract
Off-target drug interactions are a major reason for candidate failure in the drug discovery process. Anticipating potential drug's adverse effects in the early stages is necessary to minimize health risks to patients, animal testing, and economical costs. With the constantly increasing size of virtual screening libraries, AI-driven methods can be exploited as first-tier screening tools to provide liability estimation for drug candidates. In this work we present ProfhEX, an AI-driven suite of 46 OECD-compliant machine learning models that can profile small molecules on 7 relevant liability groups: cardiovascular, central nervous system, gastrointestinal, endocrine, renal, pulmonary and immune system toxicities. Experimental affinity data was collected from public and commercial data sources. The entire chemical space comprised 289'202 activity data for a total of 210'116 unique compounds, spanning over 46 targets with dataset sizes ranging from 819 to 18896. Gradient boosting and random forest algorithms were initially employed and ensembled for the selection of a champion model. Models were validated according to the OECD principles, including robust internal (cross validation, bootstrap, y-scrambling) and external validation. Champion models achieved an average Pearson correlation coefficient of 0.84 (SD of 0.05), an R2 determination coefficient of 0.68 (SD = 0.1) and a root mean squared error of 0.69 (SD of 0.08). All liability groups showed good hit-detection power with an average enrichment factor at 5% of 13.1 (SD of 4.5) and AUC of 0.92 (SD of 0.05). Benchmarking against already existing tools demonstrated the predictive power of ProfhEX models for large-scale liability profiling. This platform will be further expanded with the inclusion of new targets and through complementary modelling approaches, such as structure and pharmacophore-based models. ProfhEX is freely accessible at the following address: https://profhex.exscalate.eu/ .
Collapse
Affiliation(s)
- Filippo Lunghini
- EXSCALATE, Dompé Farmaceutici SpA, Via Tommaso de Amicis 95, 80123, Naples, Italy
| | - Anna Fava
- EXSCALATE, Dompé Farmaceutici SpA, Via Tommaso de Amicis 95, 80123, Naples, Italy
| | - Vincenzo Pisapia
- Professional Service Department, SAS Institute, Via Darwin 20/22, 20143, Milan, Italy
| | - Francesco Sacco
- Professional Service Department, SAS Institute, Via Darwin 20/22, 20143, Milan, Italy
| | - Daniela Iaconis
- EXSCALATE, Dompé Farmaceutici SpA, Via Tommaso de Amicis 95, 80123, Naples, Italy
| | | |
Collapse
|
70
|
Hou R, Xie C, Gui Y, Li G, Li X. Machine-Learning-Based Data Analysis Method for Cell-Based Selection of DNA-Encoded Libraries. ACS OMEGA 2023; 8:19057-19071. [PMID: 37273617 PMCID: PMC10233830 DOI: 10.1021/acsomega.3c02152] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
DNA-encoded library (DEL) is a powerful ligand discovery technology that has been widely adopted in the pharmaceutical industry. DEL selections are typically performed with a purified protein target immobilized on a matrix or in solution phase. Recently, DELs have also been used to interrogate the targets in the complex biological environment, such as membrane proteins on live cells. However, due to the complex landscape of the cell surface, the selection inevitably involves significant nonspecific interactions, and the selection data are much noisier than the ones with purified proteins, making reliable hit identification highly challenging. Researchers have developed several approaches to denoise DEL datasets, but it remains unclear whether they are suitable for cell-based DEL selections. Here, we report the proof-of-principle of a new machine-learning (ML)-based approach to process cell-based DEL selection datasets by using a Maximum A Posteriori (MAP) estimation loss function, a probabilistic framework that can account for and quantify uncertainties of noisy data. We applied the approach to a DEL selection dataset, where a library of 7,721,415 compounds was selected against a purified carbonic anhydrase 2 (CA-2) and a cell line expressing the membrane protein carbonic anhydrase 12 (CA-12). The extended-connectivity fingerprint (ECFP)-based regression model using the MAP loss function was able to identify true binders and also reliable structure-activity relationship (SAR) from the noisy cell-based selection datasets. In addition, the regularized enrichment metric (known as MAP enrichment) could also be calculated directly without involving the specific machine-learning model, effectively suppressing low-confidence outliers and enhancing the signal-to-noise ratio. Future applications of this method will focus on de novo ligand discovery from cell-based DEL selections.
Collapse
Affiliation(s)
- Rui Hou
- Department
of Chemistry and State Key Laboratory of Synthetic Chemistry, The University of Hong Kong, Hong Kong SAR, China
- Laboratory
for Synthetic Chemistry and Chemical Biology LimitedHealth@InnoHK, Innovation and Technology Commission, Hong Kong SAR, China
| | - Chao Xie
- Department
of Chemistry and State Key Laboratory of Synthetic Chemistry, The University of Hong Kong, Hong Kong SAR, China
| | - Yuhan Gui
- Department
of Chemistry and State Key Laboratory of Synthetic Chemistry, The University of Hong Kong, Hong Kong SAR, China
| | - Gang Li
- Institute
of Systems and Physical Biology, Shenzhen Bay Laboratory, Shenzhen 518132, China
| | - Xiaoyu Li
- Department
of Chemistry and State Key Laboratory of Synthetic Chemistry, The University of Hong Kong, Hong Kong SAR, China
- Laboratory
for Synthetic Chemistry and Chemical Biology LimitedHealth@InnoHK, Innovation and Technology Commission, Hong Kong SAR, China
| |
Collapse
|
71
|
Guha R, Velegol D. Harnessing Shannon entropy-based descriptors in machine learning models to enhance the prediction accuracy of molecular properties. J Cheminform 2023; 15:54. [PMID: 37211605 DOI: 10.1186/s13321-023-00712-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 03/18/2023] [Indexed: 05/23/2023] Open
Abstract
Accurate prediction of molecular properties is essential in the screening and development of drug molecules and other functional materials. Traditionally, property-specific molecular descriptors are used in machine learning models. This in turn requires the identification and development of target or problem-specific descriptors. Additionally, an increase in the prediction accuracy of the model is not always feasible from the standpoint of targeted descriptor usage. We explored the accuracy and generalizability issues using a framework of Shannon entropies, based on SMILES, SMARTS and/or InChiKey strings of respective molecules. Using various public databases of molecules, we showed that the accuracy of the prediction of machine learning models could be significantly enhanced simply by using Shannon entropy-based descriptors evaluated directly from SMILES. Analogous to partial pressures and total pressure of gases in a mixture, we used atom-wise fractional Shannon entropy in combination with total Shannon entropy from respective tokens of the string representation to model the molecule efficiently. The proposed descriptor was competitive in performance with standard descriptors such as Morgan fingerprints and SHED in regression models. Additionally, we found that either a hybrid descriptor set containing the Shannon entropy-based descriptors or an optimized, ensemble architecture of multilayer perceptrons and graph neural networks using the Shannon entropies was synergistic to improve the prediction accuracy. This simple approach of coupling the Shannon entropy framework to other standard descriptors and/or using it in ensemble models could find applications in boosting the performance of molecular property predictions in chemistry and material science.
Collapse
Affiliation(s)
- Rajarshi Guha
- Intel Corporation, 2501 NE Century Blvd, Hillsboro, OR, 97124, USA.
| | - Darrell Velegol
- Department of Chemical Engineering, Pennsylvania State University, University Park, PA, 16802, USA
| |
Collapse
|
72
|
Kao PY, Yang YC, Chiang WY, Hsiao JY, Cao Y, Aliper A, Ren F, Aspuru-Guzik A, Zhavoronkov A, Hsieh MH, Lin YC. Exploring the Advantages of Quantum Generative Adversarial Networks in Generative Chemistry. J Chem Inf Model 2023. [PMID: 37171372 DOI: 10.1021/acs.jcim.3c00562] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
De novo drug design with desired biological activities is crucial for developing novel therapeutics for patients. The drug development process is time- and resource-consuming, and it has a low probability of success. Recent advances in machine learning and deep learning technology have reduced the time and cost of the discovery process and therefore, improved pharmaceutical research and development. In this paper, we explore the combination of two rapidly developing fields with lead candidate discovery in the drug development process. First, artificial intelligence has already been demonstrated to successfully accelerate conventional drug design approaches. Second, quantum computing has demonstrated promising potential in different applications, such as quantum chemistry, combinatorial optimizations, and machine learning. This article explores hybrid quantum-classical generative adversarial networks (GAN) for small molecule discovery. We substituted each element of GAN with a variational quantum circuit (VQC) and demonstrated the quantum advantages in the small drug discovery. Utilizing a VQC in the noise generator of a GAN to generate small molecules achieves better physicochemical properties and performance in the goal-directed benchmark than the classical counterpart. Moreover, we demonstrate the potential of a VQC with only tens of learnable parameters in the generator of GAN to generate small molecules. We also demonstrate the quantum advantage of a VQC in the discriminator of GAN. In this hybrid model, the number of learnable parameters is significantly less than the classical ones, and it can still generate valid molecules. The hybrid model with only tens of training parameters in the quantum discriminator outperforms the MLP-based one in terms of both generated molecule properties and the achieved KL divergence. However, the hybrid quantum-classical GANs still face challenges in generating unique and valid molecules compared to their classical counterparts.
Collapse
Affiliation(s)
- Po-Yu Kao
- Insilico Medicine Taiwan Ltd., Taipei 110208, Taiwan
| | - Ya-Chu Yang
- Insilico Medicine Taiwan Ltd., Taipei 110208, Taiwan
| | - Wei-Yin Chiang
- Hon Hai (Foxconn) Research Institute, Taipei 114699, Taiwan
| | - Jen-Yueh Hsiao
- Hon Hai (Foxconn) Research Institute, Taipei 114699, Taiwan
| | - Yudong Cao
- Zapata Computing, Inc., Boston, Massachusetts 02110, United States
| | - Alex Aliper
- Insilico Medicine AI Limited, Masdar City, Abu Dhabi 145748, UAE
| | - Feng Ren
- Insilico Medicine Shanghai Ltd., Shanghai 201203, China
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, Toronto, ON M5S 3H6, Canada
- Department of Computer Science, University of Toronto, Toronto, ON M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON M5S 1M1, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research, Toronto, ON M5S 1M1, Canada
| | | | - Min-Hsiu Hsieh
- Hon Hai (Foxconn) Research Institute, Taipei 114699, Taiwan
| | - Yen-Chu Lin
- Insilico Medicine Taiwan Ltd., Taipei 110208, Taiwan
- Department of Pharmacy, National Yang Ming Chiao Tung University, Taipei 112304, Taiwan
| |
Collapse
|
73
|
Li X, Wang H, Jiang M, Ding M, Xu X, Xu B, Zou Y, Yu Y, Yang W. Collision Cross Section Prediction Based on Machine Learning. Molecules 2023; 28:molecules28104050. [PMID: 37241791 DOI: 10.3390/molecules28104050] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 05/10/2023] [Accepted: 05/10/2023] [Indexed: 05/28/2023] Open
Abstract
Ion mobility-mass spectrometry (IM-MS) is a powerful separation technique providing an additional dimension of separation to support the enhanced separation and characterization of complex components from the tissue metabolome and medicinal herbs. The integration of machine learning (ML) with IM-MS can overcome the barrier to the lack of reference standards, promoting the creation of a large number of proprietary collision cross section (CCS) databases, which help to achieve the rapid, comprehensive, and accurate characterization of the contained chemical components. In this review, advances in CCS prediction using ML in the past 2 decades are summarized. The advantages of ion mobility-mass spectrometers and the commercially available ion mobility technologies with different principles (e.g., time dispersive, confinement and selective release, and space dispersive) are introduced and compared. The general procedures involved in CCS prediction based on ML (acquisition and optimization of the independent and dependent variables, model construction and evaluation, etc.) are highlighted. In addition, quantum chemistry, molecular dynamics, and CCS theoretical calculations are also described. Finally, the applications of CCS prediction in metabolomics, natural products, foods, and the other research fields are reflected.
Collapse
Affiliation(s)
- Xiaohang Li
- State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
- Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
| | - Hongda Wang
- State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
- Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
| | - Meiting Jiang
- State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
- Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
| | - Mengxiang Ding
- State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
- Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
| | - Xiaoyan Xu
- State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
- Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
| | - Bei Xu
- State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
- Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
| | - Yadan Zou
- State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
- Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
| | - Yuetong Yu
- State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
| | - Wenzhi Yang
- State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
- Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
| |
Collapse
|
74
|
Nemoto S, Mizuno T, Kusuhara H. Investigation of chemical structure recognition by encoder-decoder models in learning progress. J Cheminform 2023; 15:45. [PMID: 37046349 PMCID: PMC10100163 DOI: 10.1186/s13321-023-00713-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 03/18/2023] [Indexed: 04/14/2023] Open
Abstract
Descriptor generation methods using latent representations of encoder-decoder (ED) models with SMILES as input are useful because of the continuity of descriptor and restorability to the structure. However, it is not clear how the structure is recognized in the learning progress of ED models. In this work, we created ED models of various learning progress and investigated the relationship between structural information and learning progress. We showed that compound substructures were learned early in ED models by monitoring the accuracy of downstream tasks and input-output substructure similarity using substructure-based descriptors, which suggests that existing evaluation methods based on the accuracy of downstream tasks may not be sensitive enough to evaluate the performance of ED models with SMILES as descriptor generation methods. On the other hand, we showed that structure restoration was time-consuming, and in particular, insufficient learning led to the estimation of a larger structure than the actual one. It can be inferred that determining the endpoint of the structure is a difficult task for the model. To our knowledge, this is the first study to link the learning progress of SMILES by ED model to chemical structures for a wide range of chemicals.
Collapse
Affiliation(s)
- Shumpei Nemoto
- Department of Pharmaceutical Sciences, The University of Tokyo, Bunkyo, Tokyo, Japan
| | - Tadahaya Mizuno
- Department of Pharmaceutical Sciences, The University of Tokyo, Bunkyo, Tokyo, Japan.
| | - Hiroyuki Kusuhara
- Department of Pharmaceutical Sciences, The University of Tokyo, Bunkyo, Tokyo, Japan
| |
Collapse
|
75
|
Gholampour M, Seradj H, Sakhteman A. Structure-Selectivity Relationship Prediction of Tau Imaging Tracers Using Machine Learning-Assisted QSAR Models and Interaction Fingerprint Map. ACS Chem Neurosci 2023. [PMID: 37037183 DOI: 10.1021/acschemneuro.3c00038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2023] Open
Abstract
Protein aggregates composed of tau fibrils are major pathologic findings in different tauopathies. An ideal agent for imaging tau fibrils must be highly selective. The molecular basis for the binding of current available compounds to tau aggregates is not well understood. Herein, we provide insights into previously studied positron emission tomography tracers using various computational methods, including machine learning-based quantitative structure-activity relationship (QSAR) classification, docking, and molecular dynamics (MD) simulations to investigate the structural basis of selective tau aggregate binding for potential compounds. The QSAR classification model based on the Random Forest algorithm with an accuracy of 96.6% for the selective and 97.6% for the nonselective class of compounds revealed essential selective moieties. The combination of molecular docking, MD simulations, and molecular mechanics Poisson-Boltzmann surface area (MM/PBSA) binding free-energy calculation showed superior binding energy of ligand 63 toward tau and PHF6, a key hexapeptide in tau aggregation, as the most selective compound in the data set. Dissecting the binding properties of ligand 63 and ligand 8 (the least selective compound) within tau and Aβ structures confirmed that these two compounds favor different binding sites of tau; however, the preferential binding site in Aβ was similar for both with lower binding energies calculated for ligand 8. Results revealed that the number of N-heterocycles, the position of nitrogen atoms, and the presence of tertiary amine are important components of selective binding moieties, and they should be maintained in molecules for selective binding to tau aggregates. The predicted structure-selectivity relationship will facilitate the rational design and further development of selective tau imaging agents.
Collapse
Affiliation(s)
- Maryam Gholampour
- Department of Medicinal Chemistry, Faculty of Pharmacy, Shiraz University of Medical Sciences, Shiraz 71468-64685, Iran
| | - Hassan Seradj
- Department of Medicinal Chemistry, Faculty of Pharmacy, Shiraz University of Medical Sciences, Shiraz 71468-64685, Iran
| | - Amirhossein Sakhteman
- Chair of Proteomics and Bioanalytics, Technical University of Munich (TUM), Freising 85354, Germany
| |
Collapse
|
76
|
Zhang T, Mo Q, Jiang N, Wu Y, Yang X, Chen W, Li Q, Yang S, Yang J, Zeng J, Huang F, Huang Q, Luo J, Wu J, Wang L. The combination of machine learning and transcriptomics reveals a novel megakaryopoiesis inducer, MO-A, that promotes thrombopoiesis by activating FGF1/FGFR1/PI3K/Akt/NF-κB signaling. Eur J Pharmacol 2023; 944:175604. [PMID: 36804544 DOI: 10.1016/j.ejphar.2023.175604] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 01/20/2023] [Accepted: 02/16/2023] [Indexed: 02/19/2023]
Abstract
Radiation-induced thrombocytopenia (RIT) occurs widely and causes high mortality and morbidity in cancer patients who receive radiotherapy. However, specific drugs for treating RIT remain woefully inadequate. Here, we first developed a drug screening model using naive Bayes, a machine learning (ML) algorithm, to virtually screen the active compounds promoting megakaryopoiesis and thrombopoiesis. A natural product library was screened by the model, and methylophiopogonanone A (MO-A) was identified as the most active compound. The activity of MO-A was then validated in vitro and showed that MO-A could markedly induce megakaryocyte (MK) differentiation of K562 and Meg-01 cells in a concentration-dependent manner. Furthermore, the therapeutic action of MO-A on RIT was evaluated, and MO-A significantly accelerated platelet level recovery, platelet activation, megakaryopoiesis, MK differentiation in RIT mice. Moreover, RNA-sequencing (RNA-seq) indicated that the PI3K cascade was closely related to MK differentiation induced by MO-A. Finally, experimental verification demonstrated that MO-A obviously induced the expression of FGF1 and FGFR1, and increased the phosphorylation of PI3K, Akt and NF-κB. Blocking FGFR1 with its inhibitor dovitinib suppressed MO-A-induced MK differentiation, and PI3K, Akt and NF-κB phosphorylation. Similarly, inhibition of PI3K-Akt signal pathway by its inhibitor LY294002 suppressed MK differentiation, and PI3K, Akt and NF-κB phosphorylation induced by MO-A. Taken together, our study provides an efficient drug discovery strategy for hematological diseases, and demonstrates that MO-A is a novel countermeasure for treating RIT through activation of the FGF1/FGFR1/PI3K/Akt/NF-κB signaling pathway.
Collapse
Affiliation(s)
- Ting Zhang
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
| | - Qi Mo
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
| | - Nan Jiang
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
| | - Yuesong Wu
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
| | - Xin Yang
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
| | - Wang Chen
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
| | - Qinyao Li
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
| | - Shuo Yang
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
| | - Jing Yang
- Department of Pharmacy, Chengdu Fifth People's Hospital, Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan, 611137, China
| | - Jing Zeng
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
| | - Feihong Huang
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
| | - Qianqian Huang
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
| | - Jiesi Luo
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China.
| | - Jianming Wu
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China; School of Basic Medical Sciences, Southwest Medical University, Luzhou, Sichuan, 646000, China; Education Ministry Key Laboratory of Medical Electrophysiology, Sichuan Key Medical Laboratory of New Drug Discovery and Druggability Evaluation, Luzhou Key Laboratory of Activity Screening and Druggability Evaluation for Chinese Materia Medica, Southwest Medical University, Luzhou, Sichuan, 646000, China.
| | - Long Wang
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China.
| |
Collapse
|
77
|
Jaramillo DN, Millán D, Guevara-Pulido J. Design, synthesis and cytotoxic evaluation of a selective serotonin reuptake inhibitor (SSRI) by virtual screening. Eur J Pharm Sci 2023; 183:106403. [PMID: 36758772 DOI: 10.1016/j.ejps.2023.106403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 01/24/2023] [Accepted: 02/06/2023] [Indexed: 02/11/2023]
Abstract
Depression is one of the most common mental illnesses, affecting almost 300 million people. According to the WHO, depression is one of the world's leading causes of disability and morbidity. People with this illness require both psychological and pharmaceutical treatment because severe depressive episodes often result in suicide. Selective serotonin reuptake inhibitors (SSRI) are widely used antidepressants that target the human serotonin transporter (hSERT). The crystallization of hSERT and the experimental data available allows cost and time-efficient computational tools like virtual screening (VS) to be utilized in the development of therapeutic agents. Here, we synthesized, characterized, and evaluated the biological activity of a novel SSRI analog of paroxetine, rationally designed by applying an artificial neural network-based QSAR model and a molecular docking analysis on hSERT. The analog N-substituted 18a showed higher affinity for the transporter (-10.2 kcal/mol), lower Ki value (1.19 nM) and a safer toxicological profile than paroxetine and was synthesized with a 71% yield. The in vitro cytotoxicity of the analog was evaluated using human glioblastoma (U87 MG), human neuroblastoma (SH SY5Y) and murine fibroblast (L929) cell lines. Also, the hemolytic ability of the compound was assessed on human erythrocytes. Results showed that analog 18a did not exhibit cytotoxic activity on the cell lines used and has no hemolytic activity at any of the concentrations tested, whereas with paroxetine, hemolysis was observed at 2.3, 1.29 y 0.67 mM. Based on these results, it is possible to suggest that analog 18a could be a promising new SSRI candidate for the treatment of this illness.
Collapse
Affiliation(s)
- Deissy N Jaramillo
- INQA, Applied Chemistry Research Group- Faculty of Chemistry, Universidad El Bosque, Bogotá, Colombia
| | - Diana Millán
- GIBAT, Basic and Traslational Research Group - Faculty of Medicine, Universidad El Bosque, Bogotá, Colombia
| | - James Guevara-Pulido
- INQA, Applied Chemistry Research Group- Faculty of Chemistry, Universidad El Bosque, Bogotá, Colombia.
| |
Collapse
|
78
|
Lien ST, Lin TE, Hsieh JH, Sung TY, Chen JH, Hsu KC. Establishment of extensive artificial intelligence models for kinase inhibitor prediction: Identification of novel PDGFRB inhibitors. Comput Biol Med 2023; 156:106722. [PMID: 36878123 DOI: 10.1016/j.compbiomed.2023.106722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 02/16/2023] [Accepted: 02/26/2023] [Indexed: 03/06/2023]
Abstract
Identifying hit compounds is an important step in drug development. Unfortunately, this process continues to be a challenging task. Several machine learning models have been generated to aid in simplifying and improving the prediction of candidate compounds. Models tuned for predicting kinase inhibitors have been established. However, an effective model can be limited by the size of the chosen training dataset. In this study, we tested several machine learning models to predict potential kinase inhibitors. A dataset was curated from a number of publicly available repositories. This resulted in a comprehensive dataset covering more than half of the human kinome. More than 2,000 kinase models were established using different model approaches. The performances of the models were compared, and the Keras-MLP model was determined to be the best performing model. The model was then used to screen a chemical library for potential inhibitors targeting platelet-derived growth factor receptor-β (PDGFRB). Several PDGFRB candidates were selected, and in vitro assays confirmed four compounds with PDGFRB inhibitory activity and IC50 values in the nanomolar range. These results show the effectiveness of machine learning models trained on the reported dataset. This report would aid in the establishment of machine learning models as well as in the discovery of novel kinase inhibitors.
Collapse
Affiliation(s)
- Ssu-Ting Lien
- Graduate Institute of Cancer Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
| | - Tony Eight Lin
- Graduate Institute of Cancer Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; Ph.D. Program for Cancer Molecular Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
| | - Jui-Hua Hsieh
- Division of Translational Toxicology, National Institute of Environmental Health Sciences, National Institutes of Health, Durham, NC, USA
| | - Tzu-Ying Sung
- Biomedical Translation Research Center, Academia Sinica, Taipei, Taiwan
| | - Jun-Hong Chen
- Graduate Institute of Cancer Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
| | - Kai-Cheng Hsu
- Graduate Institute of Cancer Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; Ph.D. Program for Cancer Molecular Biology and Drug Discovery, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan; Ph.D. Program in Drug Discovery and Development Industry, College of Pharmacy, Taipei Medical University, Taipei, Taiwan; Cancer Center, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan; TMU Research Center of Cancer Translational Medicine, Taipei Medical University, Taipei, Taiwan; TMU Research Center of Drug Discovery, Taipei Medical University, Taipei, Taiwan.
| |
Collapse
|
79
|
Verhaegen F, Butterworth KT, Chalmers AJ, Coppes RP, de Ruysscher D, Dobiasch S, Fenwick JD, Granton PV, Heijmans SHJ, Hill MA, Koumenis C, Lauber K, Marples B, Parodi K, Persoon LCGG, Staut N, Subiel A, Vaes RDW, van Hoof S, Verginadis IL, Wilkens JJ, Williams KJ, Wilson GD, Dubois LJ. Roadmap for precision preclinical x-ray radiation studies. Phys Med Biol 2023; 68:06RM01. [PMID: 36584393 DOI: 10.1088/1361-6560/acaf45] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Accepted: 12/30/2022] [Indexed: 12/31/2022]
Abstract
This Roadmap paper covers the field of precision preclinical x-ray radiation studies in animal models. It is mostly focused on models for cancer and normal tissue response to radiation, but also discusses other disease models. The recent technological evolution in imaging, irradiation, dosimetry and monitoring that have empowered these kinds of studies is discussed, and many developments in the near future are outlined. Finally, clinical translation and reverse translation are discussed.
Collapse
Affiliation(s)
- Frank Verhaegen
- MAASTRO Clinic, Radiotherapy Division, GROW-School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, The Netherlands
- SmART Scientific Solutions BV, Maastricht, The Netherlands
| | - Karl T Butterworth
- Patrick G. Johnston, Centre for Cancer Research, Queen's University Belfast, Belfast, Northern Ireland, United Kingdom
| | - Anthony J Chalmers
- School of Cancer Sciences, University of Glasgow, Glasgow G61 1QH, United Kingdom
| | - Rob P Coppes
- Departments of Biomedical Sciences of Cells & Systems, Section Molecular Cell Biology and Radiation Oncology, University Medical Center Groningen, University of Groningen, 9700 AD Groningen, The Netherlands
| | - Dirk de Ruysscher
- MAASTRO Clinic, Radiotherapy Division, GROW-School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, The Netherlands
| | - Sophie Dobiasch
- Department of Radiation Oncology, Technical University of Munich (TUM), School of Medicine and Klinikum rechts der Isar, Germany
- Department of Medical Physics, Institute of Radiation Medicine (IRM), Department of Radiation Sciences (DRS), Helmholtz Zentrum München, Germany
| | - John D Fenwick
- Department of Medical Physics & Biomedical Engineering University College LondonMalet Place Engineering Building, London WC1E 6BT, United Kingdom
| | | | | | - Mark A Hill
- MRC Oxford Institute for Radiation Oncology, University of Oxford, ORCRB Roosevelt Drive, Oxford OX3 7DQ, United Kingdom
| | - Constantinos Koumenis
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Kirsten Lauber
- Department of Radiation Oncology, University Hospital, LMU München, Munich, Germany
- German Cancer Consortium (DKTK), Partner site Munich, Germany
| | - Brian Marples
- Department of Radiation Oncology, University of Rochester, NY, United States of America
| | - Katia Parodi
- German Cancer Consortium (DKTK), Partner site Munich, Germany
- Department of Medical Physics, Faculty of Physics, Ludwig-Maximilians-Universität München, Garching b. Munich, Germany
| | | | - Nick Staut
- SmART Scientific Solutions BV, Maastricht, The Netherlands
| | - Anna Subiel
- National Physical Laboratory, Medical Radiation Science Hampton Road, Teddington, Middlesex, TW11 0LW, United Kingdom
| | - Rianne D W Vaes
- MAASTRO Clinic, Radiotherapy Division, GROW-School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, The Netherlands
| | | | - Ioannis L Verginadis
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Jan J Wilkens
- Department of Radiation Oncology, Technical University of Munich (TUM), School of Medicine and Klinikum rechts der Isar, Germany
- Physics Department, Technical University of Munich (TUM), Germany
| | - Kaye J Williams
- Division of Pharmacy and Optometry, University of Manchester, Manchester, United Kingdom
| | - George D Wilson
- Department of Radiation Oncology, Beaumont Health, MI, United States of America
- Henry Ford Health, Detroit, MI, United States of America
| | - Ludwig J Dubois
- The M-Lab, Department of Precision Medicine, GROW-School for Oncology and Reproduction, Maastricht University, Maastricht, The Netherlands
| |
Collapse
|
80
|
Zheng W, Chen Q, Yao L, Zhuang J, Huang J, Hu Y, Yu S, Chen T, Wei N, Zeng Y, Zhang Y, Fan C, Wang Y. Prediction Models for Sleep Quality Among College Students During the COVID-19 Outbreak: Cross-sectional Study Based on the Internet New Media. J Med Internet Res 2023; 25:e45721. [PMID: 36961495 PMCID: PMC10131726 DOI: 10.2196/45721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Revised: 02/15/2023] [Accepted: 02/16/2023] [Indexed: 02/18/2023] Open
Abstract
BACKGROUND COVID-19 has been reported to affect the sleep quality of Chinese residents; however, the epidemic's effects on the sleep quality of college students during closed-loop management remain unclear, and a screening tool is lacking. OBJECTIVE This study aimed to understand the sleep quality of college students in Fujian Province during the epidemic and determine sensitive variables, in order to develop an efficient prediction model for the early screening of sleep problems in college students. METHODS From April 5 to 16, 2022, a cross-sectional internet-based survey was conducted. The Pittsburgh Sleep Quality Index (PSQI) scale, a self-designed general data questionnaire, and the sleep quality influencing factor questionnaire were used to understand the sleep quality of respondents in the previous month. A chi-square test and a multivariate unconditioned logistic regression analysis were performed, and influencing factors obtained were applied to develop prediction models. The data were divided into a training-testing set (n=14,451, 70%) and an independent validation set (n=6194, 30%) by stratified sampling. Four models using logistic regression, an artificial neural network, random forest, and naïve Bayes were developed and validated. RESULTS In total, 20,645 subjects were included in this survey, with a mean global PSQI score of 6.02 (SD 3.112). The sleep disturbance rate was 28.9% (n=5972, defined as a global PSQI score >7 points). A total of 11 variables related to sleep quality were taken as parameters of the prediction models, including age, gender, residence, specialty, respiratory history, coffee consumption, stay up, long hours on the internet, sudden changes, fears of infection, and impatient closed-loop management. Among the generated models, the artificial neural network model proved to be the best, with an area under curve, accuracy, sensitivity, specificity, positive predictive value, and negative predictive value of 0.713, 73.52%, 25.51%, 92.58%, 57.71%, and 75.79%, respectively. It is noteworthy that the logistic regression, random forest, and naive Bayes models achieved high specificities of 94.41%, 94.77%, and 86.40%, respectively. CONCLUSIONS The COVID-19 containment measures affected the sleep quality of college students on multiple levels, indicating that it is desiderate to provide targeted university management and social support. The artificial neural network model has presented excellent predictive efficiency and is favorable for implementing measures earlier in order to improve present conditions.
Collapse
Affiliation(s)
- Wanyu Zheng
- The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| | - Qingquan Chen
- The School of Public Health, Fujian Medical University, Fuzhou, China
- The Graduate School of Fujian Medical University, Fuzhou, China
| | - Ling Yao
- The School of Clinical Medicine, Fujian Medical University, Fuzhou, China
| | - Jiajing Zhuang
- The School of Clinical Medicine, Fujian Medical University, Fuzhou, China
| | - Jiewei Huang
- The Graduate School of Fujian Medical University, Fuzhou, China
| | - Yiming Hu
- The School of Public Health, Fujian Medical University, Fuzhou, China
| | - Shaoyang Yu
- The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| | - Tebin Chen
- The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| | - Nan Wei
- The School of Clinical Medicine, Fujian Medical University, Fuzhou, China
| | - Yifu Zeng
- Cyberspace Institute of Advanced Technology, Guangzhou University, Guangzhou, China
| | - Yixiang Zhang
- The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| | - Chunmei Fan
- The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| | - Youjuan Wang
- The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| |
Collapse
|
81
|
Mirza Z, Karim S. Structure-Based Profiling of Potential Phytomolecules with AKT1 a Key Cancer Drug Target. Molecules 2023; 28:molecules28062597. [PMID: 36985568 PMCID: PMC10051420 DOI: 10.3390/molecules28062597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Revised: 03/07/2023] [Accepted: 03/09/2023] [Indexed: 03/14/2023] Open
Abstract
Identifying cancer biomarkers is imperative, as upregulated genes offer a better microenvironment for the tumor; hence, targeted inhibition is preferred. The theme of our study is to predict molecular interactions between cancer biomarker proteins and selected natural compounds. We identified an overexpressed potential molecular target (AKT1) and computationally evaluated its inhibition by four dietary ligands (isoliquiritigenin, shogaol, tehranolide, and theophylline). The three-dimensional structures of protein and phytochemicals were retrieved from the RCSB PDB database (4EKL) and NCBI’s PubChem, respectively. Rational structure-based docking studies were performed using AutoDock. Results were analyzed based primarily on the estimated free binding energy (kcal/mol), hydrogen bonds, and inhibition constant, Ki, to identify the most effective anti-cancer phytomolecule. Toxicity and drug-likeliness prediction were performed using OSIRIS and SwissADME. Amongst the four phytocompounds, tehranolide has better potential to suppress the expression of AKT1 and could be used for anti-cancer drug development, as inhibition of AKT1 is directly associated with the inhibition of growth, progression, and metastasis of the tumor. Docking analyses reveal that tehranolide has the most efficiency in inhibiting AKT1 and has the potential to be used for the therapeutic management of cancer. Natural compounds targeting cancer biomarkers offer less rejection, minimal toxicity, and fewer side effects.
Collapse
Affiliation(s)
- Zeenat Mirza
- King Fahd Medical Research Center, King Abdulaziz University, Jeddah 21589, Saudi Arabia
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, King Abdulaziz University, Jeddah 21589, Saudi Arabia
- Correspondence: or
| | - Sajjad Karim
- Department of Medical Laboratory Sciences, Faculty of Applied Medical Sciences, King Abdulaziz University, Jeddah 21589, Saudi Arabia
- Center of Excellence in Genomic Medicine Research, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| |
Collapse
|
82
|
Yu T, Nantasenamat C, Kachenton S, Anuwongcharoen N, Piacham T. Cheminformatic Analysis and Machine Learning Modeling to Investigate Androgen Receptor Antagonists to Combat Prostate Cancer. ACS OMEGA 2023; 8:6729-6742. [PMID: 36844574 PMCID: PMC9948163 DOI: 10.1021/acsomega.2c07346] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 02/01/2023] [Indexed: 06/18/2023]
Abstract
Prostate cancer (PCa) is a major leading cause of mortality of cancer among males. There have been numerous studies to develop antagonists against androgen receptor (AR), a crucial therapeutic target for PCa. This study is a systematic cheminformatic analysis and machine learning modeling to study the chemical space, scaffolds, structure-activity relationship, and landscape of human AR antagonists. There are 1678 molecules as final data sets. Chemical space visualization by physicochemical property visualization has demonstrated that molecules from the potent/active class generally have a mildly smaller molecular weight (MW), octanol-water partition coefficient (log P), number of hydrogen-bond acceptors (nHA), number of rotatable bonds (nRot), and topological polar surface area (TPSA) than molecules from intermediate/inactive class. The chemical space visualization in the principal component analysis (PCA) plot shows significant overlapping distributions between potent/active class molecules and intermediate/inactive class molecules; potent/active class molecules are intensively distributed, while intermediate/inactive class molecules are widely and sparsely distributed. Murcko scaffold analysis has shown low scaffold diversity in general, and scaffold diversity of potent/active class molecules is even lower than intermediate/inactive class molecules, indicating the necessity for developing molecules with novel scaffolds. Furthermore, scaffold visualization has identified 16 representative Murcko scaffolds. Among them, scaffolds 1, 2, 3, 4, 7, 8, 10, 11, 15, and 16 are highly favorable scaffolds due to their high scaffold enrichment factor values. Based on scaffold analysis, their local structure-activity relationships (SARs) were investigated and summarized. In addition, the global SAR landscape was explored by quantitative structure-activity relationship (QSAR) modelings and structure-activity landscape visualization. A QSAR classification model incorporating all of the 1678 molecules stands out as the best model from a total of 12 candidate models for AR antagonists (built on PubChem fingerprint, extra trees algorithm, accuracy for training set: 0.935, 10-fold cross-validation set: 0.735 and test set: 0.756). Deeper insights into the structure-activity landscape highlighted a total of seven significant activity cliff (AC) generators (ChEMBL molecule IDs: 160257, 418198, 4082265, 348918, 390728, 4080698, and 6530), which provide valuable SAR information for medicinal chemistry. The findings in this study provide new insights and guidelines for hit identification and lead optimization for the development of novel AR antagonists.
Collapse
Affiliation(s)
- Tianshi Yu
- Center
of Data Mining and Biomedical informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Chanin Nantasenamat
- Streamlit
Open Source, Snowflake Inc., San Mateo, California 94402, United States
| | - Supicha Kachenton
- Department
of Clinical Microbiology and Applied Technology, Faculty of Medical
Technology, Mahidol University, Bangkok 10700, Thailand
| | - Nuttapat Anuwongcharoen
- Center
of Data Mining and Biomedical informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Theeraphon Piacham
- Department
of Clinical Microbiology and Applied Technology, Faculty of Medical
Technology, Mahidol University, Bangkok 10700, Thailand
| |
Collapse
|
83
|
Comparative Studies on Resampling Techniques in Machine Learning and Deep Learning Models for Drug-Target Interaction Prediction. Molecules 2023; 28:molecules28041663. [PMID: 36838652 PMCID: PMC9964614 DOI: 10.3390/molecules28041663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 01/23/2023] [Accepted: 01/24/2023] [Indexed: 02/12/2023] Open
Abstract
The prediction of drug-target interactions (DTIs) is a vital step in drug discovery. The success of machine learning and deep learning methods in accurately predicting DTIs plays a huge role in drug discovery. However, when dealing with learning algorithms, the datasets used are usually highly dimensional and extremely imbalanced. To solve this issue, the dataset must be resampled accordingly. In this paper, we have compared several data resampling techniques to overcome class imbalance in machine learning methods as well as to study the effectiveness of deep learning methods in overcoming class imbalance in DTI prediction in terms of binary classification using ten (10) cancer-related activity classes from BindingDB. It is found that the use of Random Undersampling (RUS) in predicting DTIs severely affects the performance of a model, especially when the dataset is highly imbalanced, thus, rendering RUS unreliable. It is also found that SVM-SMOTE can be used as a go-to resampling method when paired with the Random Forest and Gaussian Naïve Bayes classifiers, whereby a high F1 score is recorded for all activity classes that are severely and moderately imbalanced. Additionally, the deep learning method called Multilayer Perceptron recorded high F1 scores for all activity classes even when no resampling method was applied.
Collapse
|
84
|
Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning. Molecules 2023; 28:molecules28041679. [PMID: 36838665 PMCID: PMC9966999 DOI: 10.3390/molecules28041679] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 01/01/2023] [Accepted: 01/12/2023] [Indexed: 02/12/2023] Open
Abstract
Cytochrome P450 17A1 (CYP17A1) is one of the key enzymes in steroidogenesis that produces dehydroepiandrosterone (DHEA) from cholesterol. Abnormal DHEA production may lead to the progression of severe diseases, such as prostatic and breast cancers. Thus, CYP17A1 is a druggable target for anti-cancer molecule development. In this study, cheminformatic analyses and quantitative structure-activity relationship (QSAR) modeling were applied on a set of 962 CYP17A1 inhibitors (i.e., consisting of 279 steroidal and 683 nonsteroidal inhibitors) compiled from the ChEMBL database. For steroidal inhibitors, a QSAR classification model built using the PubChem fingerprint along with the extra trees algorithm achieved the best performance, reflected by the accuracy values of 0.933, 0.818, and 0.833 for the training, cross-validation, and test sets, respectively. For nonsteroidal inhibitors, a systematic cheminformatic analysis was applied for exploring the chemical space, Murcko scaffolds, and structure-activity relationships (SARs) for visualizing distributions, patterns, and representative scaffolds for drug discoveries. Furthermore, seven total QSAR classification models were established based on the nonsteroidal scaffolds, and two activity cliff (AC) generators were identified. The best performing model out of these seven was model VIII, which is built upon the PubChem fingerprint along with the random forest algorithm. It achieved a robust accuracy across the training set, the cross-validation set, and the test set, i.e., 0.96, 0.92, and 0.913, respectively. It is anticipated that the results presented herein would be instrumental for further CYP17A1 inhibitor drug discovery efforts.
Collapse
|
85
|
A deep learning-based framework for automatic detection of drug resistance in tuberculosis patients. EGYPTIAN INFORMATICS JOURNAL 2023. [DOI: 10.1016/j.eij.2023.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
|
86
|
Firooz A, Funkhouser AT, Martin JC, Edenfield WJ, Valafar H, Blenda AV. Comprehensive and User-Analytics-Friendly Cancer Patient Database for Physicians and Researchers. ARXIV 2023:arXiv:2302.01337v1. [PMID: 36776819 PMCID: PMC9915752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/14/2023]
Abstract
Nuanced cancer patient care is needed, as the development and clinical course of cancer is multifactorial with influences from the general health status of the patient, germline and neoplastic mutations, co-morbidities, and environment. To effectively tailor an individualized treatment to each patient, such multifactorial data must be presented to providers in an easy-to-access and easy-to-analyze fashion. To address the need, a relational database has been developed integrating status of cancer-critical gene mutations, serum galectin profiles, serum and tumor glycomic profiles, with clinical, demographic, and lifestyle data points of individual cancer patients. The database, as a backend, provides physicians and researchers with a single, easily accessible repository of cancer profiling data to aid-in and enhance individualized treatment. Our interactive database allows care providers to amalgamate cohorts from these groups to find correlations between different data types with the possibility of finding "molecular signatures" based upon a combination of genetic mutations, galectin serum levels, glycan compositions, and patient clinical data and lifestyle choices. Our project provides a framework for an integrated, interactive, and growing database to analyze molecular and clinical patterns across cancer stages and subtypes and provides opportunities for increased diagnostic and prognostic power.
Collapse
Affiliation(s)
- Ali Firooz
- College of Engineering and Computing, University of South Carolina, Columbia, SC, USA
| | - Avery T Funkhouser
- School of Medicine Greenville, University of South Carolina, Greenville, SC, USA
| | | | | | - Homayoun Valafar
- College of Engineering and Computing, University of South Carolina, Columbia, SC, USA
| | - Anna V Blenda
- School of Medicine Greenville, University of South Carolina, Prisma Health Cancer Institute, Greenville, SC, USA
| |
Collapse
|
87
|
Mirzaei M, Furxhi I, Murphy F, Mullins M. Employing Supervised Algorithms for the Prediction of Nanomaterial's Antioxidant Efficiency. Int J Mol Sci 2023; 24:ijms24032792. [PMID: 36769135 PMCID: PMC9918003 DOI: 10.3390/ijms24032792] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 01/25/2023] [Accepted: 01/29/2023] [Indexed: 02/05/2023] Open
Abstract
Reactive oxygen species (ROS) are compounds that readily transform into free radicals. Excessive exposure to ROS depletes antioxidant enzymes that protect cells, leading to oxidative stress and cellular damage. Nanomaterials (NMs) exhibit free radical scavenging efficiency representing a potential solution for oxidative stress-induced disorders. This study aims to demonstrate the application of machine learning (ML) algorithms for predicting the antioxidant efficiency of NMs. We manually compiled a comprehensive dataset based on a literature review of 62 in vitro studies. We extracted NMs' physico-chemical (P-chem) properties, the NMs' synthesis technique and various experimental conditions as input features to predict the antioxidant efficiency measured by a 2,2-diphenyl-1-picrylhydrazyl (DPPH) assay. Following data pre-processing, various regression models were trained and validated. The random forest model showed the highest predictive performance reaching an R2 = 0.83. The attribute importance analysis revealed that the NM's type, core-size and dosage are the most important attributes influencing the prediction. Our findings corroborate with those of the prior research landscape regarding the importance of P-chem characteristics. This study expands the application of ML in the nano-domain beyond safety-related outcomes by capturing the functional performance. Accordingly, this study has two objectives: (1) to develop a model to forecast the antioxidant efficiency of NMs to complement conventional in vitro assays and (2) to underline the lack of a comprehensive database and the scarcity of relevant data and/or data management practices in the nanotechnology field, especially with regards to functionality assessments.
Collapse
Affiliation(s)
- Mahsa Mirzaei
- Department of Accounting and Finance, Kemmy Business School, University of Limerick, V94PH93 Limerick, Ireland
| | - Irini Furxhi
- Department of Accounting and Finance, Kemmy Business School, University of Limerick, V94PH93 Limerick, Ireland
- Transgero Limited, Newcastle West, V42V384 Limerick, Ireland
- Correspondence: ; Tel.: +353-85-106-9771
| | - Finbarr Murphy
- Department of Accounting and Finance, Kemmy Business School, University of Limerick, V94PH93 Limerick, Ireland
| | - Martin Mullins
- Department of Accounting and Finance, Kemmy Business School, University of Limerick, V94PH93 Limerick, Ireland
| |
Collapse
|
88
|
Vemula D, Jayasurya P, Sushmitha V, Kumar YN, Bhandari V. CADD, AI and ML in drug discovery: A comprehensive review. Eur J Pharm Sci 2023; 181:106324. [PMID: 36347444 DOI: 10.1016/j.ejps.2022.106324] [Citation(s) in RCA: 37] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 10/26/2022] [Accepted: 11/03/2022] [Indexed: 11/06/2022]
Abstract
Computer-aided drug design (CADD) is an emerging field that has drawn a lot of interest because of its potential to expedite and lower the cost of the drug development process. Drug discovery research is expensive and time-consuming, and it frequently took 10-15 years for a drug to be commercially available. CADD has significantly impacted this area of research. Further, the combination of CADD with Artificial Intelligence (AI), Machine Learning (ML), and Deep Learning (DL) technologies to handle enormous amounts of biological data has reduced the time and cost associated with the drug development process. This review will discuss how CADD, AI, ML, and DL approaches help identify drug candidates and various other steps of the drug discovery process. It will also provide a detailed overview of the different in silico tools used and how these approaches interact.
Collapse
Affiliation(s)
- Divya Vemula
- National Institute of Pharmaceutical Education and Research- Hyderabad, India
| | - Perka Jayasurya
- National Institute of Pharmaceutical Education and Research- Hyderabad, India
| | - Varthiya Sushmitha
- National Institute of Pharmaceutical Education and Research- Hyderabad, India
| | | | - Vasundhra Bhandari
- National Institute of Pharmaceutical Education and Research- Hyderabad, India.
| |
Collapse
|
89
|
McNair D. Artificial Intelligence and Machine Learning for Lead-to-Candidate Decision-Making and Beyond. Annu Rev Pharmacol Toxicol 2023; 63:77-97. [PMID: 35679624 DOI: 10.1146/annurev-pharmtox-051921-023255] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
The use of artificial intelligence (AI) and machine learning (ML) in pharmaceutical research and development has to date focused on research: target identification; docking-, fragment-, and motif-based generation of compound libraries; modeling of synthesis feasibility; rank-ordering likely hits according to structural and chemometric similarity to compounds having known activity and affinity to the target(s); optimizing a smaller library for synthesis and high-throughput screening; and combining evidence from screening to support hit-to-lead decisions. Applying AI/ML methods to lead optimization and lead-to-candidate (L2C) decision-making has shown slower progress, especially regarding predicting absorption, distribution, metabolism, excretion, and toxicology properties. The present review surveys reasons why this is so, reports progress that has occurred in recent years, and summarizes some of the issues that remain. Effective AI/ML tools to derisk L2C and later phases of development are important to accelerate the pharmaceutical development process, ameliorate escalating development costs, and achieve greater success rates.
Collapse
Affiliation(s)
- Douglas McNair
- Global Health, Integrated Development, Bill & Melinda Gates Foundation, Seattle, Washington, USA;
| |
Collapse
|
90
|
Yeh KB, Parekh FK, Mombo I, Leimer J, Hewson R, Olinger G, Fair JM, Sun Y, Hay J. Climate change and infectious disease: A prologue on multidisciplinary cooperation and predictive analytics. Front Public Health 2023; 11:1018293. [PMID: 36741948 PMCID: PMC9895942 DOI: 10.3389/fpubh.2023.1018293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Accepted: 01/02/2023] [Indexed: 01/22/2023] Open
Abstract
Climate change impacts global ecosystems at the interface of infectious disease agents and hosts and vectors for animals, humans, and plants. The climate is changing, and the impacts are complex, with multifaceted effects. In addition to connecting climate change and infectious diseases, we aim to draw attention to the challenges of working across multiple disciplines. Doing this requires concentrated efforts in a variety of areas to advance the technological state of the art and at the same time implement ideas and explain to the everyday citizen what is happening. The world's experience with COVID-19 has revealed many gaps in our past approaches to anticipating emerging infectious diseases. Most approaches to predicting outbreaks and identifying emerging microbes of major consequence have been with those causing high morbidity and mortality in humans and animals. These lagging indicators offer limited ability to prevent disease spillover and amplifications in new hosts. Leading indicators and novel approaches are more valuable and now feasible, with multidisciplinary approaches also within our grasp to provide links to disease predictions through holistic monitoring of micro and macro ecological changes. In this commentary, we describe niches for climate change and infectious diseases as well as overarching themes for the important role of collaborative team science, predictive analytics, and biosecurity. With a multidisciplinary cooperative "all call," we can enhance our ability to engage and resolve current and emerging problems.
Collapse
Affiliation(s)
| | | | - Illich Mombo
- CIRMF, Franceville, Gabon, Central African Republic
| | | | - Roger Hewson
- UK Health Security Agency, Salisbury, United Kingdom
- London School of Hygiene and Tropical Medicine, London, United Kingdom
| | | | - Jeanne M. Fair
- Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Yijun Sun
- Jacobs School of Medicine and Biomedical Sciences, Buffalo, NY, United States
| | - John Hay
- Jacobs School of Medicine and Biomedical Sciences, Buffalo, NY, United States
| |
Collapse
|
91
|
Djokovic N, Rahnasto-Rilla M, Lougiakis N, Lahtela-Kakkonen M, Nikolic K. SIRT2i_Predictor: A Machine Learning-Based Tool to Facilitate the Discovery of Novel SIRT2 Inhibitors. Pharmaceuticals (Basel) 2023; 16:ph16010127. [PMID: 36678624 PMCID: PMC9864763 DOI: 10.3390/ph16010127] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 01/10/2023] [Accepted: 01/11/2023] [Indexed: 01/17/2023] Open
Abstract
A growing body of preclinical evidence recognized selective sirtuin 2 (SIRT2) inhibitors as novel therapeutics for treatment of age-related diseases. However, none of the SIRT2 inhibitors have reached clinical trials yet. Transformative potential of machine learning (ML) in early stages of drug discovery has been witnessed by widespread adoption of these techniques in recent years. Despite great potential, there is a lack of robust and large-scale ML models for discovery of novel SIRT2 inhibitors. In order to support virtual screening (VS), lead optimization, or facilitate the selection of SIRT2 inhibitors for experimental evaluation, a machine-learning-based tool titled SIRT2i_Predictor was developed. The tool was built on a panel of high-quality ML regression and classification-based models for prediction of inhibitor potency and SIRT1-3 isoform selectivity. State-of-the-art ML algorithms were used to train the models on a large and diverse dataset containing 1797 compounds. Benchmarking against structure-based VS protocol indicated comparable coverage of chemical space with great gain in speed. The tool was applied to screen the in-house database of compounds, corroborating the utility in the prioritization of compounds for costly in vitro screening campaigns. The easy-to-use web-based interface makes SIRT2i_Predictor a convenient tool for the wider community. The SIRT2i_Predictor's source code is made available online.
Collapse
Affiliation(s)
- Nemanja Djokovic
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, University of Belgrade, Vojvode Stepe 450, 11221 Belgrade, Serbia
- Correspondence: (N.D.); (K.N.)
| | - Minna Rahnasto-Rilla
- School of Pharmacy, University of Eastern Finland, P.O. Box 1627, 70210 Kuopio, Finland
| | - Nikolaos Lougiakis
- Laboratory of Medicinal Chemistry, Section of Pharmaceutical Chemistry, Department of Pharmacy, School of Health Sciences, National and Kapodistrian University of Athens, Panepistimiopolis-Zografou, 15771 Athens, Greece
| | | | - Katarina Nikolic
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, University of Belgrade, Vojvode Stepe 450, 11221 Belgrade, Serbia
- Correspondence: (N.D.); (K.N.)
| |
Collapse
|
92
|
López
Barreiro D, Folch-Fortuny A, Muntz I, Thies JC, Sagt CM, Koenderink GH. Sequence Control of the Self-Assembly of Elastin-Like Polypeptides into Hydrogels with Bespoke Viscoelastic and Structural Properties. Biomacromolecules 2023; 24:489-501. [PMID: 36516874 PMCID: PMC9832484 DOI: 10.1021/acs.biomac.2c01405] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
The biofabrication of structural proteins with controllable properties via amino acid sequence design is interesting for biomedicine and biotechnology, yet a complete framework that connects amino acid sequence to material properties is unavailable, despite great progress to establish design rules for synthesizing peptides and proteins with specific conformations (e.g., unfolded, helical, β-sheets, or β-turns) and intermolecular interactions (e.g., amphipathic peptides or hydrophobic domains). Molecular dynamics (MD) simulations can help in developing such a framework, but the lack of a standardized way of interpreting the outcome of these simulations hinders their predictive value for the design of de novo structural proteins. To address this, we developed a model that unambiguously classifies a library of de novo elastin-like polypeptides (ELPs) with varying numbers and locations of hydrophobic/hydrophilic and physical/chemical-cross-linking blocks according to their thermoresponsiveness at physiological temperature. Our approach does not require long simulation times or advanced sampling methods. Instead, we apply (un)supervised data analysis methods to a data set of molecular properties from relatively short MD simulations (150 ns). We also experimentally investigate hydrogels of those ELPs from the library predicted to be thermoresponsive, revealing several handles to tune their mechanical and structural properties: chain hydrophilicity/hydrophobicity or block distribution control the viscoelasticity and thermoresponsiveness, whereas ELP concentration defines the network permeability. Our findings provide an avenue to accelerate the design of de novo ELPs with bespoke phase behavior and material properties.
Collapse
Affiliation(s)
- Diego López
Barreiro
- DSM
Biosciences and Process Innovation, DSM, Alexander Fleminglaan 1, 2613 AXDelft, The Netherlands
| | - Abel Folch-Fortuny
- DSM
Biodata and Translation, DSM, Alexander Fleminglaan 1, 2613 AXDelft, The Netherlands
| | - Iain Muntz
- Department
of Bionanoscience, Kavli Institute of Nanoscience Delft, Delft University of Technology, Van der Maasweg 9, 2629 HZDelft, The Netherlands
| | - Jens C. Thies
- DSM
Biomedical, DSM, Urmonderbaan
22, 6160 BB, Geleen, The Netherlands,E-mail:
| | - Cees M.J. Sagt
- DSM
Biosciences and Process Innovation, DSM, Alexander Fleminglaan 1, 2613 AXDelft, The Netherlands,E-mail:
| | - Gijsje H. Koenderink
- Department
of Bionanoscience, Kavli Institute of Nanoscience Delft, Delft University of Technology, Van der Maasweg 9, 2629 HZDelft, The Netherlands,E-mail:
| |
Collapse
|
93
|
Lerksuthirat T, Chitphuk S, Stitchantrakul W, Dejsuphong D, Malik AA, Nantasenamat C. PARP1pred: a web server for screening the bioactivity of inhibitors against DNA repair enzyme PARP-1. EXCLI JOURNAL 2023; 22:84-107. [PMID: 36814851 PMCID: PMC9939779 DOI: 10.17179/excli2022-5602] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 12/23/2022] [Indexed: 02/24/2023]
Abstract
Cancer is the leading cause of death worldwide, resulting in the mortality of more than 10 million people in 2020, according to Global Cancer Statistics 2020. A potential cancer therapy involves targeting the DNA repair process by inhibiting PARP-1. In this study, classification models were constructed using a non-redundant set of 2018 PARP-1 inhibitors. Briefly, compounds were described by 12 fingerprint types and built using the random forest algorithm concomitant with various sampling approaches. Results indicated that PubChem with an oversampling approach yielded the best performance, with a Matthews correlation coefficient > 0.7 while also affording interpretable molecular features. Moreover, feature importance, as determined from the Gini index, revealed that the aromatic/cyclic/heterocyclic moiety, nitrogen-containing fingerprints, and the ether/aldehyde/alcohol moiety were important for PARP-1 inhibition. Finally, our predictive model was deployed as a web application called PARP1pred and is publicly available at https://parp1pred.streamlitapp.com, allowing users to predict the biological activity of query compounds using their SMILES notation as the input. It is anticipated that the model described herein will aid in the discovery of effective PARP-1 inhibitors.
Collapse
Affiliation(s)
- Tassanee Lerksuthirat
- Research Center, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok 10400, Thailand,*To whom correspondence should be addressed: Tassanee Lerksuthirat, Research Center, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok 10400, Thailand, E-mail:
| | - Sermsiri Chitphuk
- Research Center, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok 10400, Thailand
| | - Wasana Stitchantrakul
- Research Center, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok 10400, Thailand
| | - Donniphat Dejsuphong
- Program in Translational Medicine, Chakri Naruebodindra Medical Institute, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Samut Prakan 10540, Thailand
| | - Aijaz Ahmad Malik
- Center of Excellence in Computational Molecular Biology, Faculty of Medicine, Chulalongkorn University, Bangkok 10330, Thailand
| | | |
Collapse
|
94
|
Wang CC, Hung YT, Chou CY, Hsuan SL, Chen ZW, Chang PY, Jan TR, Tung CW. Using random forest to predict antimicrobial minimum inhibitory concentrations of nontyphoidal Salmonella in Taiwan. Vet Res 2023; 54:11. [PMID: 36747286 PMCID: PMC9903507 DOI: 10.1186/s13567-023-01141-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 01/13/2023] [Indexed: 02/08/2023] Open
Abstract
Antimicrobial resistance (AMR) is a global health issue and surveillance of AMR can be useful for understanding AMR trends and planning intervention strategies. Salmonella, widely distributed in food-producing animals, has been considered the first priority for inclusion in the AMR surveillance program by the World Health Organization (WHO). Recent advances in rapid and affordable whole-genome sequencing (WGS) techniques lead to the emergence of WGS as a one-stop test to predict the antimicrobial susceptibility. Since the variation of sequencing and minimum inhibitory concentration (MIC) measurement methods could result in different results, this study aimed to develop WGS-based random forest models for predicting MIC values of 24 drugs using data generated from the same laboratories in Taiwan. The WGS data have been transformed as a feature vector of 10-mers for machine learning. Based on rigorous validation and independent tests, a good performance was obtained with an average mean absolute error (MAE) less than 1 for both validation and independent test. Feature selection was then applied to identify top-ranked 10-mers that can further improve the prediction performance. For surveillance purposes, the genome sequence-based machine learning methods could be utilized to monitor the difference between predicted and experimental MIC, where a large difference might be worthy of investigation on the emerging genomic determinants.
Collapse
Affiliation(s)
- Chia-Chi Wang
- grid.19188.390000 0004 0546 0241Department and Graduate Institute of Veterinary Medicine, School of Veterinary Medicine, National Taiwan University, Taipei, 106 Taiwan
| | - Yu-Ting Hung
- grid.482517.dAnimal Technology Laboratories, Agricultural Technology Research Institute, Hsinchu City, 300 Taiwan ,grid.260542.70000 0004 0532 3749Graduate Institute of Veterinary Pathobiology, College of Veterinary Medicine, National Chung Hsing University, Taichung, 402 Taiwan
| | - Che-Yu Chou
- grid.412896.00000 0000 9337 0481Graduate Institute of Data Science, College of Management, Taipei Medical University, Taipei, 106 Taiwan
| | - Shih-Ling Hsuan
- grid.260542.70000 0004 0532 3749Graduate Institute of Veterinary Pathobiology, College of Veterinary Medicine, National Chung Hsing University, Taichung, 402 Taiwan
| | - Zeng-Weng Chen
- grid.482517.dAnimal Technology Laboratories, Agricultural Technology Research Institute, Hsinchu City, 300 Taiwan
| | - Pei-Yu Chang
- grid.59784.370000000406229172Institute of Biotechnology and Pharmaceutical Research, National Health Research Institutes, Miaoli County, 350 Taiwan
| | - Tong-Rong Jan
- Department and Graduate Institute of Veterinary Medicine, School of Veterinary Medicine, National Taiwan University, Taipei, 106, Taiwan.
| | - Chun-Wei Tung
- Graduate Institute of Data Science, College of Management, Taipei Medical University, Taipei, 106, Taiwan. .,Institute of Biotechnology and Pharmaceutical Research, National Health Research Institutes, Miaoli County, 350, Taiwan.
| |
Collapse
|
95
|
Sobańska AW. Immobilized artificial membrane-chromatographic and computational descriptors in studies of soil-water partition of environmentally relevant compounds. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:6192-6200. [PMID: 35994147 PMCID: PMC9895004 DOI: 10.1007/s11356-022-22514-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 08/09/2022] [Indexed: 05/27/2023]
Abstract
Chromatographic retention factor log kIAM obtained from immobilized artificial membrane (IAM) HPLC with buffered, aqueous mobile phases and calculated molecular descriptors (molecular weight - log MW; molar volume - VM; polar surface area - PSA; total count of nitrogen and oxygen atoms -(N + O); count of freely rotable bonds - FRB; H-bond donor count - HD; H-bond acceptor count - HA; energy of the highest occupied molecular orbital - EHOMO; energy of the lowest unoccupied orbital - ELUMO; dipole moment - DM; polarizability - α) obtained for a group of 175 structurally unrelated compounds were tested in order to generate useful models of solutes' soil-water partition coefficient normalized to organic carbon log Koc. It was established that log kIAM obtained in the conditions described in this study is not sufficient as a sole predictor of the soil-water partition coefficient. Simple, potentially useful models based on log kIAM and a selection of readily available, calculated descriptors and accounting for over 88% of total variability were generated using multiple linear regression (MLR) and artificial neural networks (ANN). The models proposed in the study were tested on a group of 50 compounds with known experimental log Koc values by plotting the calculated vs. experimental values. There is a good close similarity between the calculated and experimental data for both MLR and ANN models for compounds from different chemical families (R2 ≥ 0.80, n = 50) which proves the models' reliability.
Collapse
Affiliation(s)
- Anna W Sobańska
- Department of Analytical Chemistry, Medical University of Łódź, ul. Muszyńskiego 1, 90-151, Lodz, Poland.
| |
Collapse
|
96
|
Ogawa K, Sakamoto D, Hosoki R. Computer Science Technology in Natural Products Research: A Review of Its Applications and Implications. Chem Pharm Bull (Tokyo) 2023; 71:486-494. [PMID: 37394596 DOI: 10.1248/cpb.c23-00039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Computational approaches to drug development are rapidly growing in popularity and have been used to produce significant results. Recent developments in information science have expanded databases and chemical informatics knowledge relating to natural products. Natural products have long been well-studied, and a large number of unique structures and remarkable active substances have been reported. Analyzing accumulated natural product knowledge using emerging computational science techniques is expected to yield more new discoveries. In this article, we discuss the current state of natural product research using machine learning. The basic concepts and frameworks of machine learning are summarized. Natural product research that utilizes machine learning is described in terms of the exploration of active compounds, automatic compound design, and application to spectral data. In addition, efforts to develop drugs for intractable diseases will be addressed. Lastly, we discuss key considerations for applying machine learning in this field. This paper aims to promote progress in natural product research by presenting the current state of computational science and chemoinformatics approaches in terms of its applications, strengths, limitations, and implications for the field.
Collapse
Affiliation(s)
- Keiko Ogawa
- Laboratory of Regulatory Science, College of Pharmaceutical Sciences, Ritsumeikan University
| | - Daiki Sakamoto
- Laboratory of Regulatory Science, College of Pharmaceutical Sciences, Ritsumeikan University
| | - Rumiko Hosoki
- Laboratory of Regulatory Science, College of Pharmaceutical Sciences, Ritsumeikan University
| |
Collapse
|
97
|
Mou M, Pan Z, Lu M, Sun H, Wang Y, Luo Y, Zhu F. Application of Machine Learning in Spatial Proteomics. J Chem Inf Model 2022; 62:5875-5895. [PMID: 36378082 DOI: 10.1021/acs.jcim.2c01161] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Spatial proteomics is an interdisciplinary field that investigates the localization and dynamics of proteins, and it has gained extensive attention in recent years, especially the subcellular proteomics. Numerous evidence indicate that the subcellular localization of proteins is associated with various cellular processes and disease progression. Mass spectrometry (MS)-based and imaging-based experimental approaches have been developed to acquire large-scale spatial proteomic data. To allow the reliable analysis of increasingly complex spatial proteomics data, machine learning (ML) methods have been widely used in both MS-based and imaging-based spatial proteomic data analysis pipelines. Here, we comprehensively survey the applications of ML in spatial proteomics from following aspects: (1) data resources for spatial proteome are comprehensively introduced; (2) the roles of different ML algorithms in data analysis pipelines are elaborated; (3) successful applications of spatial proteomics and several analytical tools integrating ML methods are presented; (4) challenges existing in modern ML-based spatial proteomics studies are discussed. This review provides guidelines for researchers seeking to apply ML methods to analyze spatial proteomic data and can facilitate insightful understanding of cell biology as well as the future research in medical and drug discovery communities.
Collapse
Affiliation(s)
- Minjie Mou
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Ziqi Pan
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Mingkun Lu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Huaicheng Sun
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yunxia Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yongchao Luo
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
98
|
Coutinho MG, Câmara GB, Barbosa RDM, Fernandes MA. SARS-CoV-2 virus classification based on stacked sparse autoencoder. Comput Struct Biotechnol J 2022; 21:284-298. [PMID: 36530948 PMCID: PMC9742810 DOI: 10.1016/j.csbj.2022.12.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 12/04/2022] [Accepted: 12/05/2022] [Indexed: 12/13/2022] Open
Abstract
Since December 2019, the world has been intensely affected by the COVID-19 pandemic, caused by the SARS-CoV-2. In the case of a novel virus identification, the early elucidation of taxonomic classification and origin of the virus genomic sequence is essential for strategic planning, containment, and treatments. Deep learning techniques have been successfully used in many viral classification problems associated with viral infection diagnosis, metagenomics, phylogenetics, and analysis. Considering that motivation, the authors proposed an efficient viral genome classifier for the SARS-CoV-2 using the deep neural network based on the stacked sparse autoencoder (SSAE). For the best performance of the model, we explored the utilization of image representations of the complete genome sequences as the SSAE input to provide a classification of the SARS-CoV-2. For that, a dataset based on k-mers image representation was applied. We performed four experiments to provide different levels of taxonomic classification of the SARS-CoV-2. The SSAE technique provided great performance results in all experiments, achieving classification accuracy between 92% and 100% for the validation set and between 98.9% and 100% when the SARS-CoV-2 samples were applied for the test set. In this work, samples of the SARS-CoV-2 were not used during the training process, only during subsequent tests, in which the model was able to infer the correct classification of the samples in the vast majority of cases. This indicates that our model can be adapted to classify other emerging viruses. Finally, the results indicated the applicability of this deep learning technique in genome classification problems.
Collapse
Affiliation(s)
- Maria G.F. Coutinho
- Laboratory of Machine Learning and Intelligent Instrumentation, IMD/nPITI, Federal University of Rio Grande do Norte, Natal, Brazil
| | - Gabriel B.M. Câmara
- Laboratory of Machine Learning and Intelligent Instrumentation, IMD/nPITI, Federal University of Rio Grande do Norte, Natal, Brazil
| | - Raquel de M. Barbosa
- Department of Pharmacy and Pharmaceutical Technology, University of Granada, 18071 Granada, Spain
| | - Marcelo A.C. Fernandes
- Laboratory of Machine Learning and Intelligent Instrumentation, IMD/nPITI, Federal University of Rio Grande do Norte, Natal, Brazil
- Department of Computer and Automation Engineering, Federal University of Rio Grande do Norte, Natal, Brazil
| |
Collapse
|
99
|
Baručić D, Kaushik S, Kybic J, Stanková J, Džubák P, Hajdúch M. Characterization of drug effects on cell cultures from phase-contrast microscopy images. Comput Biol Med 2022; 151:106171. [PMID: 36306582 DOI: 10.1016/j.compbiomed.2022.106171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 08/30/2022] [Accepted: 10/01/2022] [Indexed: 12/27/2022]
Abstract
In this work, we classify chemotherapeutic agents (topoisomerase inhibitors) based on their effect on U-2 OS cells. We use phase-contrast microscopy images, which are faster and easier to obtain than fluorescence images and support live cell imaging. We use a convolutional neural network (CNN) trained end-to-end directly on the input images without requiring for manual segmentations or any other auxiliary data. Our method can distinguish between tested cytotoxic drugs with an accuracy of 98%, provided that their mechanism of action differs, outperforming previous work. The results are even better when substance-specific concentrations are used. We show the benefit of sharing the extracted features over all classes (drugs). Finally, a 2D visualization of these features reveals clusters, which correspond well to known class labels, suggesting the possible use of our methodology for drug discovery application in analyzing new, unseen drugs.
Collapse
Affiliation(s)
- Denis Baručić
- Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic.
| | - Sumit Kaushik
- Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic.
| | - Jan Kybic
- Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic.
| | - Jarmila Stanková
- Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacky University, Olomouc, Czech Republic
| | - Petr Džubák
- Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacky University, Olomouc, Czech Republic
| | - Marián Hajdúch
- Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, Palacky University, Olomouc, Czech Republic
| |
Collapse
|
100
|
Machine learning and structure-based modeling for the prediction of UDP-glucuronosyltransferase inhibition. iScience 2022; 25:105290. [PMID: 36304105 PMCID: PMC9593791 DOI: 10.1016/j.isci.2022.105290] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 09/05/2022] [Accepted: 10/03/2022] [Indexed: 11/23/2022] Open
Abstract
UDP-glucuronosyltransferases (UGTs) are responsible for 35% of the phase II drug metabolism. In this study, we focused on UGT1A1, which is a key UGT isoform. Strong inhibition of UGT1A1 may trigger adverse drug/herb-drug interactions, or result in disorders of endobiotic metabolism. Most of the current machine learning methods predicting the inhibition of drug metabolizing enzymes neglect protein structure and dynamics, both being essential for the recognition of various substrates and inhibitors. We performed molecular dynamics simulations on a homology model of the human UGT1A1 structure containing both the cofactor- (UDP-glucuronic acid) and substrate-binding domains to explore UGT conformational changes. Then, we created models for the prediction of UGT1A1 inhibitors by integrating information on UGT1A1 structure and dynamics, interactions with diverse ligands, and machine learning. These models can be helpful for further prediction of drug-drug interactions of drug candidates and safety treatments. UGTs are responsible for 35% of the phase II drug metabolism reactions We created machine learning models for prediction of UGT1A1 inhibitors Our simulations suggested key residues of UGT1A1 involved in the substrate binding
Collapse
|