1
|
Hossain MM, Roy K. The development of classification-based machine-learning models for the toxicity assessment of chemicals associated with plastic packaging. JOURNAL OF HAZARDOUS MATERIALS 2025; 484:136702. [PMID: 39637787 DOI: 10.1016/j.jhazmat.2024.136702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2024] [Revised: 11/24/2024] [Accepted: 11/26/2024] [Indexed: 12/07/2024]
Abstract
Assessing chemical toxicity in materials like plastic packaging is critical to safeguarding public health. This study presents the development of classification-based machine learning models to predict the toxicity of chemicals associated with plastic packaging. Using an extensive dataset of chemical structures, we trained multiple machine learning models-Random Forest, Support Vector Machine, Linear Discriminant Analysis, and Logistic Regression-targeting endpoints such as Neurotoxicity, Hepatotoxicity, Dermatotoxicity, Carcinogenicity, Reproductive Toxicity, Skin Sensitization, and Toxic Pneumonitis. The dataset was pre-processed by selecting 2D molecular descriptors as feature inputs, with resampling methods (ADASYN, Borderline SMOTE, Random Over-sampler, SVMSMOTE Cluster Centroid, Near Miss, Random Under Sampler) applied to balance classes for accurate classification. A five-fold cross-validation technique was used to optimize model performance, with model parameters fine-tuned using grid search. The model performance was evaluated using accuracy (Acc), sensitivity (Se), specificity (Sp), and area under the receiver operating characteristic curve (AUC-ROC) metrics. In most of the cases, the model accuracy was 0.8 or above for both training and test sets. Additionally, SHAP (SHapley Additive exPlanations) values were utilized for feature importance analysis, highlighting significant descriptors contributing to toxicity predictions. The models were ranked using the Sum of Ranking Differences (SRD) method to systematically select the most effective model. The optimal models demonstrated high predictive accuracy and interpretability, providing a scalable and efficient solution for toxicity assessment compared to traditional methods. This approach offers a valuable tool for rapidly screening potentially hazardous chemicals in plastic packaging.
Collapse
Affiliation(s)
- Md Mobarak Hossain
- Drug Theoretics and Cheminformatics (DTC) Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India
| | - Kunal Roy
- Drug Theoretics and Cheminformatics (DTC) Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India.
| |
Collapse
|
2
|
Pallikara I, Skelton JM, Hatcher LE, Pallipurath AR. Going beyond the Ordered Bulk: A Perspective on the Use of the Cambridge Structural Database for Predictive Materials Design. CRYSTAL GROWTH & DESIGN 2024; 24:6911-6930. [PMID: 39247224 PMCID: PMC11378158 DOI: 10.1021/acs.cgd.4c00694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 07/26/2024] [Accepted: 07/30/2024] [Indexed: 09/10/2024]
Abstract
When Olga Kennard founded the Cambridge Crystallographic Data Centre in 1965, the Cambridge Structural Database was a pioneering attempt to collect scientific data in a standard format. Since then, it has evolved into an indispensable resource in contemporary molecular materials science, with over 1.25 million structures and comprehensive software tools for searching, visualizing and analyzing the data. In this perspective, we discuss the use of the CSD and CCDC tools to address the multiscale challenge of predictive materials design. We provide an overview of the core capabilities of the CSD and CCDC software and demonstrate their application to a range of materials design problems with recent case studies drawn from topical research areas, focusing in particular on the use of data mining and machine learning techniques. We also identify several challenges that can be addressed with existing capabilities or through new capabilities with varying levels of development effort.
Collapse
Affiliation(s)
- Ioanna Pallikara
- School of Chemical and Process Engineering, University of Leeds, Leeds LS2 9JT, U.K
| | - Jonathan M Skelton
- Department of Chemistry, University of Manchester, Manchester M13 9PL, U.K
| | | | | |
Collapse
|
3
|
Chiesa L, Kellenberger E. One class classification for the detection of β2 adrenergic receptor agonists using single-ligand dynamic interaction data. J Cheminform 2022; 14:74. [PMID: 36309734 PMCID: PMC9617447 DOI: 10.1186/s13321-022-00654-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 10/17/2022] [Indexed: 11/22/2022] Open
Abstract
G protein-coupled receptors are involved in many biological processes, relaying the extracellular signal inside the cell. Signaling is regulated by the interactions between receptors and their ligands, it can be stimulated by agonists, or inhibited by antagonists or inverse agonists. The development of a new drug targeting a member of this family requires to take into account the pharmacological profile of the designed ligands in order to elicit the desired response. The structure-based virtual screening of chemical libraries may prioritize a specific class of ligands by combining docking results and ligand binding information provided by crystallographic structures. The performance of the method depends on the relevance of the structural data, in particular the conformation of the targeted site, the binding mode of the reference ligand, and the approach used to compare the interactions formed by the docked ligand with those formed by the reference ligand in the crystallographic structure. Here, we propose a new method based on the conformational dynamics of a single protein–ligand reference complex to improve the biased selection of ligands with specific pharmacological properties in a structure-based virtual screening exercise. Interactions patterns between a reference agonist and the receptor, here exemplified on the β2 adrenergic receptor, were extracted from molecular dynamics simulations of the agonist/receptor complex and encoded in graphs used to train a one-class machine learning classifier. Different conditions were tested: low to high affinity agonists, varying simulation duration, considering or ignoring hydrophobic contacts, and tuning of the classifier parametrization. The best models applied to post-process raw data from retrospective virtual screening obtained by docking of test libraries effectively filtered out irrelevant poses, discarding inactive and non-agonist ligands while identifying agonists. Taken together, our results suggest that consistency of the binding mode during the simulation is a key to the success of the method.
Collapse
Affiliation(s)
- Luca Chiesa
- Laboratoire d'innovation Thérapeutique, Faculté de Pharmacie, UMR7200 CNRS Université de Strasbourg, 67400, Illkirch, France
| | - Esther Kellenberger
- Laboratoire d'innovation Thérapeutique, Faculté de Pharmacie, UMR7200 CNRS Université de Strasbourg, 67400, Illkirch, France.
| |
Collapse
|
4
|
Yang D, Wang L, Yuan P, An Q, Su B, Yu M, Chen T, Hu K, Zhang L, Lu Y, Du G. Cocrystal virtual screening based on the XGBoost machine learning model. CHINESE CHEM LETT 2022. [DOI: 10.1016/j.cclet.2022.107964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
5
|
Prediction and Design of Cyclodextrin Inclusion Complexes formation via Machine Learning-based Strategies. Chem Eng Sci 2022. [DOI: 10.1016/j.ces.2022.117946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
6
|
Nishimoto M, Uetake Y, Yakiyama Y, Ishiwari F, Saeki A, Sakurai H. Synthesis of the C 70 Fragment Buckybowl, Homosumanene, and Heterahomosumanenes via Ring-Expansion Reactions from Sumanenone. J Org Chem 2022; 87:2508-2519. [PMID: 35179377 DOI: 10.1021/acs.joc.1c02416] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Bowl-shaped aromatic molecules, buckybowls, are attractive molecules because of the unique properties derived from their curved-π scaffolds. Doping heteroatoms into buckybowl frameworks is a powerful method to change their structural and electronical properties. Herein, we report the synthesis of C70 fragment buckybowl, homosumanene, and heterahomosumanenes having a lactone moiety and a lactam moiety via three ring-expansion reactions using sumanenone as a common intermediate. X-ray diffraction analysis of the single crystals reveals their columnar packing structure with a shallow bowl-depth. The lactam moiety is readily derivatized to give azahomosumanene derivatives, nitrogen-doped analogues of homosumanene possessing a pyridine ring at the peripheral carbon. The synthetic application of the α-phenyl azahomosumanene as a cyclometalating ligand with platinum also revealed its utility for preparing a metal complex bearing a buckybowl ligand.
Collapse
Affiliation(s)
- Mikey Nishimoto
- Division of Applied Chemistry, Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Yuta Uetake
- Division of Applied Chemistry, Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan.,Innovative Catalysis Science Division, Institute for Open and Transdisciplinary Research Initiatives (ICS-OTRI), Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Yumi Yakiyama
- Division of Applied Chemistry, Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan.,Innovative Catalysis Science Division, Institute for Open and Transdisciplinary Research Initiatives (ICS-OTRI), Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Fumitaka Ishiwari
- Division of Applied Chemistry, Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan.,Innovative Catalysis Science Division, Institute for Open and Transdisciplinary Research Initiatives (ICS-OTRI), Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan.,Frontier Research Base for Global Young Researchers, Graduate School of Engineering, Osaka University, Suita, Osaka 565-0871, Japan
| | - Akinori Saeki
- Division of Applied Chemistry, Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan.,Innovative Catalysis Science Division, Institute for Open and Transdisciplinary Research Initiatives (ICS-OTRI), Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Hidehiro Sakurai
- Division of Applied Chemistry, Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan.,Innovative Catalysis Science Division, Institute for Open and Transdisciplinary Research Initiatives (ICS-OTRI), Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan
| |
Collapse
|
7
|
Iqbal D, Hassan A, Ansari AA, Muhammad N, Khan A, Khalid S, Sharif F. Sustainable silver nanoparticles as the vector for green therapeutics in oncology. APPLIED NANOSCIENCE 2022. [DOI: 10.1007/s13204-022-02344-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
8
|
Omar ÖH, Del Cueto M, Nematiaram T, Troisi A. High-throughput virtual screening for organic electronics: a comparative study of alternative strategies. JOURNAL OF MATERIALS CHEMISTRY. C 2021; 9:13557-13583. [PMID: 34745630 PMCID: PMC8515942 DOI: 10.1039/d1tc03256a] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Accepted: 09/13/2021] [Indexed: 06/01/2023]
Abstract
We present a review of the field of high-throughput virtual screening for organic electronics materials focusing on the sequence of methodological choices that determine each virtual screening protocol. These choices are present in all high-throughput virtual screenings and addressing them systematically will lead to optimised workflows and improve their applicability. We consider the range of properties that can be computed and illustrate how their accuracy can be determined depending on the quality and size of the experimental datasets. The approaches to generate candidates for virtual screening are also extremely varied and their relative strengths and weaknesses are discussed. The analysis of high-throughput virtual screening is almost never limited to the identification of top candidates and often new patterns and structure-property relations are the most interesting findings of such searches. The review reveals a very dynamic field constantly adapting to match an evolving landscape of applications, methodologies and datasets.
Collapse
Affiliation(s)
- Ömer H Omar
- Department of Chemistry, University of Liverpool Liverpool L69 3BX UK
| | - Marcos Del Cueto
- Department of Chemistry, University of Liverpool Liverpool L69 3BX UK
| | | | - Alessandro Troisi
- Department of Chemistry, University of Liverpool Liverpool L69 3BX UK
| |
Collapse
|
9
|
Jiang Y, Yang Z, Guo J, Li H, Liu Y, Guo Y, Li M, Pu X. Coupling complementary strategy to flexible graph neural network for quick discovery of coformer in diverse co-crystal materials. Nat Commun 2021; 12:5950. [PMID: 34642333 PMCID: PMC8511140 DOI: 10.1038/s41467-021-26226-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Accepted: 09/23/2021] [Indexed: 11/21/2022] Open
Abstract
Cocrystal engineering have been widely applied in pharmaceutical, chemistry and material fields. However, how to effectively choose coformer has been a challenging task on experiments. Here we develop a graph neural network (GNN) based deep learning framework to quickly predict formation of the cocrystal. In order to capture main driving force to crystallization from 6819 positive and 1052 negative samples reported by experiments, a feasible GNN framework is explored to integrate important prior knowledge into end-to-end learning on the molecular graph. The model is strongly validated against seven competitive models and three challenging independent test sets involving pharmaceutical cocrystals, π-π cocrystals and energetic cocrystals, exhibiting superior performance with accuracy higher than 96%, confirming its robustness and generalization. Furthermore, one new energetic cocrystal predicted is successfully synthesized, showcasing high potential of the model in practice. All the data and source codes are available at https://github.com/Saoge123/ccgnet for aiding cocrystal community.
Collapse
Affiliation(s)
- Yuanyuan Jiang
- College of Chemistry, Sichuan University, Chengdu, 610064, China
| | - Zongwei Yang
- Institute of Chemical Materials, China Academy of Engineering Physics, Mianyang, 621900, China
| | - Jiali Guo
- College of Chemistry, Sichuan University, Chengdu, 610064, China
| | - Hongzhen Li
- Institute of Chemical Materials, China Academy of Engineering Physics, Mianyang, 621900, China
| | - Yijing Liu
- College of Computer Science, Sichuan University, Chengdu, 610064, China
| | - Yanzhi Guo
- College of Chemistry, Sichuan University, Chengdu, 610064, China
| | - Menglong Li
- College of Chemistry, Sichuan University, Chengdu, 610064, China
| | - Xuemei Pu
- College of Chemistry, Sichuan University, Chengdu, 610064, China.
| |
Collapse
|
10
|
Zheng L, Zhu B, Wu Z, Liang F, Hong M, Liu G, Li W, Ren G, Tang Y. SMINBR: An Integrated Network and Chemoinformatics Tool Specialized for Prediction of Two-Component Crystal Formation. J Chem Inf Model 2021; 61:4290-4302. [PMID: 34436889 DOI: 10.1021/acs.jcim.1c00601] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Two-component crystals such as pharmaceutical cocrystals and salts have been proven as an effective strategy to improve physicochemical and biopharmaceutical properties of drugs. It is not easy to select proper molecular combinations to form two-component crystals. The network-based models have been successfully utilized to guide cocrystal design. Yet, the traditional social network-derived methods based on molecular-interaction topology information cannot directly predict interaction partners for new chemical entities (NCEs) that have not been observed to form two-component crystals. Herein, we proposed an effective tool, namely substructure-molecular-interaction network-based recommendation (SMINBR), to prioritize potential interaction partners for NCEs. This in silico tool incorporates network and chemoinformatics methods to bridge the gap between NCEs and known molecular-interaction network. The high performance of 10-fold cross validation and external validation shows the high accuracy and good generalization capability of the model. As a case study, top 10 recommended coformers for apatinib were all experimentally confirmed and a new apatinib cocrystal with paradioxybenzene was obtained. The predictive capability of the model attributes to its accordance with complementary patterns driving the formation of intermolecular interactions. SMINBR could automatically recommend new interaction partners for a target molecule, and would be an effective tool to guide cocrystal design. A free web server for SMINBR is available at http://lmmd.ecust.edu.cn/sminbr/.
Collapse
Affiliation(s)
- Lulu Zheng
- Laboratory of Molecular Modeling & Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Bin Zhu
- State Key Laboratory of Bioreactor Engineering; Engineering Research Centre of Pharmaceutical Process Chemistry, Ministry of Education; Laboratory of Pharmaceutical Crystal Engineering & Technology, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Zengrui Wu
- Laboratory of Molecular Modeling & Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Fang Liang
- State Key Laboratory of Bioreactor Engineering; Engineering Research Centre of Pharmaceutical Process Chemistry, Ministry of Education; Laboratory of Pharmaceutical Crystal Engineering & Technology, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Minghuang Hong
- State Key Laboratory of Bioreactor Engineering; Engineering Research Centre of Pharmaceutical Process Chemistry, Ministry of Education; Laboratory of Pharmaceutical Crystal Engineering & Technology, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Guixia Liu
- Laboratory of Molecular Modeling & Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Weihua Li
- Laboratory of Molecular Modeling & Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Guobin Ren
- State Key Laboratory of Bioreactor Engineering; Engineering Research Centre of Pharmaceutical Process Chemistry, Ministry of Education; Laboratory of Pharmaceutical Crystal Engineering & Technology, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Yun Tang
- Laboratory of Molecular Modeling & Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| |
Collapse
|