Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Mauri A. alvaDesc: A Tool to Calculate and Analyze Molecular Descriptors and Fingerprints. Methods in Pharmacology and Toxicology 2020. [DOI: 10.1007/978-1-0716-0150-1_32] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Number

Cited by Other Article(s)

Galvez-Llompart M, Hierrezuelo J, Blasco M, Zanni R, Galvez J, de Vicente A, Pérez-García A, Romero D. Targeting bacterial growth in biofilm conditions: rational design of novel inhibitors to mitigate clinical and food contamination using QSAR. J Enzyme Inhib Med Chem 2024;39:2330907. [PMID: 38651823 DOI: 10.1080/14756366.2024.2330907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Accepted: 03/06/2024] [Indexed: 04/25/2024] Open

Pore S, Pelloux A, Chatterjee M, Banerjee A, Roy K. Machine learning-based q-RASAR predictions of the bioconcentration factor of organic molecules estimated following the organisation for economic co-operation and development guideline 305. JOURNAL OF HAZARDOUS MATERIALS 2024;479:135725. [PMID: 39243539 DOI: 10.1016/j.jhazmat.2024.135725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 08/31/2024] [Accepted: 08/31/2024] [Indexed: 09/09/2024]

Liu Y, Yoshizawa AC, Ling Y, Okuda S. Insights into predicting small molecule retention times in liquid chromatography using deep learning. J Cheminform 2024;16:113. [PMID: 39375739 PMCID: PMC11460055 DOI: 10.1186/s13321-024-00905-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 09/13/2024] [Indexed: 10/09/2024] Open

Abstract

In untargeted metabolomics, structures of small molecules are annotated using liquid chromatography-mass spectrometry by leveraging information from the molecular retention time (RT) in the chromatogram and m/z (formerly called ''mass-to-charge ratio'') in the mass spectrum. However, correct identification of metabolites is challenging due to the vast array of small molecules. Therefore, various in silico tools for mass spectrometry peak alignment and compound prediction have been developed; however, the list of candidate compounds remains extensive. Accurate RT prediction is important to exclude false candidates and facilitate metabolite annotation. Recent advancements in artificial intelligence (AI) have led to significant breakthroughs in the use of deep learning models in various fields. Release of a large RT dataset has mitigated the bottlenecks limiting the application of deep learning models, thereby improving their application in RT prediction tasks. This review lists the databases that can be used to expand training datasets and concerns the issue about molecular representation inconsistencies in datasets. It also discusses the application of AI technology for RT prediction, particularly in the 5 years following the release of the METLIN small molecule RT dataset. This review provides a comprehensive overview of the AI applications used for RT prediction, highlighting the progress and remaining challenges. SCIENTIFIC CONTRIBUTION: This article focuses on the advancements in small molecule retention time prediction in computational metabolomics over the past five years, with a particular emphasis on the application of AI technologies in this field. It reviews the publicly available datasets for small molecule retention time, the molecular representation methods, the AI algorithms applied in recent studies. Furthermore, it discusses the effectiveness of these models in assisting with the annotation of small molecule structures and the challenges that must be addressed to achieve practical applications.

Collapse

Zheng JJ, Li QZ, Wang Z, Wang X, Zhao Y, Gao X. Computer-aided nanodrug discovery: recent progress and future prospects. Chem Soc Rev 2024;53:9059-9132. [PMID: 39148378 DOI: 10.1039/d3cs00575e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]

Kumar A, Ojha PK, Roy K. First report on regression-based QSAR addressing pesticide dissipation half-life in plants: A step towards sustainable public health. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024;954:176175. [PMID: 39270868 DOI: 10.1016/j.scitotenv.2024.176175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2024] [Revised: 08/03/2024] [Accepted: 09/08/2024] [Indexed: 09/15/2024]

Abstract

The excessive use of pesticides (an important group of chemicals) in the agricultural as well as public sectors raises a health concern. Pesticides affect humans and other living organisms via the food chain. Therefore, it is very necessary to calculate the dissipation half-life of pesticides in plants. Experimental prediction of pesticide dissipation half-lives requires complex environmental conditions, high cost, and a long time. Thus, in-silico half-life predictions are suitable and the best alternative. Herein, a total of six PLS (partial least squares) models namely, M1 (overall), M2 (fruit), M3 (plant interior), M4 (leaf), M5 (plant surface), and M6 (whole plant) alongside two MLR (multiple linear regression) models i.e. M7 (fruit surface) and model M8 (straw) were generated using dissipation half-lives (log10(T1/2)) of pesticides in plants and their different parts. Models were constructed in strict accordance with the guidelines outlined by the Organization for Economic Co-operation and Development (OECD) and extensively validated using globally accepted validation metrics (determination coefficient (R2) = 0.610-0.795, leave-one-out (LOO) cross-validated correlation coefficient (Q2LOO) = 0.520-0.660, MAE-FITTED TRAIN (mean absolute error fitted train) = 0.119-0.148, MAE-LOOTRAIN = 0.132-0.177, predictive R2 or Q2F1 = 0.538-0.567, Q2F2 = 0.500-0.565, MAETEST = 0.122-0.232), confirming their accuracy, reliability, predictivity, and robustness. Lipophilicity, the presence of a cyclomatic ring, suphur, aromatic amine fragments, and chlorine atom fragments are responsible (+ve contribution) for high dissipation half-lives of pesticides in plants. In contrast, hydrophilicity, pyrazine fragments, and rotatable bonds reduce (-ve negative contribution) the dissipation half-lives of pesticides in plants. To address the real-world applicability, the models were employed to screen the PPDB (Pesticide Properties Database) database, which revealed the top 10 pesticides with the highest log(T1/2) in the whole plant and respective parts of the plant body. The present work will aid in developing safer and novel pesticides, regulatory risk assessment, various risk assessments for the sustenance of public health, screening of databases, and data-gap filling.

Collapse

Xi R, Liu H, Liu X, Zhao X. Predicting and screening high-performance polyimide membranes using negative correlation based deep ensemble methods. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2024;16:5845-5863. [PMID: 39145470 DOI: 10.1039/d4ay01160k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/16/2024]

Abstract

Polyimide polymer membranes have become critical materials in gas separation and storage applications due to their high selectivity and excellent permeability. However, with over 107 known types of polyimides, relying solely on experimental research means potential high-performance candidates are likely to be overlooked. This study employs a deep learning method optimized by negative correlation ensemble techniques to predict the gas permeability and selectivity of polyimide structures, enabling rapid and efficient material screening. We propose a deep neural network model based on negative correlation deep ensemble methods (DNN-NCL), using Morgan molecular fingerprints as input. The DNN-NCL model achieves an R2 value of approximately 0.95 on the test set, which is a 4% improvement over recent model performance, and effectively mitigates overfitting with a maximum discrepancy of less than 0.03 between the training and test sets. High-throughput screening of over 8 million hypothetical polymers identified hundreds of promising candidates for gas separation membranes, with 14 structures exceeding the Robeson upper bound for CO2/N2 separation. Visualization of high-throughput predictions shows that although the Robeson upper bound was never explicitly used as a model constraint, the majority of predictions are compressed below this limit, demonstrating the deep learning model's ability to reflect real-world physical conditions. Reverse analysis of model predictions using SHAP analysis achieved interpretability of the deep learning model's predictions and identified three key functional groups deemed important by the deep neural network for gas permeability: carbonyl, thiophene, and ester groups. This established a bridge between the structure and properties of polyimide materials. Additionally, we confirmed that two polyimide structures predicted by the model to have excellent CO2/N2 selectivity, namely 6-methylpyrimidin-5-amine and 1,4,5,6-tetrahydropyrimidin-2-amine, have been experimentally validated in previous studies. This research demonstrates the feasibility of using deep learning methods to explore the vast chemical space of polyimides, providing a powerful tool for discovering high-performance gas separation membranes.

Collapse

Beck AG, Fine J, Aggarwal P, Regalado EL, Levorse D, De Jesus Silva J, Sherer EC. Machine learning models and performance dependency on 2D chemical descriptor space for retention time prediction of pharmaceuticals. J Chromatogr A 2024;1730:465109. [PMID: 38968662 DOI: 10.1016/j.chroma.2024.465109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 06/17/2024] [Accepted: 06/18/2024] [Indexed: 07/07/2024]

Abstract

The predictive modeling of liquid chromatography methods can be an invaluable asset, potentially saving countless hours of labor while also reducing solvent consumption and waste. Tasks such as physicochemical screening and preliminary method screening systems where large amounts of chromatography data are collected from fast and routine operations are particularly well suited for both leveraging large datasets and benefiting from predictive models. Therefore, the generation of predictive models for retention time is an active area of development. However, for these predictive models to gain acceptance, researchers first must have confidence in model performance and the computational cost of building them should be minimal. In this study, a simple and cost-effective workflow for the development of machine learning models to predict retention time using only Molecular Operating Environment 2D descriptors as input for support vector regression is developed. Furthermore, we investigated the relative performance of models based on molecular descriptor space by utilizing uniform manifold approximation and projection and clustering with Gaussian mixture models to identify chemically distinct clusters. Results outlined herein demonstrate that local models trained on clusters in chemical space perform equivalently when compared to models trained on all data. Through 10-fold cross-validation on a comprehensive set containing 67,950 of our company's proprietary analytes, these models achieved coefficients of determination of 0.84 and 3 % error in terms of retention time. This promising statistical significance is found to translate from cross-validation to prospective prediction on an external test set of pharmaceutically relevant analytes. The observed equivalency of global and local modeling of large datasets is retained with METLIN's SMRT dataset, thereby confirming the wider applicability of the developed machine learning workflows for global models.

Collapse

Khan AU, Porta GM, Riva M, Guadagnini A. In-silico mechanistic analysis of adsorption of Iodinated Contrast Media agents on graphene surface. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2024;280:116506. [PMID: 38875817 DOI: 10.1016/j.ecoenv.2024.116506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Revised: 05/08/2024] [Accepted: 05/22/2024] [Indexed: 06/16/2024]

Abstract

The study aims at assessing the potential of graphene-based adsorbents to reduce environmental impacts of Iodinated Contrast Media Agents (ICMs). We analyze an extensive collection of ICMs. A modeling approach resting on molecular docking and Density Functional Theory simulations is employed to examine the adsorption process at the molecular level. The study also relies on a Quantitative Structure-Activity Relationship (QSAR) modeling framework to correlate molecular properties with the adsorption energy (Ead) of ICMs, thus enabling identification of the key mechanisms underpinning adsorption and of the key factors contributing to it. A collection of distinct QSAR-based models is developed upon relying on Multiple Linear Regression and a standard genetic algorithm method. Having at our disposal multiple models enables us to take into account the uncertainty associated with model formulation. Maximum Likelihood and formal model identification/discrimination criteria (such as Bayesian and/or information theoretic criteria) are then employed to complement the traditional QSAR modeling phase. This has the advantage of (a) providing a rigorous ranking of the alternative models included in the selected set and (b) quantifying the relative degree of likelihood of each of these models through a weight or posterior probability. The resulting workflow of analysis enables one to seamlessly embed DFT and QSAR studies within a theoretical framework of analysis that explicitly takes into account model and parameter uncertainty. Our results suggest that graphene-based surfaces constitute a promising adsorbent for ICMs removal, π-π stacking being the primary mechanism behind ICM adsorption. Furthermore, our findings offer valuable insights into the potential of graphene-based adsorbent materials for effectively removing ICMs from water systems. They contribute to ascertain the significance of various factors (such as, e.g., the distribution of atomic van der Waals volumes, overall molecular complexity, the presence and arrangement of Iodine atoms, and the presence of polar functional groups) on the adsorption process.

Collapse

de Sousa NF, Duarte GD, Moraes CB, Barbosa CG, Martin HJ, Muratov NN, do Nascimento YM, Scotti L, de Freitas-Júnior LHG, Filho JMB, Scotti MT. In Silico and In Vitro Studies of Terpenes from the Fabaceae Family Using the Phenotypic Screening Model against the SARS-CoV-2 Virus. Pharmaceutics 2024;16:912. [PMID: 39065609 PMCID: PMC11279753 DOI: 10.3390/pharmaceutics16070912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Revised: 07/02/2024] [Accepted: 07/02/2024] [Indexed: 07/28/2024] Open

Affiliation(s)

Natália Ferreira de Sousa Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba, João Pessoa 58051-900, Brazil; (N.F.d.S.); (Y.M.d.N.); (L.S.); (J.M.B.F.)
Gabrielly Diniz Duarte Postgraduate Program in Development and Innovation of Drugs and Medicines, Federal University of Paraíba, João Pessoa 58051-900, Brazil;
Carolina Borsoi Moraes Institute of Biomedical Sciences, University of São Paulo (ICB-USP), São Paulo 05508-000, Brazil; (C.B.M.); (C.G.B.); (L.H.G.d.F.-J.)
Cecília Gomes Barbosa Institute of Biomedical Sciences, University of São Paulo (ICB-USP), São Paulo 05508-000, Brazil; (C.B.M.); (C.G.B.); (L.H.G.d.F.-J.)
Holli-Joi Martin Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA;
Nail N. Muratov Department of Chemical Technology, Odessa National Polytechnic University, 65000 Odessa, Ukraine; A. V. Bogatsky Physical-Chemical Institute of NASU, 65047 Odessa, Ukraine
Yuri Mangueira do Nascimento Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba, João Pessoa 58051-900, Brazil; (N.F.d.S.); (Y.M.d.N.); (L.S.); (J.M.B.F.)
Luciana Scotti Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba, João Pessoa 58051-900, Brazil; (N.F.d.S.); (Y.M.d.N.); (L.S.); (J.M.B.F.)
Lúcio Holanda Gondim de Freitas-Júnior Institute of Biomedical Sciences, University of São Paulo (ICB-USP), São Paulo 05508-000, Brazil; (C.B.M.); (C.G.B.); (L.H.G.d.F.-J.)
José Maria Barbosa Filho Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba, João Pessoa 58051-900, Brazil; (N.F.d.S.); (Y.M.d.N.); (L.S.); (J.M.B.F.)
Marcus Tullius Scotti Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba, João Pessoa 58051-900, Brazil; (N.F.d.S.); (Y.M.d.N.); (L.S.); (J.M.B.F.)

Collapse

Nath A, Ojha PK, Roy K. Modelling lethality and teratogenicity of zebrafish (Danio rerio) due to β-lactam antibiotics employing the QSTR approach. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2024;35:565-589. [PMID: 39069787 DOI: 10.1080/1062936x.2024.2378797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Accepted: 07/07/2024] [Indexed: 07/30/2024]

Gao H, Li S, Lan Z, Pan D, Naidu GS, Peer D, Ye C, Chen H, Ma M, Liu Z, Santos HA. Comparative optimization of polysaccharide-based nanoformulations for cardiac RNAi therapy. Nat Commun 2024;15:5398. [PMID: 38926348 PMCID: PMC11208445 DOI: 10.1038/s41467-024-49804-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Accepted: 06/19/2024] [Indexed: 06/28/2024] Open

Affiliation(s)

Han Gao Department of Biomaterials and Biomedical Technology, University Medical Center Groningen (UMCG), The Personalized Medicine Research Institute (PRECISION), University of Groningen, Ant. Deusinglaan 1, Groningen, 9713 AV, The Netherlands Drug Research Program, Division of Pharmaceutical Chemistry and Technology, Faculty of Pharmacy, University of Helsinki, Helsinki, FI-00014, Finland
Sen Li Department of Vascular Surgery, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310009, China
Zhengyi Lan Shanghai Institute of Ceramics, Chinese Academy of Sciences, Shanghai, 200050, China
Da Pan Key Laboratory of Environmental Medicine and Engineering of Ministry of Education, and Department of Nutrition and Food Hygiene, School of Public Health, Southeast University, Nanjing, 210009, China
Gonna Somu Naidu Laboratory of Precision Nanomedicine, Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, 69978, Israel Department of Materials Sciences and Engineering, Iby and Aladar Fleischman Faculty of Engineering, Tel Aviv University, Tel Aviv, 69978, Israel Center for Nanoscience and Nanotechnology, Tel Aviv University, Tel Aviv, 69978, Israel Cancer Biology Research Center, Tel Aviv University, Tel Aviv, 69978, Israel
Dan Peer Laboratory of Precision Nanomedicine, Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, 69978, Israel Department of Materials Sciences and Engineering, Iby and Aladar Fleischman Faculty of Engineering, Tel Aviv University, Tel Aviv, 69978, Israel Center for Nanoscience and Nanotechnology, Tel Aviv University, Tel Aviv, 69978, Israel Cancer Biology Research Center, Tel Aviv University, Tel Aviv, 69978, Israel
Chenyi Ye Department of Orthopedic Surgery, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310009, China
Hangrong Chen Shanghai Institute of Ceramics, Chinese Academy of Sciences, Shanghai, 200050, China
Ming Ma Shanghai Institute of Ceramics, Chinese Academy of Sciences, Shanghai, 200050, China.
Zehua Liu Department of Biomaterials and Biomedical Technology, University Medical Center Groningen (UMCG), The Personalized Medicine Research Institute (PRECISION), University of Groningen, Ant. Deusinglaan 1, Groningen, 9713 AV, The Netherlands. Drug Research Program, Division of Pharmaceutical Chemistry and Technology, Faculty of Pharmacy, University of Helsinki, Helsinki, FI-00014, Finland.
Hélder A Santos Department of Biomaterials and Biomedical Technology, University Medical Center Groningen (UMCG), The Personalized Medicine Research Institute (PRECISION), University of Groningen, Ant. Deusinglaan 1, Groningen, 9713 AV, The Netherlands. Drug Research Program, Division of Pharmaceutical Chemistry and Technology, Faculty of Pharmacy, University of Helsinki, Helsinki, FI-00014, Finland.

Collapse

Gutkin E, Gusev F, Gentile F, Ban F, Koby SB, Narangoda C, Isayev O, Cherkasov A, Kurnikova MG. In silico screening of LRRK2 WDR domain inhibitors using deep docking and free energy simulations. Chem Sci 2024;15:8800-8812. [PMID: 38873063 PMCID: PMC11168082 DOI: 10.1039/d3sc06880c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 04/10/2024] [Indexed: 06/15/2024] Open

Das S, Samal A, Ojha PK. Chemometrics-driven prediction and prioritization of diverse pesticides on chickens for addressing hazardous effects on public health. JOURNAL OF HAZARDOUS MATERIALS 2024;471:134326. [PMID: 38636230 DOI: 10.1016/j.jhazmat.2024.134326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 04/09/2024] [Accepted: 04/15/2024] [Indexed: 04/20/2024]

de Cripan SM, Arora T, Olomí A, Canela N, Siuzdak G, Domingo-Almenara X. Predicting the Predicted: A Comparison of Machine Learning-Based Collision Cross-Section Prediction Models for Small Molecules. Anal Chem 2024;96:9088-9096. [PMID: 38783786 PMCID: PMC11154685 DOI: 10.1021/acs.analchem.4c00630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 05/09/2024] [Accepted: 05/10/2024] [Indexed: 05/25/2024]

Affiliation(s)

Sara M. de Cripan Computational Metabolomics for Systems Biology Lab, Eurecat—Technology Centre of Catalonia, Barcelona 08005, Catalonia, Spain Centre for Omics Sciences (COS), Unique Scientific and Technical Infrastructures (ICTS), Eurecat—Technology Centre of Catalonia & Rovira i Virgili University Joint Unit, Reus 43204, Catalonia, Spain Department of Electrical, Electronic and Control Engineering (DEEEA), Universitat Rovira i Virgili, Tarragona 43007, Catalonia, Spain
Trisha Arora Computational Metabolomics for Systems Biology Lab, Eurecat—Technology Centre of Catalonia, Barcelona 08005, Catalonia, Spain Centre for Omics Sciences (COS), Unique Scientific and Technical Infrastructures (ICTS), Eurecat—Technology Centre of Catalonia & Rovira i Virgili University Joint Unit, Reus 43204, Catalonia, Spain Department of Electrical, Electronic and Control Engineering (DEEEA), Universitat Rovira i Virgili, Tarragona 43007, Catalonia, Spain
Adrià Olomí Computational Metabolomics for Systems Biology Lab, Eurecat—Technology Centre of Catalonia, Barcelona 08005, Catalonia, Spain Centre for Omics Sciences (COS), Unique Scientific and Technical Infrastructures (ICTS), Eurecat—Technology Centre of Catalonia & Rovira i Virgili University Joint Unit, Reus 43204, Catalonia, Spain
Núria Canela Centre for Omics Sciences (COS), Unique Scientific and Technical Infrastructures (ICTS), Eurecat—Technology Centre of Catalonia & Rovira i Virgili University Joint Unit, Reus 43204, Catalonia, Spain
Gary Siuzdak Scripps Center of Metabolomics and Mass Spectrometry, Department of Chemistry, Molecular and Computational Biology, Scripps Research Institute, La Jolla, California 92037, United States
Xavier Domingo-Almenara Computational Metabolomics for Systems Biology Lab, Eurecat—Technology Centre of Catalonia, Barcelona 08005, Catalonia, Spain Centre for Omics Sciences (COS), Unique Scientific and Technical Infrastructures (ICTS), Eurecat—Technology Centre of Catalonia & Rovira i Virgili University Joint Unit, Reus 43204, Catalonia, Spain Department of Electrical, Electronic and Control Engineering (DEEEA), Universitat Rovira i Virgili, Tarragona 43007, Catalonia, Spain

Collapse

Ghosh S, Roy K. Quantitative read-across structure-activity relationship (q-RASAR): A novel approach to estimate the subchronic oral safety (NOAEL) of diverse organic chemicals in rats. Toxicology 2024;505:153824. [PMID: 38705560 DOI: 10.1016/j.tox.2024.153824] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 04/28/2024] [Accepted: 04/29/2024] [Indexed: 05/07/2024]

Bhattacharjee A, Kar S, Ojha PK. Unveiling G-protein coupled receptor kinase-5 inhibitors for chronic degenerative diseases: Multilayered prioritization employing explainable machine learning-driven multi-class QSAR, ligand-based pharmacophore and free energy-inspired molecular simulation. Int J Biol Macromol 2024;269:131784. [PMID: 38697440 DOI: 10.1016/j.ijbiomac.2024.131784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 04/02/2024] [Accepted: 04/21/2024] [Indexed: 05/05/2024]

Cheng Z, Aitha M, Thomas CA, Sturgill A, Fairweather M, Hu A, Bethel CR, Rivera DD, Dranchak P, Thomas PW, Li H, Feng Q, Tao K, Song M, Sun N, Wang S, Silwal SB, Page RC, Fast W, Bonomo RA, Weese M, Martinez W, Inglese J, Crowder MW. Machine Learning Models Identify Inhibitors of New Delhi Metallo-β-lactamase. J Chem Inf Model 2024;64:3977-3991. [PMID: 38727192 PMCID: PMC11129921 DOI: 10.1021/acs.jcim.3c02015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2024]

Affiliation(s)

Zishuo Cheng Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Mahesh Aitha Division of Preclinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD 20850, USA
Caitlyn A. Thomas Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Aidan Sturgill Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Mitch Fairweather Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Amy Hu Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Christopher R. Bethel Research Service, Louis Stokes Cleveland Department of Veterans Affairs Medical Center, Cleveland, OH 44106, USA
Dann D. Rivera Division of Chemical Biology and Medicinal Chemistry, College of Pharmacy, University of Texas, Austin, TX 78712, USA
Patricia Dranchak Division of Preclinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD 20850, USA
Pei W. Thomas Division of Chemical Biology and Medicinal Chemistry, College of Pharmacy, University of Texas, Austin, TX 78712, USA
Han Li Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Qi Feng Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Kaicheng Tao Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Minshuai Song Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Na Sun Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Shuo Wang Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Surendra Bikram Silwal Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Richard C. Page Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Walt Fast Division of Chemical Biology and Medicinal Chemistry, College of Pharmacy, University of Texas, Austin, TX 78712, USA
Robert A. Bonomo Research Service, Louis Stokes Cleveland Department of Veterans Affairs Medical Center, Cleveland, OH 44106, USA Departments of Medicine, Biochemistry, Molecular Biology and Microbiology, Pharmacology, and Proteomics and Bioinformatics, Case Western Reserve University School of Medicine, Cleveland, OH 44106, USA Clinician Scientist Investigator, Louis Stokes Cleveland Department of Veterans Affairs Medical Center, Cleveland, OH 44106, USA CWRU-Cleveland VAMC Center for Antimicrobial Resistance and Epidemiology (Case VA CARES) Cleveland, OH 44106, USA
Maria Weese Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
Waldyn Martinez Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA
James Inglese Division of Preclinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD 20850, USA Metabolic Medicine Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20817, USA
Michael W. Crowder Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA

Collapse

Kumar A, Ojha PK, Roy K. The first report on the assessment of maximum acceptable daily intake (MADI) of pesticides for humans using intelligent consensus predictions. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2024;26:870-881. [PMID: 38652036 DOI: 10.1039/d4em00059e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2024]

Vigna V, Cova TFGG, Nunes SCC, Pais AACC, Sicilia E. Machine Learning-Based Prediction of Reduction Potentials for Pt^IV Complexes. J Chem Inf Model 2024;64:3733-3743. [PMID: 38683970 DOI: 10.1021/acs.jcim.4c00315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2024]

Abstract

Some of the well-known drawbacks of clinically approved PtII complexes can be overcome using six-coordinate PtIV complexes as inert prodrugs, which release the corresponding four-coordinate active PtII species upon reduction by cellular reducing agents. Therefore, the key factor of PtIV prodrug mechanism of action is their tendency to be reduced which, when the involved mechanism is of outer-sphere type, is measured by the value of the reduction potential. Machine learning (ML) models can be used to effectively capture intricate relationships within PtIV complex data, leading to highly accurate predictions of reduction potentials and other properties, and offering significant insights into their electrochemical behavior and potential applications. In this study, a machine learning-based approach for predicting the reduction potentials of PtIV complexes based on relevant molecular descriptors is presented. Leveraging a data set of experimentally determined reduction potentials and a diverse range of molecular descriptors, the proposed model demonstrates remarkable predictive accuracy (MSE = 0.016 V2, RMSE = 0.13 V, R2 = 0.92). Ab initio calculations and a set of different machine learning algorithms and feature engineering techniques have been employed to systematically explore the relationship between molecular structure and similarity and reduction potential. Specifically, it has been investigated whether the reduction potential of these compounds can be described by combining ML models across different combinations of constitutional, topological, and electronic molecular descriptors. Our results not only provide insights into the crucial factors influencing reduction potentials but also offer a rapid and effective tool for the rational design of PtIV complexes with tailored electrochemical properties for pharmaceutical applications. This approach has the potential to significantly expedite the development and screening of novel PtIV prodrug candidates. The analysis of principal components and key features extracted from the model highlights the significance of structural descriptors of the 2D Atom Pairs type and the lowest unoccupied molecular orbital energy. Specifically, with just 20 appropriately selected descriptors, a notable separation of complexes based on their reduction potential value is achieved.

Collapse

Ghosh S, Chhabria MT, Roy K. Chemometric modeling of pharmaceuticals for partitioning between sludge and aqueous phase during the wastewater treatment process. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024:10.1007/s11356-024-33261-6. [PMID: 38607482 DOI: 10.1007/s11356-024-33261-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Accepted: 04/05/2024] [Indexed: 04/13/2024]

Abstract

Computational techniques, such as quantitative structure-property relationships (QSPRs), can play a significant role in exploring the important chemical features essential for the degree of sorption or sludge/water partition coefficient (Kd) towards sewage sludge of wastewater treatment process to evaluate the environmental consequence and risk of pharmaceuticals. The current research work aims to construct a predictive QSPR model for the sorption of 148 diverse active pharmaceutical ingredients (APIs) in sewage sludge during wastewater treatment. For the development of the model, we employed easily computable 2D descriptors as independent variables. The model has been developed following the Organization for Economic Cooperation and Development's (OECD) guidelines. It has undergone internal and external validation using a variety of methodologies, as well as been tested for its applicability domain. A measure of hydrophobicity, i.e., MLOGP2, showed the most promising contribution in modeling the sorption coefficient of APIs. Among other parameters, the number of tertiary aromatic amines, the presence of electronegative atoms like N, O, and Cl, the size of a molecule, the number of aromatic hydroxyl groups, the presence of substituted aromatic nitrogen atoms and alkyl-substituted tertiary carbon atoms were also found to be influential for the regulation of solid water partition coefficient of APIs during the wastewater treatment process. The statistical validity tests performed on the developed partial least squares (PLS) model showed that it is statistically evident, robust, and predictive (R2Train = 0.750, Q2LOO = 0.683, Q2F1 = 0.655, Q2F2 (or R2Test) = 0.651). In addition, the predictivity of the constructed model was further inspected by using the "prediction reliability indicator" tool for 14 external APIs.

Collapse

Mauri A, Bertola M. AlvaBuilder: A Software for De Novo Molecular Design. J Chem Inf Model 2024;64:2136-2142. [PMID: 37399048 PMCID: PMC11005826 DOI: 10.1021/acs.jcim.3c00610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Indexed: 07/04/2023]

Obradović D, Stavrianidi A, Fedorova E, Bogojević A, Shpigun O, Buryak A, Lazović S. A comparative study of the predictive performance of different descriptor calculation tools: Molecular-based elution order modeling and interpretation of retention mechanism for isomeric compounds from METLIN database. J Chromatogr A 2024;1719:464731. [PMID: 38377661 DOI: 10.1016/j.chroma.2024.464731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 02/08/2024] [Accepted: 02/09/2024] [Indexed: 02/22/2024]

Abstract

In the pharmaceutical industry, the need for analytical standards is a bottleneck for comprehensive evaluation and quality control of intermediate and end products. These are complex mixtures containing structurally related molecules. In this regard, chromatographic peak annotation, especially for critical pairs of isomers and closest structural analogs, can be supported by using a Quantitative Structure Retention Relationship (QSRR) approach. In our study, we investigated the fundamental basis of the reversed-phase (RP) retention mechanism for 1141 isomeric compounds from the METLIN SMRT dataset. Nine different descriptor calculation tools combined with different feature selection methods (genetic algorithm (GA), stepwise, Boruta) and machine learning (ML) approaches (support vector machine (SVM), multiple linear regression (MLR), random forest (RF), XGBoost) were applied to provide a reliable molecular structure-based interpretation of RP retention behaviour of the isomeric compounds. Strict internal and external validation metrics were used to select models with the best predictive capabilities (rtest > 0.73, order of elution > 60 %). For the developed models, mean absolute errors were in the range of 60 to 110 s. Stepwise and GA showed the most suitable performance as descriptor selection methods, while SVM and XGBoost modeling gave satisfactory predictive characteristics in most cases. Validation performed on the published experimental data for structurally related pharmaceutical compounds confirmed the best accuracy of MLR modeling in combination with GA feature selection of general physico-chemical properties. The resulting models will be useful for the prediction of separation and identification of structurally related compounds in pharmaceutical analysis, providing a simultaneous understanding of the interaction mechanisms leading to their retention under RP conditions.

Collapse

Zdybel S, Sosnowska A, Kowalska D, Sommer J, Conrady B, Mester P, Gromelski M, Puzyn T. Hybrid Machine Learning and Experimental Studies of Antiviral Potential of Ionic Liquids against P100, MS2, and Phi6. J Chem Inf Model 2024;64:1996-2007. [PMID: 38452014 DOI: 10.1021/acs.jcim.3c02037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]

Abstract

Viruses are a group of widespread organisms that are often responsible for very dangerous diseases, as most of them follow a mechanism to multiply and infect their hosts as quickly as possible. Pathogen viruses also mutate regularly, with the result that measures to prevent virus transmission and recover from the disease caused are often limited. The development of new substances is very time-consuming and highly budgeted and requires the sacrifice of many living organisms. Computational chemistry methods allow faster analysis at a much lower cost and, most importantly, reduce the number of living organisms sacrificed experimentally to a minimum. Ionic liquids (ILs) are a group of chemical compounds that could potentially find a wide range of applications due to their potential virucidal activity. In our study, we conducted a complex computational analysis to predict the antiviral activity of ionic liquids against three surrogate viruses: two nonenveloped viruses, Listeria monocytogenes phage P100 and Escherichia coli phage MS2, and one enveloped virus, Pseudomonas syringae phage Phi6. Based on experimental data of toxic activity (logEC90), we assigned activity classes to 154 ILs. Prediction models were created and validated according to the Organization for Economic Co-operation and Development (OECD) recommendations using the Classification Tree method. Further, we performed an external validation of our models through virtual screening on a set of 1277 theoretically generated ionic liquids and then selected 10 active ionic liquids, which were synthesized to verify their activity against the analyzed viruses. Our study proved the effectiveness and efficiency of computational methods to predict the antiviral activity of ionic liquids. Thus, computational models are a cost-effective alternative approach compared with time-consuming experimental studies where live animals are involved.

Collapse

Li W, Wen Y, Wang K, Ding Z, Wang L, Chen Q, Xie L, Xu H, Zhao H. Developing a machine learning model for accurate nucleoside hydrogels prediction based on descriptors. Nat Commun 2024;15:2603. [PMID: 38521777 PMCID: PMC10960799 DOI: 10.1038/s41467-024-46866-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Accepted: 03/13/2024] [Indexed: 03/25/2024] Open

Affiliation(s)

Weiqi Li State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Research Unit of Oral Carcinogenesis and Management, Chinese Academy of Medical Sciences, West China Hospital of Stomatology, Sichuan University, Chengdu, Sichuan, 610041, PR China
Yinghui Wen State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Research Unit of Oral Carcinogenesis and Management, Chinese Academy of Medical Sciences, West China Hospital of Stomatology, Sichuan University, Chengdu, Sichuan, 610041, PR China
Kaichao Wang State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Research Unit of Oral Carcinogenesis and Management, Chinese Academy of Medical Sciences, West China Hospital of Stomatology, Sichuan University, Chengdu, Sichuan, 610041, PR China
Zihan Ding State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Research Unit of Oral Carcinogenesis and Management, Chinese Academy of Medical Sciences, West China Hospital of Stomatology, Sichuan University, Chengdu, Sichuan, 610041, PR China
Lingfeng Wang State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Research Unit of Oral Carcinogenesis and Management, Chinese Academy of Medical Sciences, West China Hospital of Stomatology, Sichuan University, Chengdu, Sichuan, 610041, PR China
Qianming Chen State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Research Unit of Oral Carcinogenesis and Management, Chinese Academy of Medical Sciences, West China Hospital of Stomatology, Sichuan University, Chengdu, Sichuan, 610041, PR China
Liang Xie State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Research Unit of Oral Carcinogenesis and Management, Chinese Academy of Medical Sciences, West China Hospital of Stomatology, Sichuan University, Chengdu, Sichuan, 610041, PR China.
Hao Xu State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Research Unit of Oral Carcinogenesis and Management, Chinese Academy of Medical Sciences, West China Hospital of Stomatology, Sichuan University, Chengdu, Sichuan, 610041, PR China.
Hang Zhao State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Research Unit of Oral Carcinogenesis and Management, Chinese Academy of Medical Sciences, West China Hospital of Stomatology, Sichuan University, Chengdu, Sichuan, 610041, PR China.

Collapse

Erickson M, Casañola-Martin G, Han Y, Rasulev B, Kilin D. Relationships between the Photodegradation Reaction Rate and Structural Properties of Polymer Systems. J Phys Chem B 2024;128:2190-2200. [PMID: 38386478 DOI: 10.1021/acs.jpcb.3c06854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2024]

Kumar A, Ojha PK, Roy K. First report on pesticide sub-chronic and chronic toxicities against dogs using QSAR and chemical read-across. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2024;35:241-263. [PMID: 38390626 DOI: 10.1080/1062936x.2024.2320143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Accepted: 02/12/2024] [Indexed: 02/24/2024]

Chatterjee M, Roy K. Predictive binary mixture toxicity modeling of fluoroquinolones (FQs) and the projection of toxicity of hypothetical binary FQ mixtures: a combination of 2D-QSAR and machine-learning approaches. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2024;26:105-118. [PMID: 38073518 DOI: 10.1039/d3em00445g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2024]

Abstract

All sorts of chemicals get degraded under various environmental stresses, and the degradates coexist with the parent compounds as mixtures in the environment. Antibiotics emerge as an additional concern due to the bioactive nature of both the parent compound and degradation products and their combined exposure to the environment. Therefore, environmental risk assessment of antibiotics and their degradation products is very much necessary. In this direction, we made use of in silico new approach methodologies (NAMs) and machine-learning algorithms. In this study, we have developed a robust and predictive mixture-quantitative structure-activity relationship (QSAR) model with promising quality and predictability (internal: MAETrain = 0.085, QLOO2 = 0.849, external: MAETest = 0.090, and QF12 = 0.859) for predicting the toxicity of the mixtures of a class of antibiotics and their degradation products. To obtain the predictive model, toxicity data of 78 binary fluoroquinolone mixtures in E. coli (endpoint: log 1/IC50 in molar) have been utilized. We have used only 0D-2D descriptors to efficiently encode the structural features of mixture components without any additional complexities. The optimization of the class of mixture descriptors has been performed in this study by using three different mixing rules (linear combination of molecular contributions, the squared molecular contributions, and the norm of molecular contributions). Different machine-learning approaches namely, random forest (RF), ada boost, gradient boost (GB), extreme gradient boost (XGB), support vector machine (SVM), linear support vector machine (LSVM), and ridge regression (RR) have been employed here apart from the conventional partial least squares (PLS) regression to optimize the modeling approach. A rigorous validation protocol has been used for assessing the goodness-of-fit, robustness, and external predictability of the models. Finally, the toxicity of possible untested mixtures of different photodegradation products of fluoroquinolones has been predicted using the best model reported in this study.

Collapse

Song Z, Chen J, Cheng J, Chen G, Qi Z. Computer-Aided Molecular Design of Ionic Liquids as Advanced Process Media: A Review from Fundamentals to Applications. Chem Rev 2024;124:248-317. [PMID: 38108629 DOI: 10.1021/acs.chemrev.3c00223] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]

Ghosh V, Bhattacharjee A, Kumar A, Ojha PK. q-RASTR modelling for prediction of diverse toxic chemicals towards T. pyriformis. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2024;35:11-30. [PMID: 38193248 DOI: 10.1080/1062936x.2023.2298452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 12/16/2023] [Indexed: 01/10/2024]

Pandey SK, Roy K. Development of a read-across-derived classification model for the predictions of mutagenicity data and its comparison with traditional QSAR models and expert systems. Toxicology 2023;500:153676. [PMID: 37993082 DOI: 10.1016/j.tox.2023.153676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 11/06/2023] [Accepted: 11/17/2023] [Indexed: 11/24/2023]

Abstract

Mutagenicity is considered an important endpoint from the regulatory, environmental and medical points of view. Due to the wide number of compounds that may be of concern and the enormous expenses (in terms of time, money, and animals) associated with rodent mutagenicity bioassays, this endpoint is a major target for the development of alternative approaches for screening and prediction. The majority of old-aged expert systems and quantitative structure-activity relationship (QSAR) models may show reduced performance over time for their application on newer chemical candidates; thus, researchers constantly try to improve the modeling strategies. In our report, we initially performed traditional classification-based linear discriminant analysis (LDA) QSAR modeling using the benchmark Ames dataset of diverse chemicals (6512 compounds) to recognize the relationship between the molecules and their potential mutagenic behavior. The classical LDA QSAR model is developed from a selected set of 2D descriptors. The LDA QSAR model was developed by using a total of 31 descriptors identified from the analysis of the most discriminating features. Additionally, we have used similarity-derived features obtained from the read-across (RA) to develop an RA-based QSAR model. The developed RA-based LDA QSAR model has better predictivity, transferability, and interpretability compared to the LDA QSAR model, and it uses a very small number of descriptors compared to the classical QSAR model. Different machine learning (ML) models were also developed using the descriptors appearing in the read-across-based LDA QSAR model for comparative studies. We have checked the prediction quality of 216 true external set compounds using the novel similarity-derived RA model. The performance of the OECD toolbox is also compared with the RA-derived LDA QSAR model for a true external set. The current study aimed to explore the significance of the read-across-based algorithm and its application to the most current experimental mutagenicity data to complement already available expert systems.

Collapse

Ghosh S, Chatterjee M, Roy K. Quantitative Read-across structure-activity relationship (q-RASAR): A new approach methodology to model aquatic toxicity of organic pesticides against different fish species. AQUATIC TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2023;265:106776. [PMID: 38006764 DOI: 10.1016/j.aquatox.2023.106776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 11/17/2023] [Accepted: 11/19/2023] [Indexed: 11/27/2023]

McGibbon M, Shave S, Dong J, Gao Y, Houston DR, Xie J, Yang Y, Schwaller P, Blay V. From intuition to AI: evolution of small molecule representations in drug discovery. Brief Bioinform 2023;25:bbad422. [PMID: 38033290 PMCID: PMC10689004 DOI: 10.1093/bib/bbad422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 10/13/2023] [Accepted: 11/01/2023] [Indexed: 12/02/2023] Open

Nath A, Ojha PK, Roy K. QSAR assessment of aquatic toxicity potential of diverse agrochemicals. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2023:1-20. [PMID: 37941423 DOI: 10.1080/1062936x.2023.2278074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Accepted: 10/24/2023] [Indexed: 11/10/2023]

Keefer CE, Chang G, Di L, Woody NA, Tess DA, Osgood SM, Kapinos B, Racich J, Carlo AA, Balesano A, Ferguson N, Orozco C, Zueva L, Luo L. The Comparison of Machine Learning and Mechanistic In Vitro-In Vivo Extrapolation Models for the Prediction of Human Intrinsic Clearance. Mol Pharm 2023;20:5616-5630. [PMID: 37812508 DOI: 10.1021/acs.molpharmaceut.3c00502] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/11/2023]

Abstract

Accurate prediction of human pharmacokinetics (PK) remains one of the key objectives of drug metabolism and PK (DMPK) scientists in drug discovery projects. This is typically performed by using in vitro-in vivo extrapolation (IVIVE) based on mechanistic PK models. In recent years, machine learning (ML), with its ability to harness patterns from previous outcomes to predict future events, has gained increased popularity in application to absorption, distribution, metabolism, and excretion (ADME) sciences. This study compares the performance of various ML and mechanistic models for the prediction of human IV clearance for a large (645) set of diverse compounds with literature human IV PK data, as well as measured relevant in vitro end points. ML models were built using multiple approaches for the descriptors: (1) calculated physical properties and structural descriptors based on chemical structure alone (classical QSAR/QSPR); (2) in vitro measured inputs only with no structure-based descriptors (ML IVIVE); and (3) in silico ML IVIVE using in silico model predictions for the in vitro inputs. For the mechanistic models, well-stirred and parallel-tube liver models were considered with and without the use of empirical scaling factors and with and without renal clearance. The best ML model for the prediction of in vivo human intrinsic clearance (CLint) was an in vitro ML IVIVE model using only six in vitro inputs with an average absolute fold error (AAFE) of 2.5. The best mechanistic model used the parallel-tube liver model, with empirical scaling factors resulting in an AAFE of 2.8. The corresponding mechanistic model with full in silico inputs achieved an AAFE of 3.3. These relative performances of the models were confirmed with the prediction of 16 Pfizer drug candidates that were not part of the original data set. Results show that ML IVIVE models are comparable to or superior to their best mechanistic counterparts. We also show that ML IVIVE models can be used to derive insights into factors for the improvement of mechanistic PK prediction.

Collapse

Affiliation(s)

Christopher E Keefer Translational Modeling and Simulation, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States
George Chang Translational Modeling and Simulation, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States
Li Di Pharmacokinetics, Dynamics and Metabolism, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States
Nathaniel A Woody Translational Modeling and Simulation, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States
David A Tess Translational Modeling and Simulation, Pfizer Worldwide Research and Development, Cambridge, Massachusetts 02139, United States
Sarah M Osgood Pharmacokinetics, Dynamics and Metabolism, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States
Brendon Kapinos Discovery Sciences, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States
Jill Racich Discovery Sciences, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States
Anthony A Carlo Discovery Sciences, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States
Amanda Balesano Pharmacokinetics, Dynamics and Metabolism, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States
Nicholas Ferguson Pharmacokinetics, Dynamics and Metabolism, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States
Christine Orozco Pharmacokinetics, Dynamics and Metabolism, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States
Larisa Zueva Pharmacokinetics, Dynamics and Metabolism, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States
Lina Luo Pharmacokinetics, Dynamics and Metabolism, Pfizer Worldwide Research and Development, Groton, Connecticut 06340, United States

Collapse

Sosnowska A, Mudlaff M, Gorb L, Bulawska N, Zdybel S, Bakker M, Peijnenburg W, Puzyn T. Expanding the applicability domain of QSPRs for predicting water solubility and vapor pressure of PFAS. CHEMOSPHERE 2023;340:139965. [PMID: 37633602 DOI: 10.1016/j.chemosphere.2023.139965] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 08/22/2023] [Accepted: 08/23/2023] [Indexed: 08/28/2023]

Banerjee A, Roy K. Read-across-based intelligent learning: development of a global q-RASAR model for the efficient quantitative predictions of skin sensitization potential of diverse organic chemicals. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2023;25:1626-1644. [PMID: 37682520 DOI: 10.1039/d3em00322a] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]

Chatterjee M, Banerjee A, Tosi S, Carnesecchi E, Benfenati E, Roy K. Machine learning - based q-RASAR modeling to predict acute contact toxicity of binary organic pesticide mixtures in honey bees. JOURNAL OF HAZARDOUS MATERIALS 2023;460:132358. [PMID: 37634379 DOI: 10.1016/j.jhazmat.2023.132358] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 08/02/2023] [Accepted: 08/20/2023] [Indexed: 08/29/2023]

Chatterjee M, Roy K. "Data fusion" quantitative read-across structure-activity-activity relationships (q-RASAARs) for the prediction of toxicities of binary and ternary antibiotic mixtures toward three bacterial species. JOURNAL OF HAZARDOUS MATERIALS 2023;459:132129. [PMID: 37506640 DOI: 10.1016/j.jhazmat.2023.132129] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 06/28/2023] [Accepted: 07/21/2023] [Indexed: 07/30/2023]

Kajtazi A, Russo G, Wicht K, Eghbali H, Lynen F. Facilitating structural elucidation of small environmental solutes in RPLC-HRMS by retention index prediction. CHEMOSPHERE 2023;337:139361. [PMID: 37392796 DOI: 10.1016/j.chemosphere.2023.139361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 06/06/2023] [Accepted: 06/26/2023] [Indexed: 07/03/2023]

Abstract

Implementing effective environmental management strategies requires a comprehensive understanding of the chemical composition of environmental pollutants, particularly in complex mixtures. Utilizing innovative analytical techniques, such as high-resolution mass spectrometry and predictive retention index models, can provide valuable insights into the molecular structures of environmental contaminants. Liquid Chromatography-High-Resolution Mass Spectrometry is a powerful tool for the identification of isomeric structures in complex samples. However, there are some limitations that can prevent accurate isomeric structure identification, particularly in cases where the isomers have similar mass and fragmentation patterns. Liquid chromatographic retention, determined by the size, shape, and polarity of the analyte and its interactions with the stationary phase, contains valuable 3D structural information that is vastly underutilized. Therefore, a predictive retention index model is developed which is transferrable to LC-HRMS systems and can assist in the structural elucidation of unknowns. The approach is currently restricted to carbon, hydrogen, and oxygen-based molecules <500 g mol-1. The methodology facilitates the acceptance of accurate structural formulas and the exclusion of erroneous hypothetical structural representations by leveraging retention time estimations, thereby providing a permissible tolerance range for a given elemental composition and experimental retention time. This approach serves as a proof of concept for the development of a Quantitative Structure-Retention Relationship model using a generic gradient LC approach. The use of a widely used reversed-phase (U)HPLC column and a relatively large set of training (101) and test compounds (14) demonstrates the feasibility and potential applicability of this approach for predicting the retention behaviour of compounds in complex mixtures. By providing a standard operating procedure, this approach can be easily replicated and applied to various analytical challenges, further supporting its potential for broader implementation.

Collapse

Banerjee A, Roy K. Prediction-Inspired Intelligent Training for the Development of Classification Read-across Structure-Activity Relationship (c-RASAR) Models for Organic Skin Sensitizers: Assessment of Classification Error Rate from Novel Similarity Coefficients. Chem Res Toxicol 2023;36:1518-1531. [PMID: 37584642 DOI: 10.1021/acs.chemrestox.3c00155] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/17/2023]

Abstract

The advancements in the field of cheminformatics have led to a reduction in animal testing to estimate the activity, property, and toxicity of query chemicals. Read-across structure-activity relationship (RASAR) is an emerging concept that utilizes various similarity functions derived from chemical information to develop highly predictive models. Unlike quantitative structure-activity relationship (QSAR) models, RASAR descriptors of a query compound are computed from its close congeners instead of the compound itself, thus targeting predictions in the model training phase. The objective of the present study is not to propose new QSAR models for skin sensitization but to demonstrate the enhancement in the quality of predictions of the skin-sensitizing potential of organic compounds by developing classification-based RASAR (c-RASAR) models. A diverse, previously curated data set was collected from the literature for which 2D descriptors were computed. The extracted essential features were then used to develop a classification-based linear discriminant analysis (LDA) QSAR model. Furthermore, from the read-across-based predictions, RASAR descriptors were calculated using the basic settings of the hyperparameters for the Laplacian Kernel-based optimum similarity measure. After feature selection, an LDA c-RASAR model was developed, which superseded the prediction quality of the LDA-QSAR model. Various other combinations of RASAR descriptors were also taken to develop additional c-RASAR models, all showing better prediction quality than the LDA QSAR model while using a lower number of descriptors. Various other machine learning c-RASAR models were also developed for comparison purposes. In this work, we have proposed and analyzed three new similarity metrics: gm_class, sm1, and sm2. The first one is an indicator variable used to generate a simple univariate c-RASAR model with good prediction ability, while the remaining two are similarity indices used to analyze possible activity cliffs in the training and test sets and are believed to play an important role in the modelability analysis of data sets.

Collapse

Dos Santos BR, Ramos ABDSB, de Menezes RPB, Scotti MT, Colombo FA, Marques MJ, Reimão JQ. Repurposing the Medicines for Malaria Venture's COVID Box to discover potent inhibitors of Toxoplasma gondii, and in vivo efficacy evaluation of almitrine bismesylate (MMV1804175) in chronically infected mice. PLoS One 2023;18:e0288335. [PMID: 37418497 DOI: 10.1371/journal.pone.0288335] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 06/24/2023] [Indexed: 07/09/2023] Open

Abstract

Toxoplasmosis, caused by the obligate intracellular parasite Toxoplasma gondii, affects about one-third of the world's population and can cause severe congenital, neurological and ocular issues. Current treatment options are limited, and there are no human vaccines available to prevent transmission. Drug repurposing has been effective in identifying anti-T. gondii drugs. In this study, the screening of the COVID Box, a compilation of 160 compounds provided by the "Medicines for Malaria Venture" organization, was conducted to explore its potential for repurposing drugs to combat toxoplasmosis. The objective of the present work was to evaluate the compounds' ability to inhibit T. gondii tachyzoite growth, assess their cytotoxicity against human cells, examine their absorption, distribution, metabolism, excretion, and toxicity (ADMET) properties, and investigate the potential of one candidate drug through an experimental chronic model of toxoplasmosis. Early screening identified 29 compounds that could inhibit T. gondii survival by over 80% while keeping human cell survival up to 50% at a concentration of 1 μM. The Half Effective Concentrations (EC50) of these compounds ranged from 0.04 to 0.92 μM, while the Half Cytotoxic Concentrations (CC50) ranged from 2.48 to over 50 μM. Almitrine was chosen for further evaluation due to its favorable characteristics, including anti-T. gondii activity at nanomolar concentrations, low cytotoxicity, and ADMET properties. Administering almitrine bismesylate (Vectarion®) orally at dose of 25 mg/kg/day for ten consecutive days resulted in a statistically significant (p < 0.001) reduction in parasite burden in the brains of mice chronically infected with T. gondii (ME49 strain). This was determined by quantifying the RNA of living parasites using real-time PCR. The presented results suggest that almitrine may be a promising drug candidate for additional experimental studies on toxoplasmosis and provide further evidence of the potential of the MMV collections as a valuable source of drugs to be repositioned for infectious diseases.

Collapse

Cesaro A, Bagheri M, Torres MDT, Wan F, de la Fuente-Nunez C. Deep learning tools to accelerate antibiotic discovery. Expert Opin Drug Discov 2023;18:1245-1257. [PMID: 37794737 PMCID: PMC10790350 DOI: 10.1080/17460441.2023.2250721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Accepted: 08/18/2023] [Indexed: 10/06/2023]

Affiliation(s)

Angela Cesaro Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
Mojtaba Bagheri Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
Marcelo D. T. Torres Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
Fangping Wan Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
Cesar de la Fuente-Nunez Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America

Collapse

El-Atawneh S, Goldblum A. Activity Models of Key GPCR Families in the Central Nervous System: A Tool for Many Purposes. J Chem Inf Model 2023. [PMID: 37257045 DOI: 10.1021/acs.jcim.2c01531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]

De Gauquier P, Peeters J, Vanommeslaeghe K, Vander Heyden Y, Mangelings D. Modelling the enantiorecognition of structurally diverse pharmaceuticals on O-substituted polysaccharide-based stationary phases. Talanta 2023;259:124497. [PMID: 37030098 DOI: 10.1016/j.talanta.2023.124497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 03/22/2023] [Accepted: 03/27/2023] [Indexed: 03/31/2023]

Abstract

This study aims to develop models to predict the retention, separation and elution sequence of the enantiomers of structurally diverse pharmaceuticals. More specifically, Quantitative Structure Retention Relationships (QSRR) models are built that describe the relationship between molecular descriptors and retention. Eighteen structurally diverse chiral mixtures, each consisting of a pair of enantiomers, were analyzed on two polysaccharide chiral stationary phases, Chiralcel OD-RH (cellulose tris(3,5-dimethylphenylcarbamate)) and Lux amylose-2 (amylose tris(5-chloro-2-methylphenylcarbamate)), applying either a basic or an acidic mobile phase, and their retention factor and elution sequence were determined. Both achiral and, in-house defined, chiral descriptors were used as descriptive variables to build the models. Linear regression techniques, i.e. stepwise multiple linear regression (sMLR) and partial least squares (PLS) regression, were applied to model the retention or separation as a function of the descriptors. In a first step, models were built with only achiral descriptors to model the global retention of both enantiomers of a chiral molecule. Subsequently, models were built with only chiral descriptors to predict the enantioseparation and elution sequence, and finally, models were considered with both descriptor types to predict the retention, the separation and the elution sequence of the enantiomers. The global retention was predicted well by the sMLR models with only achiral descriptors. The models with only chiral descriptors were not found suitable to predict the enantioseparation and elution sequence. Finally, the models containing both chiral and achiral descriptors allowed predicting the retention well, but their ability to predict the elution sequence and separation of the enantiomers differed widely for the chromatographic systems considered.

Collapse

Ghosh S, Chhabria MT, Roy K. Exploring quantitative structure-property relationship models for environmental fate assessment of petroleum hydrocarbons. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023;30:26218-26233. [PMID: 36355241 DOI: 10.1007/s11356-022-23904-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Accepted: 10/26/2022] [Indexed: 06/16/2023]

Abstract

The rate and extent of biodegradation of petroleum hydrocarbons in the different aquatic environments is an important element to address. The major avenue for removing petroleum hydrocarbons from the environment is thought to be biodegradation. The present study involves the development of predictive quantitative structure-property relationship (QSPR) models for the primary biodegradation half-life of petroleum hydrocarbons that may be used to forecast the biodegradation half-life of untested petroleum hydrocarbons within the established models' applicability domain. These models use easily computable two-dimensional (2D) descriptors to investigate important structural characteristics needed for the biodegradation of petroleum hydrocarbons in freshwater (dataset 1), temperate seawater (dataset 2), and arctic seawater (dataset 3). All the developed models follow OECD guidelines. We have used double cross-validation, best subset selection, and partial least squares tools for model development. In addition, the small dataset modeler tool has been successfully used for the dataset with very few compounds (dataset 3 with 17 compounds), where dataset division was not possible. The resultant models are robust, predictive, and mechanistically interpretable based on both internal and external validation metrics (R² range of 0.605-0.959. Q²_(Loo) range of 0.509-0.904, and Q²_F1 range of 0.526-0.959). The intelligent consensus predictor tool has been used for the improvement of the prediction quality for test set compounds which provided superior outcomes to those from individual partial least squares models based on several metrics (Q²_F1 = 0.808 and Q²_F2 = 0.805 for dataset 1 in freshwater). Molecular size and hydrophilic factor for freshwater, frequency of two carbon atoms at topological distance 4 for temperate seawater, and electronegative atom count relative to size for arctic seawater were found to be the most significant descriptors responsible for the regulation of biodegradation half-life of petroleum hydrocarbons.

Collapse

QSPR models for the critical temperature and pressure of cycloalkanes. Chem Phys Lett 2022. [DOI: 10.1016/j.cplett.2022.140088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Paul R, Chatterjee M, Roy K. First report on soil ecotoxicity prediction against Folsomia candida using intelligent consensus predictions and chemical read-across. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022;29:88302-88317. [PMID: 35829883 DOI: 10.1007/s11356-022-21937-w] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 07/05/2022] [Indexed: 06/15/2023]

Banerjee A, De P, Kumar V, Kar S, Roy K. Quick and efficient quantitative predictions of androgen receptor binding affinity for screening Endocrine Disruptor Chemicals using 2D-QSAR and Chemical Read-Across. CHEMOSPHERE 2022;309:136579. [PMID: 36174732 DOI: 10.1016/j.chemosphere.2022.136579] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 09/18/2022] [Accepted: 09/20/2022] [Indexed: 06/16/2023]

Chatterjee M, Roy K. Chemical similarity and machine learning-based approaches for the prediction of aquatic toxicity of binary and multicomponent pharmaceutical and pesticide mixtures against Aliivibrio fischeri. CHEMOSPHERE 2022;308:136463. [PMID: 36122748 DOI: 10.1016/j.chemosphere.2022.136463] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 09/10/2022] [Accepted: 09/12/2022] [Indexed: 06/15/2023]

The use of machine learning modeling, virtual screening, molecular docking, and molecular dynamics simulations to identify potential VEGFR2 kinase inhibitors. Sci Rep 2022;12:18825. [PMID: 36335233 PMCID: PMC9637137 DOI: 10.1038/s41598-022-22992-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 10/21/2022] [Indexed: 11/08/2022] Open