Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Consonni V, Todeschini R. Molecular Descriptors. Challenges and Advances in Computational Chemistry and Physics 2010. [DOI: 10.1007/978-1-4020-9783-6_3] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Number

Cited by Other Article(s)

Bandini E, Castellano Ontiveros R, Kajtazi A, Eghbali H, Lynen F. Physicochemical modelling of the retention mechanism of temperature-responsive polymeric columns for HPLC through machine learning algorithms. J Cheminform 2024;16:72. [PMID: 38907264 PMCID: PMC11193285 DOI: 10.1186/s13321-024-00873-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 06/14/2024] [Indexed: 06/23/2024] Open

Chatterjee M, Roy K. Predictive binary mixture toxicity modeling of fluoroquinolones (FQs) and the projection of toxicity of hypothetical binary FQ mixtures: a combination of 2D-QSAR and machine-learning approaches. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2024;26:105-118. [PMID: 38073518 DOI: 10.1039/d3em00445g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2024]

Abstract

All sorts of chemicals get degraded under various environmental stresses, and the degradates coexist with the parent compounds as mixtures in the environment. Antibiotics emerge as an additional concern due to the bioactive nature of both the parent compound and degradation products and their combined exposure to the environment. Therefore, environmental risk assessment of antibiotics and their degradation products is very much necessary. In this direction, we made use of in silico new approach methodologies (NAMs) and machine-learning algorithms. In this study, we have developed a robust and predictive mixture-quantitative structure-activity relationship (QSAR) model with promising quality and predictability (internal: MAETrain = 0.085, QLOO2 = 0.849, external: MAETest = 0.090, and QF12 = 0.859) for predicting the toxicity of the mixtures of a class of antibiotics and their degradation products. To obtain the predictive model, toxicity data of 78 binary fluoroquinolone mixtures in E. coli (endpoint: log 1/IC50 in molar) have been utilized. We have used only 0D-2D descriptors to efficiently encode the structural features of mixture components without any additional complexities. The optimization of the class of mixture descriptors has been performed in this study by using three different mixing rules (linear combination of molecular contributions, the squared molecular contributions, and the norm of molecular contributions). Different machine-learning approaches namely, random forest (RF), ada boost, gradient boost (GB), extreme gradient boost (XGB), support vector machine (SVM), linear support vector machine (LSVM), and ridge regression (RR) have been employed here apart from the conventional partial least squares (PLS) regression to optimize the modeling approach. A rigorous validation protocol has been used for assessing the goodness-of-fit, robustness, and external predictability of the models. Finally, the toxicity of possible untested mixtures of different photodegradation products of fluoroquinolones has been predicted using the best model reported in this study.

Collapse

Shahini E, Chaulagain N, Shankar K, Tang T. Predicting Free Energies of Exfoliation and Solvation for Graphitic Carbon Nitrides Using Machine Learning. ACS APPLIED MATERIALS & INTERFACES 2023;15:53786-53801. [PMID: 37938813 DOI: 10.1021/acsami.3c09347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2023]

Abstract

As a metal-free and visible-light-responsive photocatalyst, graphitic carbon nitride (g-C3N4) has emerged as a new research hotspot and has attracted broad attention in the field of solar energy conversion and thin-film transistors. Liquid-phase exfoliation (LPE) is the best-known method for the synthesis of 2D g-C3N4 nanosheets. In LPE, bulk g-C3N4 is exfoliated in a solvent via high-shear mixing or sonication in order to produce a stable suspension of individual nanosheets. Two parameters of importance in gauging the performance of a solvent in LPE are the free energy required to exfoliate a unit area of layered materials into individual sheets in the solvent (ΔGexf) and the solvation free energy per unit area of a nanosheet (ΔGsol). While approximations for the free energies exist, they are shown in our previous work to be inaccurate and incapable of capturing the experimentally observed efficacy of LPE. Molecular dynamics (MD) simulations can provide accurate free-energy calculations, but doing so for every single solvent is time- and resource-consuming. Herein, machine learning (ML) algorithms are used to predict ΔGexf and ΔGsol for g-C3N4. First, a database for ΔGexf and ΔGsol is created based on a series of MD simulations involving 49 different solvents with distinct chemical structures and properties. The data set also includes values of critical descriptors for the solvents, including density, surface tension, dielectric constant, etc. Different ML methods are compared, accompanied by descriptor selection, to develop the most accurate model for predicting ΔGexf and ΔGsol. The extra tree regressor is shown to be the best performer among the six ML methods studied. Experimental validation of the model is conducted by performing dispersibility tests in several solvents for which the free energies are predicted. Finally, the influence of the selected descriptors on the free energies is analyzed, and strategies for solvent selection in LPE are proposed.

Collapse

Mullowney MW, Duncan KR, Elsayed SS, Garg N, van der Hooft JJJ, Martin NI, Meijer D, Terlouw BR, Biermann F, Blin K, Durairaj J, Gorostiola González M, Helfrich EJN, Huber F, Leopold-Messer S, Rajan K, de Rond T, van Santen JA, Sorokina M, Balunas MJ, Beniddir MA, van Bergeijk DA, Carroll LM, Clark CM, Clevert DA, Dejong CA, Du C, Ferrinho S, Grisoni F, Hofstetter A, Jespers W, Kalinina OV, Kautsar SA, Kim H, Leao TF, Masschelein J, Rees ER, Reher R, Reker D, Schwaller P, Segler M, Skinnider MA, Walker AS, Willighagen EL, Zdrazil B, Ziemert N, Goss RJM, Guyomard P, Volkamer A, Gerwick WH, Kim HU, Müller R, van Wezel GP, van Westen GJP, Hirsch AKH, Linington RG, Robinson SL, Medema MH. Artificial intelligence for natural product drug discovery. Nat Rev Drug Discov 2023;22:895-916. [PMID: 37697042 DOI: 10.1038/s41573-023-00774-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/20/2023] [Indexed: 09/13/2023]

Affiliation(s)

Michael W Mullowney Duchossois Family Institute, The University of Chicago, Chicago, IL, USA
Katherine R Duncan Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow, UK
Somayah S Elsayed Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
Neha Garg School of Chemistry and Biochemistry, Center for Microbial Dynamics and Infection, Georgia Institute of Technology, Atlanta, GA, USA
Justin J J van der Hooft Bioinformatics Group, Wageningen University, Wageningen, The Netherlands Department of Biochemistry, University of Johannesburg, Johannesburg, South Africa
Nathaniel I Martin Biological Chemistry Group, Institute of Biology, Leiden University, Leiden, The Netherlands
David Meijer Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
Barbara R Terlouw Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
Friederike Biermann Bioinformatics Group, Wageningen University, Wageningen, The Netherlands Institute of Molecular Bio Science, Goethe-University Frankfurt, Frankfurt am Main, Germany LOEWE Center for Translational Biodiversity Genomics (TBG), Frankfurt am Main, Germany
Kai Blin The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
Janani Durairaj Biozentrum, University of Basel, Basel, Switzerland
Marina Gorostiola González Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands ONCODE institute, Leiden, The Netherlands
Eric J N Helfrich Institute of Molecular Bio Science, Goethe-University Frankfurt, Frankfurt am Main, Germany LOEWE Center for Translational Biodiversity Genomics (TBG), Frankfurt am Main, Germany
Florian Huber Center for Digitalization and Digitality, Hochschule Düsseldorf, Düsseldorf, Germany
Stefan Leopold-Messer Institut für Mikrobiologie, Eidgenössische Technische Hochschule (ETH) Zürich, Zürich, Switzerland
Kohulan Rajan Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller-University Jena, Jena, Germany
Tristan de Rond School of Chemical Sciences, University of Auckland, Auckland, New Zealand
Jeffrey A van Santen Department of Chemistry, Simon Fraser University, Burnaby, British Columbia, Canada
Maria Sorokina Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller University, Jena, Germany Pharmaceuticals R&D, Bayer AG, Berlin, Germany
Marcy J Balunas Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI, USA Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
Mehdi A Beniddir Équipe "Chimie des Substances Naturelles", Université Paris-Saclay, CNRS, BioCIS, Orsay, France
Doris A van Bergeijk Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
Laura M Carroll Structural and Computational Biology Unit, EMBL, Heidelberg, Germany
Chase M Clark Division of Pharmaceutical Sciences, School of Pharmacy, University of Wisconsin-Madison, Madison, WI, USA
Djork-Arné Clevert WRDM - Machine Learning Research, Pfizer, Berlin, Germany
Chris A Dejong Adapsyn Bioscience, Hamilton, Ontario, Canada
Chao Du Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands
Scarlet Ferrinho Chemistry Department, University of St Andrews, St Andrews, UK
Francesca Grisoni Institute for Complex Molecular Systems, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands Centre for Living Technologies, Alliance TU/e, WUR, UU, UMC Utrecht, Utrecht, The Netherlands
Albert Hofstetter Laboratory of Physical Chemistry, ETH Zürich, Zürich, Switzerland
Willem Jespers Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands
Olga V Kalinina Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbrücken, Germany Drug Bioinformatics, Medical Faculty, Saarland University, Homburg, Germany Center for Bioinformatics, Saarland University, Saarbrücken, Germany
Satria A Kautsar Department of Chemistry, Scripps Research, FL, USA
Hyunwoo Kim College of Pharmacy and Integrated Research Institute for Drug Development, Dongguk University Seoul, Goyang-si, Republic of Korea
Tiago F Leao Center for Nuclear Energy in Agriculture, University of São Paulo, Piracicaba, Brazil
Joleen Masschelein Center for Microbiology, VIB-KU Leuven, Heverlee, Belgium Department of Biology, KU Leuven, Heverlee, Belgium
Evan R Rees Division of Pharmaceutical Sciences, School of Pharmacy, University of Wisconsin-Madison, Madison, WI, USA
Raphael Reher Institute of Pharmaceutical Biology and Biotechnology, University of Marburg, Marburg, Germany Institute of Pharmacy, Martin-Luther-University Halle-Wittenberg, Halle (Saale), Germany
Daniel Reker Department of Biomedical Engineering, Duke University, Durham, NC, USA Duke Microbiome Center, Duke University, Durham, NC, USA
Philippe Schwaller Laboratory of Artificial Chemical Intelligence, Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
Marwin Segler Microsoft Research, Cambridge, UK
Michael A Skinnider Adapsyn Bioscience, Hamilton, Ontario, Canada Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
Allison S Walker Department of Chemistry, Vanderbilt University, Nashville, TN, USA Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
Egon L Willighagen Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, The Netherlands
Barbara Zdrazil European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridgeshire, UK
Nadine Ziemert Interfaculty Institute for Microbiology and Infection Medicine Tuebingen (IMIT), Institute for Bioinformatics and Medical Informatics (IBMI), University of Tuebingen, Tuebingen, Germany
Rebecca J M Goss Chemistry Department, University of St Andrews, St Andrews, UK
Pierre Guyomard Bonsai team, CRIStAL - Centre de Recherche en Informatique Signal et Automatique de Lille, Université de Lille, Villeneuve d'Ascq Cedex, France
Andrea Volkamer Center for Bioinformatics, Saarland University, Saarbrücken, Germany In silico Toxicology and Structural Bioinformatics, Institute of Physiology, Charité - Universitätsmedizin Berlin, Berlin, Germany
William H Gerwick Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
Hyun Uk Kim Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea
Rolf Müller Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbrücken, Germany Department of Pharmacy, Saarland University, Saarbrücken, Germany German Center for infection research (DZIF), Braunschweig, Germany Helmholtz International Lab for Anti-Infectives, Saarbrücken, Germany
Gilles P van Wezel Department of Molecular Biotechnology, Institute of Biology, Leiden University, Leiden, The Netherlands Netherlands Institute of Ecology, NIOO-KNAW, Wageningen, The Netherlands
Gerard J P van Westen Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden, The Netherlands.
Anna K H Hirsch Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarbrücken, Germany. Department of Pharmacy, Saarland University, Saarbrücken, Germany. German Center for infection research (DZIF), Braunschweig, Germany. Helmholtz International Lab for Anti-Infectives, Saarbrücken, Germany.
Roger G Linington Department of Chemistry, Simon Fraser University, Burnaby, British Columbia, Canada.
Serina L Robinson Department of Environmental Microbiology, Eawag: Swiss Federal Institute for Aquatic Science and Technology, Dübendorf, Switzerland.
Marnix H Medema Bioinformatics Group, Wageningen University, Wageningen, The Netherlands. Institute of Biology, Leiden University, Leiden, The Netherlands.

Collapse

Viesi E, Sardina DS, Perricone U, Giugno R. APDB: a database on air pollutant characterization and similarity prediction. Database (Oxford) 2023;2023:baad046. [PMID: 37450416 PMCID: PMC10348400 DOI: 10.1093/database/baad046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Revised: 05/12/2023] [Accepted: 06/16/2023] [Indexed: 07/18/2023]

Dutschmann TM, Kinzel L, Ter Laak A, Baumann K. Large-scale evaluation of k-fold cross-validation ensembles for uncertainty estimation. J Cheminform 2023;15:49. [PMID: 37118768 PMCID: PMC10142532 DOI: 10.1186/s13321-023-00709-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2022] [Accepted: 03/10/2023] [Indexed: 04/30/2023] Open

How the Structure of Per- and Polyfluoroalkyl Substances (PFAS) Influences Their Binding Potency to the Peroxisome Proliferator-Activated and Thyroid Hormone Receptors-An In Silico Screening Study. MOLECULES (BASEL, SWITZERLAND) 2023;28:molecules28020479. [PMID: 36677537 PMCID: PMC9866891 DOI: 10.3390/molecules28020479] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 12/22/2022] [Accepted: 12/23/2022] [Indexed: 01/06/2023]

Didachos C, Kintos DP, Fousteris M, Mylonas P, Kanavos A. An Optimized Cloud Computing Method for Extracting Molecular Descriptors. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2023;1424:247-254. [PMID: 37486501 DOI: 10.1007/978-3-031-31982-2_28] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/25/2023]

Metwally AA, Nayel AA, Hathout RM. In silico prediction of siRNA ionizable-lipid nanoparticles In vivo efficacy: Machine learning modeling based on formulation and molecular descriptors. Front Mol Biosci 2022;9:1042720. [PMID: 36619167 PMCID: PMC9811823 DOI: 10.3389/fmolb.2022.1042720] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Accepted: 12/07/2022] [Indexed: 12/24/2022] Open

Cheng XR, Yu BT, Song J, Ma JH, Chen YY, Zhang CX, Tu PH, Muskat MN, Zhu ZG. The Alleviation of Dextran Sulfate Sodium (DSS)-Induced Colitis Correlate with the logP Values of Food-Derived Electrophilic Compounds. Antioxidants (Basel) 2022;11:antiox11122406. [PMID: 36552614 PMCID: PMC9774124 DOI: 10.3390/antiox11122406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Revised: 11/29/2022] [Accepted: 12/02/2022] [Indexed: 12/07/2022] Open

Noise-robust optimization of quantum machine learning models for polymer properties using a simulator and validated on the IonQ quantum computer. Sci Rep 2022;12:19003. [PMID: 36347908 PMCID: PMC9643424 DOI: 10.1038/s41598-022-22940-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Accepted: 10/21/2022] [Indexed: 11/09/2022] Open

Asahara R, Miyao T. Extended Connectivity Fingerprints as a Chemical Reaction Representation for Enantioselective Organophosphorus-Catalyzed Asymmetric Reaction Prediction. ACS OMEGA 2022;7:26952-26964. [PMID: 35936487 PMCID: PMC9352214 DOI: 10.1021/acsomega.2c03812] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Accepted: 07/07/2022] [Indexed: 06/15/2023]

Nkulikiyinka P, Wagland ST, Manovic V, Clough PT. Prediction of Combined Sorbent and Catalyst Materials for SE-SMR, Using QSPR and Multitask Learning. Ind Eng Chem Res 2022;61:9218-9233. [PMID: 35818477 PMCID: PMC9264356 DOI: 10.1021/acs.iecr.2c00971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

De P, Kar S, Ambure P, Roy K. Prediction reliability of QSAR models: an overview of various validation tools. Arch Toxicol 2022;96:1279-1295. [PMID: 35267067 DOI: 10.1007/s00204-022-03252-y] [Citation(s) in RCA: 38] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2021] [Accepted: 02/14/2022] [Indexed: 01/20/2023]

Abstract

The reliability of any quantitative structure-activity relationship (QSAR) model depends on multiple aspects such as the accuracy of the input dataset, selection of significant descriptors, the appropriate splitting process of the dataset, statistical tools used, and most notably on the measures of validation. Validation, the most crucial step in QSAR model development, confirms the reliability of the developed QSAR models and the acceptability of each step in the model development. The present review deals with various validation tools that involve multiple techniques that improve the model quality and robustness. The double cross-validation tool helps in building improved quality models using different combinations of the same training set in an inner cross-validation loop. This exhaustive method is also integrated for small datasets (< 40 compounds) in another tool, namely the small dataset modeler tool. The main aim of QSAR researchers is to improve prediction quality by lowering the prediction errors for the query compounds. 'Intelligent' selection of multiple models and consensus predictions integrated in the intelligent consensus predictor tool were found to be more externally predictive than individual models. Furthermore, another tool called Prediction Reliability Indicator was explained to understand the quality of predictions for a true external set. This tool uses a composite scoring technique to identify query compounds as 'good' or 'moderate' or 'bad' predictions. We have also discussed a quantitative read-across tool which predicts a chemical response based on the similarity with structural analogues. The discussed tools are freely available from https://dtclab.webs.com/software-tools or http://teqip.jdvu.ac.in/QSAR_Tools/DTCLab/ and https://sites.google.com/jadavpuruniversity.in/dtc-lab-software/home (for read-across).

Collapse

Karthikeyan A, Priyakumar UD. Artificial intelligence: machine learning for chemical sciences. J CHEM SCI 2021;134:2. [PMID: 34955617 PMCID: PMC8691161 DOI: 10.1007/s12039-021-01995-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Revised: 09/08/2021] [Accepted: 09/14/2021] [Indexed: 12/05/2022]

Sleight TW, Sexton CN, Mpourmpakis G, Gilbertson LM, Ng CA. A Classification Model to Identify Direct-Acting Mutagenic Polycyclic Aromatic Hydrocarbon Transformation Products. Chem Res Toxicol 2021;34:2273-2286. [PMID: 34662518 DOI: 10.1021/acs.chemrestox.1c00187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Abstract

Polycyclic aromatic hydrocarbons (PAHs) are a complex group of environmental contaminants, many having long environmental half-lives. As these compounds degrade, the changes in their structure can result in a substantial increase in mutagenicity compared to the parent compound. Over time, each individual PAH can potentially degrade into several thousand unique transformation products, creating a complex, constantly evolving set of intermediates. Microbial degradation is the primary mechanism of their transformation and ultimate removal from the environment, and this process can result in mutagenic activation similar to the metabolic activation that can occur in multicellular organisms. The diversity of the potential intermediate structures in PAH-contaminated environments renders hazard assessment difficult for both remediation professionals and regulators. A mixture of structural and energetic descriptors has proven effective in existing studies for classifying which PAH transformation products will be mutagenic. However, most existing studies of environmental PAH mutagens primarily focus on nitrogenated derivatives, which are prevalent in the atmosphere and not as relevant in soil. Additionally, PAH products commonly found in the environment can range from as large as five rings to as small as a single ring, requiring a broadly inclusive methodology to comprehensively evaluate mutagenic potential. We developed a combination of supervised and unsupervised machine learning methods to predict environmentally induced PAH mutagenicity with improved performance over currently available tools. K-means clustering with principal component analysis allows us to identify molecular clusters that we hypothesize to have similar mechanisms of action. Recursive feature elimination identifies the most influential descriptors. The cluster-specific regression outperforms available classifiers in predicting direct-acting mutagens resulting from the microbial biodegradation of PAHs and provides direction for future studies evaluating the environmental hazards resulting from PAH biodegradation.

Collapse

Zhang XC, Wu CK, Yang ZJ, Wu ZX, Yi JC, Hsieh CY, Hou TJ, Cao DS. MG-BERT: leveraging unsupervised atomic representation learning for molecular property prediction. Brief Bioinform 2021;22:6265201. [PMID: 33951729 DOI: 10.1093/bib/bbab152] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 03/11/2021] [Accepted: 04/01/2021] [Indexed: 11/12/2022] Open

Khan PM, Lombardo A, Benfenati E, Roy K. First report on chemometric modeling of hydrolysis half-lives of organic chemicals. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2021;28:1627-1642. [PMID: 32844343 DOI: 10.1007/s11356-020-10500-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Accepted: 08/12/2020] [Indexed: 06/11/2023]

Abstract

Hydrolysis is one of the most important processes of transformation of organic chemicals in water. The rates of reactions, final chemical entities of these processes, and half-lives of organic chemicals are of considerable interest to environmental chemists as well as authorities involved in the controlling the processing and disposal of such organic chemicals. In this study, we have proposed QSPR models for the prediction of hydrolysis half-life of organic chemicals as a function of different pH and temperature conditions using only two-dimensional molecular descriptors with definite physicochemical significance. For each model, suitable subsets of variables were elected using a genetic algorithm method; next, the elected subsets of variables were subjected to the best subset selection with a key objective to determine the best combination of descriptors for model generation. Finally, QSPR models were constructed using the best combination of variables employing the partial least squares (PLS) regression technique. Next, every final model was subjected for strict validation employing the internationally accepted internal and external validation parameters. The proposed models could be applicable for data gap filling to determine hydrolysis half-lives of organic chemicals at different environmental conditions. Generally, presence of aliphatic ether and ether functional groups, high percentage of oxygen content in the molecule and presence of O-Si pairs of atoms at topological distance one, results in a shorter hydrolysis half-life of organic chemicals. On the other hand, higher unsaturation content and high percentage of nitrogen content in molecules lead to higher hydrolysis half-life. It is also found that branched and compact molecules will have a lower half-life while straight chain analogues will have a higher half-life. To the best of our knowledge, the presented models are the first reported QSPR models for hydrolysis half-lives of organic chemicals at different pH values.

Collapse

Hu Y, Zhou G, Zhang C, Zhang M, Chen Q, Zheng L, Niu B. Identify Compounds' Target Against Alzheimer's Disease Based on In-Silico Approach. Curr Alzheimer Res 2020;16:193-208. [PMID: 30605059 DOI: 10.2174/1567205016666190103154855] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Revised: 12/20/2018] [Accepted: 01/03/2019] [Indexed: 11/22/2022]

Zhang Y, Han Z, Gao Q, Bai X, Zhang C, Hou H. Prediction of K562 Cells Functional Inhibitors Based on Machine Learning Approaches. Curr Pharm Des 2019;25:4296-4302. [PMID: 31696803 DOI: 10.2174/1381612825666191107092214] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Accepted: 11/04/2019] [Indexed: 12/14/2022]

Quantitative structure-property relationship modeling of polar analytes lacking UV chromophores to charged aerosol detector response. Anal Bioanal Chem 2019;411:2945-2959. [DOI: 10.1007/s00216-019-01744-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Revised: 02/26/2019] [Accepted: 03/01/2019] [Indexed: 11/27/2022]

Classification of thyroid hormone receptor agonists and antagonists using statistical learning approaches. Mol Divers 2018;23:85-92. [DOI: 10.1007/s11030-018-9857-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Accepted: 07/09/2018] [Indexed: 02/06/2023]

Quantitative structure –retention relationship modeling of selected antipsychotics and their impurities in green liquid chromatography using cyclodextrin mobile phases. Anal Bioanal Chem 2018;410:2533-2550. [DOI: 10.1007/s00216-018-0911-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2017] [Revised: 12/15/2017] [Accepted: 01/23/2018] [Indexed: 11/25/2022]

Sizochenko N, Gajewicz A, Leszczynski J, Puzyn T. Causation or only correlation? Application of causal inference graphs for evaluating causality in nano-QSAR models. NANOSCALE 2016;8:7203-8. [PMID: 26972917 DOI: 10.1039/c5nr08279j] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]

Glaab E. Building a virtual ligand screening pipeline using free software: a survey. Brief Bioinform 2016;17:352-66. [PMID: 26094053 PMCID: PMC4793892 DOI: 10.1093/bib/bbv037] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2015] [Revised: 05/20/2015] [Indexed: 12/17/2022] Open

Machine Learning Strategy for Accelerated Design of Polymer Dielectrics. Sci Rep 2016;6:20952. [PMID: 26876223 PMCID: PMC4753456 DOI: 10.1038/srep20952] [Citation(s) in RCA: 111] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2015] [Accepted: 01/13/2016] [Indexed: 01/28/2023] Open

Mamy L, Patureau D, Barriuso E, Bedos C, Bessac F, Louchart X, Martin-laurent F, Miege C, Benoit P. Prediction of the Fate of Organic Compounds in the Environment From Their Molecular Properties: A Review. CRITICAL REVIEWS IN ENVIRONMENTAL SCIENCE AND TECHNOLOGY 2015;45:1277-1377. [PMID: 25866458 PMCID: PMC4376206 DOI: 10.1080/10643389.2014.955627] [Citation(s) in RCA: 76] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]

Kumar SP, Jha PC, Jasrai YT, Pandya HA. The effect of various atomic partial charge schemes to elucidate consensus activity-correlating molecular regions: a test case of diverse QSAR models. J Biomol Struct Dyn 2015;34:540-59. [PMID: 25997097 DOI: 10.1080/07391102.2015.1044474] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]

Kandel DD, Raychaudhury C, Pal D. Two new atom centered fragment descriptors and scoring function enhance classification of antibacterial activity. J Mol Model 2014;20:2164. [PMID: 24664120 DOI: 10.1007/s00894-014-2164-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2013] [Accepted: 01/30/2014] [Indexed: 11/26/2022]

Duardo-Sánchez A, Munteanu CR, Riera-Fernández P, López-Díaz A, Pazos A, González-Díaz H. Modeling Complex Metabolic Reactions, Ecological Systems, and Financial and Legal Networks with MIANN Models Based on Markov-Wiener Node Descriptors. J Chem Inf Model 2013;54:16-29. [DOI: 10.1021/ci400280n] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

Tian D, Choi KP. Sharp bounds and normalization of Wiener-type indices. PLoS One 2013;8:e78448. [PMID: 24260118 PMCID: PMC3832646 DOI: 10.1371/journal.pone.0078448] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2013] [Accepted: 09/11/2013] [Indexed: 11/21/2022] Open

Haranczyk M, Urbaszek P, Ng EG, Puzyn T. Combinatorial × Computational × Cheminformatics (C3) Approach to Characterization of Congeneric Libraries of Organic Pollutants. J Chem Inf Model 2012;52:2902-9. [DOI: 10.1021/ci300289b] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]

García GC, Palacios-Bejarano B, Ruiz IL, Gómez-Nieto MÁ. Comparison of representational spaces based on structural information in the development of QSAR models for benzylamino enaminone derivatives. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2012;23:751-774. [PMID: 22988909 DOI: 10.1080/1062936x.2012.719543] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]

Natarajan R. New topological indices with very high discriminatory power. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2011;22:1-20. [PMID: 21391138 DOI: 10.1080/1062936x.2010.528611] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]