Zhu T, Li S, Li L, Tao C. A new perspective on predicting the reaction rate constants of hydrated electrons for organic contaminants: Exploring molecular structure characterization methods and ambient conditions.
THE SCIENCE OF THE TOTAL ENVIRONMENT 2023;
904:166316. [PMID:
37591396 DOI:
10.1016/j.scitotenv.2023.166316]
[Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 07/26/2023] [Accepted: 08/12/2023] [Indexed: 08/19/2023]
Abstract
Hydrated electrons (eaq-) exhibit rapid degradation of diverse persistent organic contaminants (OCs) and hold great promise as a formidable reducing agent in water treatment. However, the diverse structures of compounds exert different influences on the second-order rate constant of hydrated electron reactions (keaq-), while the same OCs demonstrate notable discrepancies in keaq- values across different pH levels. This study aims to develop machine learning (ML) models that can effectively simulate the intricate reaction kinetics between eaq- and OCs. Furthermore, the introduction of the pH variable enables a comprehensive investigation into the impact of ambient conditions on this process, thereby improving the practicality of the model. A dataset encompassing 701 keaq- values derived from 351 peer-reviewed publications was compiled. To comprehensively investigate compound properties, this study introduced molecular descriptor (MD), molecular fingerprint (MF), and the integration of both (MD + MF) as model variables. Furthermore, 60 sets of predictive models were established utilizing two variable screening methodologies (MLR and RF) and ten prominent algorithms. Through statistical parameter analysis, it was determined that descriptors combined with MD and MF, the RF screening method, and the symbolism algorithm exhibited the best predictive efficacy. Importantly, the combination of descriptor models exhibited significantly superior performance compared to individual MF and MD models. Notably, the optimal model, denoted as RF - (MF + MD) - LGB, exhibited highly satisfactory predictive results (R2tra = 0.967, Q2tra = 0.840, R2ext = 0.761). The mechanistic explanation study based on Shapley Additive Explanations (SHAP) values further elucidated the crucial influences of polarity, pH, molecular weight, electronegativity, carbon-carbon double bonds, and molecular topology on the degradation of OCs by eaq-. The proposed modeling approach, particularly the integration of MF and MD, alongside the introduction of pH, may furnish innovative ideas for advanced reduction or oxidation processes (ARPs/AOPs) and machine learning applications in other domains.
Collapse