1
|
Wang C, Wang L, Yu H, Seo A, Wang Z, Rajabzadeh S, Ni BJ, Shon HK. Machine learning for layer-by-layer nanofiltration membrane performance prediction and polymer candidate exploration. CHEMOSPHERE 2024; 350:140999. [PMID: 38151066 DOI: 10.1016/j.chemosphere.2023.140999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 12/18/2023] [Accepted: 12/19/2023] [Indexed: 12/29/2023]
Abstract
In this study, machine learning-based models were established for layer-by-layer (LBL) nanofiltration (NF) membrane performance prediction and polymer candidate exploration. Four different models, i.e., linear, random forest (RF), boosted tree (BT), and eXtreme Gradient Boosting (XGBoost), were formed, and membrane performance prediction was determined in terms of membrane permeability and selectivity. The XGBoost exhibited optimal prediction accuracy for membrane permeability (coefficient of determination (R2): 0.99) and membrane selectivity (R2: 0.80). The Shapley Additive exPlanation (SHAP) method was utilized to evaluate the effects of different LBL NF membrane fabrication conditions on membrane performances. The SHAP method was also used to identify the relationships between polymer structure and membrane performance. Polymers were represented by Morgan fingerprint, which is an effective description approach for developing modeling. Based on the SHAP value results, two reference Morgan fingerprints were constructed containing atomic groups with positive contributions to membrane permeability and selectivity. According to the reference Morgan fingerprint, 204 potential polymers were explored from the largest polymer database (PoLyInfo). By calculating the similarities between each potential polymer and both reference Morgan fingerprints, 23 polymer candidates were selected and could be further used for LBL NF membrane fabrication with the potential for providing good membrane performance. Overall, this work provided new ways both for LBL NF membrane performance prediction and high-performance polymer candidate exploration. The source code for the models and algorithms used in this study is publicly available to facilitate replication and further research. https://github.com/wangliwfsd/LLNMPP/.
Collapse
Affiliation(s)
- Chen Wang
- School of Civil and Environmental Engineering, University of Technology Sydney, Sydney, New South Wales, 2007, Australia
| | - Li Wang
- CSIRO Space and Astronomy, PO Box 1130, Bentley, WA, 6102, Australia
| | - Hanwei Yu
- School of Civil and Environmental Engineering, University of Technology Sydney, Sydney, New South Wales, 2007, Australia
| | - Allan Seo
- School of Civil and Environmental Engineering, University of Technology Sydney, Sydney, New South Wales, 2007, Australia
| | - Zhining Wang
- Shandong Provincial Key Laboratory of Water Pollution Control and Resource Reuse, School of Environmental Science and Engineering, Shandong University, Qingdao, 266237, China
| | - Saeid Rajabzadeh
- School of Civil and Environmental Engineering, University of Technology Sydney, Sydney, New South Wales, 2007, Australia
| | - Bing-Jie Ni
- School of Civil and Environmental Engineering, University of New South Wales, Sydney, New South Wales, 2052, Australia
| | - Ho Kyong Shon
- School of Civil and Environmental Engineering, University of Technology Sydney, Sydney, New South Wales, 2007, Australia.
| |
Collapse
|