1
|
Yang Y, Yang Z, Pang X, Cao H, Sun Y, Wang L, Zhou Z, Wang P, Liang Y, Wang Y. Molecular designing of potential environmentally friendly PFAS based on deep learning and generative models. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 953:176095. [PMID: 39245376 DOI: 10.1016/j.scitotenv.2024.176095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Revised: 09/03/2024] [Accepted: 09/04/2024] [Indexed: 09/10/2024]
Abstract
Perfluoroalkyl and polyfluoroalkyl substances (PFAS) are widely used across a spectrum of industrial and consumer goods. Nonetheless, their persistent nature and tendency to accumulate in biological systems pose substantial environmental and health threats. Consequently, striking a balance between maximizing product efficiency and minimizing environmental and health risks by tailoring the molecular structure of PFAS has become a pivotal challenge in the fields of environmental chemistry and sustainable development. To address this issue, a computational workflow was proposed for designing an environmentally friendly PFAS by incorporating deep learning (DL) and molecular generative models. The hybrid DL architecture MolHGT+ based on heterogeneous graph neural network with transformer-like attention was applied to predict the surface tension, bioaccumulation, and hepatotoxicity of the molecules. Through virtual screening of the PFAS master database using MolHGT+, the findings indicate that incorporating the siloxane group and betaine fragment can effectively decrease both the bioaccumulation and hepatotoxicity of PFAS while preserving low surface tension. In addition, molecular generative models were employed to create a structurally diverse pool of novel PFASs with the aforementioned hit molecules serving as the initial template structures. Overall, our study presents a promising AI-driven method for advancing the development of environmentally friendly PFAS.
Collapse
Affiliation(s)
- Ying Yang
- Hubei Key Laboratory of Environmental and Health Effects of Persistent Toxic Substances, School of Environment and Health, Jianghan University, Wuhan 430056, China
| | - Zeguo Yang
- Hubei Key Laboratory of Environmental and Health Effects of Persistent Toxic Substances, School of Environment and Health, Jianghan University, Wuhan 430056, China
| | - Xudi Pang
- Hubei Key Laboratory of Environmental and Health Effects of Persistent Toxic Substances, School of Environment and Health, Jianghan University, Wuhan 430056, China
| | - Huiming Cao
- Hubei Key Laboratory of Environmental and Health Effects of Persistent Toxic Substances, School of Environment and Health, Jianghan University, Wuhan 430056, China.
| | - Yuzhen Sun
- Hubei Key Laboratory of Environmental and Health Effects of Persistent Toxic Substances, School of Environment and Health, Jianghan University, Wuhan 430056, China
| | - Ling Wang
- Hubei Key Laboratory of Environmental and Health Effects of Persistent Toxic Substances, School of Environment and Health, Jianghan University, Wuhan 430056, China
| | - Zhen Zhou
- Hubei Key Laboratory of Environmental and Health Effects of Persistent Toxic Substances, School of Environment and Health, Jianghan University, Wuhan 430056, China
| | - Pu Wang
- Hubei Key Laboratory of Environmental and Health Effects of Persistent Toxic Substances, School of Environment and Health, Jianghan University, Wuhan 430056, China
| | - Yong Liang
- Hubei Key Laboratory of Environmental and Health Effects of Persistent Toxic Substances, School of Environment and Health, Jianghan University, Wuhan 430056, China.
| | - Yawei Wang
- Hubei Key Laboratory of Environmental and Health Effects of Persistent Toxic Substances, School of Environment and Health, Jianghan University, Wuhan 430056, China; State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
| |
Collapse
|
2
|
Yang S, Ni J, Xu P. AI4ACEIP: A Computing Tool to Identify Food Peptides with High Inhibitory Activity for ACE by Merged Molecular Representation and Rich Intrinsic Sequence Information Based on an Ensemble Learning Strategy. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2024. [PMID: 39495772 DOI: 10.1021/acs.jafc.4c05650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/06/2024]
Abstract
Hypertension is a common chronic disorder and a major risk factor for cardiovascular diseases. Angiotensin-converting enzyme (ACE) converts angiotensin I to angiotensin II, causing vasoconstriction and raising blood pressure. Pharmacotherapy is the mainstay of traditional hypertension treatment, leading to various negative side effects. Some food-derived peptides can suppress ACE, named ACEIP with fewer undesirable effects. Therefore, it is crucial to seek strong dietary ACEIP to aid in hypertension treatment. In this article, we propose a new model called AI4ACEIP to identify ACEIP. AI4ACEIP uses a novel two-layer stacked ensemble architecture to predict ACEIP relying on integrated view features derived from sequence, large language models, and molecular-based information. The analysis of feature combinations reveals that four selected integrated feature pairs exhibit enhancing performance for identifying ACEIP. For finding meta models with strong abilities to learn information from integrated feature pairs, PowerShap, a feature selection method, is used to select 40 optimal feature and meta model combinations. Compared with seven state-of-the-art methods on the source and clear benchmark data sets, AI4ACEIP significantly outperformed by 8.47 to 20.65% and 5.49 to 14.42% for Matthew's correlation coefficient. In brief, AI4ACEIP is a reliable model for ACEIP prediction and is freely available at https://github.com/abcair/AI4ACEIP.
Collapse
Affiliation(s)
- Sen Yang
- School of Computer Science and Artificial Intelligence, Aliyun School of Big Data School of Software, Changzhou University, Changzhou 213164, China
- The Affiliated Changzhou No.2 People's Hospital of Nanjing Medical University, Changzhou 213164, China
| | - Jiaqi Ni
- School of Computer Science and Artificial Intelligence, Aliyun School of Big Data School of Software, Changzhou University, Changzhou 213164, China
| | - Piao Xu
- College of Economics and Management, Nanjing Forestry University, Nanjing 210037, China
| |
Collapse
|
3
|
Bao Z, Tom G, Cheng A, Watchorn J, Aspuru-Guzik A, Allen C. Towards the prediction of drug solubility in binary solvent mixtures at various temperatures using machine learning. J Cheminform 2024; 16:117. [PMID: 39468626 PMCID: PMC11520512 DOI: 10.1186/s13321-024-00911-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Accepted: 09/28/2024] [Indexed: 10/30/2024] Open
Abstract
Drug solubility is an important parameter in the drug development process, yet it is often tedious and challenging to measure, especially for expensive drugs or those available in small quantities. To alleviate these challenges, machine learning (ML) has been applied to predict drug solubility as an alternative approach. However, the majority of existing ML research has focused on the predictions of aqueous solubility and/or solubility at specific temperatures, which restricts the model applicability in pharmaceutical development. To bridge this gap, we compiled a dataset of 27,000 solubility datapoints, including solubility of small molecules measured in a range of binary solvent mixtures under various temperatures. Next, a panel of ML models were trained on this dataset with their hyperparameters tuned using Bayesian optimization. The resulting top-performing models, both gradient boosted decision trees (light gradient boosting machine and extreme gradient boosting), achieved mean absolute errors (MAE) of 0.33 for LogS (S in g/100 g) on the holdout set. These models were further validated through a prospective study, wherein the solubility of four drug molecules were predicted by the models and then validated with in-house solubility experiments. This prospective study demonstrated that the models accurately predicted the solubility of solutes in specific binary solvent mixtures under different temperatures, especially for drugs whose features closely align within the solutes in the dataset (MAE < 0.5 for LogS). To support future research and facilitate advancements in the field, we have made the dataset and code openly available. Scientific contribution Our research advances the state-of-the-art in predicting solubility for small molecules by leveraging ML and a uniquely comprehensive dataset. Unlike existing ML studies that predominantly focus on solubility in aqueous solvents at fixed temperatures, our work enables prediction of drug solubility in a variety of binary solvent mixtures over a broad temperature range, providing practical insights on the modeling of solubility for realistic pharmaceutical applications. These advancements along with the open access dataset and code support significant steps in the drug development process including new molecule discovery, drug analysis and formulation.
Collapse
Affiliation(s)
- Zeqing Bao
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, M5S 3M2, Canada
| | - Gary Tom
- Department of Chemistry, University of Toronto, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, M5S 1M1, Canada
| | - Austin Cheng
- Department of Chemistry, University of Toronto, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, M5S 1M1, Canada
| | | | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, M5S 1M1, Canada
- Acceleration Consortium, Toronto, ON, M5S 3H6, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), Toronto, ON, M5S 1M1, Canada
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON, M5S 3E5, Canada
- Department of Materials Science and Engineering, University of Toronto, Toronto, ON, M5S 3E4, Canada
- CIFAR Artificial Intelligence Research Chair, Vector Institute, Toronto, ON, M5S 1M1, Canada
| | - Christine Allen
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, M5S 3M2, Canada.
- Acceleration Consortium, Toronto, ON, M5S 3H6, Canada.
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON, M5S 3E5, Canada.
| |
Collapse
|
4
|
Liu R, Liu J, Zhou P. Theoretical advances in understanding and enhancing the thermostability of energetic materials. Phys Chem Chem Phys 2024; 26:26209-26221. [PMID: 39380550 DOI: 10.1039/d4cp02499k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/10/2024]
Abstract
The quest for thermally stable energetic materials is pivotal in advancing the safety of applications ranging from munitions to aerospace. This perspective delves into the role of theoretical methodologies in interpreting and advancing the thermal stability of energetic materials. Quantum chemical calculations offer an in-depth understanding of the molecular and electronic structure properties of energetic compounds related to thermal stability. It is also essential to incorporate the surrounding interactions and their impact on molecular stability. Ab initio molecular dynamics (AIMD) simulations provide detailed theoretical insights into the reaction pathways and the key intermediates during thermal decomposition in the condensed phase. Analyzing the kinetic barrier of rate-determining steps under various temperature and pressure conditions allows for a comprehensive assessment of thermal stability. Recent advances in machine learning have demonstrated their utility in constructing potential energy surfaces and predicting thermal stability for newly designed energetic materials. The machine learning-assisted high-throughput virtual screening (HTVS) methodology can accelerate the discovery of novel energetic materials with improved properties. As a result, the newly identified and synthesized energetic molecule ICM-104 revealed excellence in performance and thermostability. Theoretical approaches are pivotal in elucidating the mechanisms underlying thermal stability, enabling the prediction and design of enhanced thermal stability for emerging EMs. These insights are instrumental in accelerating the development of novel energetic materials that optimally balance performance and thermal stability.
Collapse
Affiliation(s)
- Runze Liu
- School of Science, Dalian Jiaotong University, Dalian 116028, P. R. China
- Institute of Frontier Chemistry, School of Chemistry and Chemical Engineering, Shandong University, Qingdao 266235, P. R. China.
| | - Jianyong Liu
- Research Center of Advanced Biological Manufacture, Dalian National Laboratory for Clean Energy, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, P. R. China.
| | - Panwang Zhou
- State Key Laboratory of Catalysis, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, P. R. China
- Institute of Frontier Chemistry, School of Chemistry and Chemical Engineering, Shandong University, Qingdao 266235, P. R. China.
| |
Collapse
|
5
|
Boiko DA, Arkhipova DM, Ananikov VP. Recognition of Molecular Structure of Phosphonium Salts from the Visual Appearance of Material with Deep Learning Can Reveal Subtle Homologs. SMALL (WEINHEIM AN DER BERGSTRASSE, GERMANY) 2024:e2403423. [PMID: 39254289 DOI: 10.1002/smll.202403423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Revised: 07/31/2024] [Indexed: 09/11/2024]
Abstract
Determining molecular structures is foundational in chemistry and biology. The notion of discerning molecular structures simply from the visual appearance of a material remained almost unthinkable until the advent of machine learning. This paper introduces a pioneering approach bridging the visual appearance of materials (both at the micro- and nanostructural levels) with traditional chemical structure analysis methods. Quaternary phosphonium salts are opted as the model compounds, given their significant roles in diverse chemical and medicinal fields and their ability to form homologs with only minute intermolecular variances. This research results in the successful creation of a neural network model capable of recognizing molecular structures from visual electron microscopy images of the material. The performance of the model is evaluated and related to the chemical nature of the studied chemicals. Additionally, unsupervised domain transfer is tested as a method to use the resulting model on optical microscopy images, as well as test models trained on optical images directly. The robustness of the method is further tested using a complex system of phosphonium salt mixtures. To the best of the authors' knowledge, this study offers the first evidence of the feasibility of discerning nearly indistinguishable molecular structures.
Collapse
Affiliation(s)
- Daniil A Boiko
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky Prospect, 47, Moscow, 119991, Russia
| | - Daria M Arkhipova
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky Prospect, 47, Moscow, 119991, Russia
| | - Valentine P Ananikov
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky Prospect, 47, Moscow, 119991, Russia
| |
Collapse
|
6
|
Pallikara I, Skelton JM, Hatcher LE, Pallipurath AR. Going beyond the Ordered Bulk: A Perspective on the Use of the Cambridge Structural Database for Predictive Materials Design. CRYSTAL GROWTH & DESIGN 2024; 24:6911-6930. [PMID: 39247224 PMCID: PMC11378158 DOI: 10.1021/acs.cgd.4c00694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Revised: 07/26/2024] [Accepted: 07/30/2024] [Indexed: 09/10/2024]
Abstract
When Olga Kennard founded the Cambridge Crystallographic Data Centre in 1965, the Cambridge Structural Database was a pioneering attempt to collect scientific data in a standard format. Since then, it has evolved into an indispensable resource in contemporary molecular materials science, with over 1.25 million structures and comprehensive software tools for searching, visualizing and analyzing the data. In this perspective, we discuss the use of the CSD and CCDC tools to address the multiscale challenge of predictive materials design. We provide an overview of the core capabilities of the CSD and CCDC software and demonstrate their application to a range of materials design problems with recent case studies drawn from topical research areas, focusing in particular on the use of data mining and machine learning techniques. We also identify several challenges that can be addressed with existing capabilities or through new capabilities with varying levels of development effort.
Collapse
Affiliation(s)
- Ioanna Pallikara
- School of Chemical and Process Engineering, University of Leeds, Leeds LS2 9JT, U.K
| | - Jonathan M Skelton
- Department of Chemistry, University of Manchester, Manchester M13 9PL, U.K
| | | | | |
Collapse
|
7
|
Niu X, Zhang Q, Dang Y, Hu W, Sun Y. MolPackL: Quantification and Interpretation of Intermolecular Interactions Driven by Molecular Packing. J Am Chem Soc 2024; 146:24075-24084. [PMID: 39141522 DOI: 10.1021/jacs.4c08132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/16/2024]
Abstract
In organic optoelectronic devices, the properties of the aggregated organic materials depend not only on individual molecules or monomers but also significantly on their packing modes. Different from their inorganic counterparts linked by explicit covalent bonds, organic solids exhibit intricate and numerous intermolecular interactions (IMIs). Due to the intrinsic complexity and disorder of IMIs, identifying and understanding them is a formidable challenge in experimental, theoretical, and data-driven approaches. In this work, we constructed an innovative algorithm framework, Molecular Packing Learning (MolPackL), which can accurately quantify elusive IMIs using contact density histograms (CDHs) and efficiently extract intermolecular features for further property prediction of organic solids. It performs satisfactorily in training predictive models of IMI-related properties in molecular crystals. Particularly, the band gap predictive model based on MolPackL achieved the best-reported performance, with an MAE of 0.20 eV and an impressive R2 of 0.92. Class activation mapping (CAM) visually demonstrates MolPackL's accurate identification of effective interaction sites as the molecular packing changes. What is more, the elemental importance analysis verified that the superior score benefits from MolPackL's ability to comprehensively consider multiple influencing factors of IMIs. In summary, MolPackL provides a new framework for quantitative assessment and understanding of the effect of IMIs. The development of MolPackL marks a significant advancement in establishing predictive models of molecular aggregates, deepening the comprehension of IMIs on the material properties. Given the superior performance, we believe that MolPackL will also become a powerful tool in the design of high-performance organic optoelectronic materials.
Collapse
Affiliation(s)
- Xinxin Niu
- Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin 300072, P.R. China
| | - Qian Zhang
- Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin 300072, P.R. China
| | - Yanfeng Dang
- Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin 300072, P.R. China
| | - Wenping Hu
- Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin 300072, P.R. China
- Joint School of National University of Singapore and Tianjin University, Fuzhou 350207, P.R. China
| | - Yajing Sun
- Tianjin Key Laboratory of Molecular Optoelectronic Sciences, Department of Chemistry, School of Science, Tianjin University, Tianjin 300072, P.R. China
| |
Collapse
|
8
|
Liu Y, Yang F, Zhang W, Xia H, Wu Z, Zhang Z. High precision deep-learning model combined with high-throughput screening to discover fused [5,5] biheterocyclic energetic materials with excellent comprehensive properties. RSC Adv 2024; 14:23672-23682. [PMID: 39077321 PMCID: PMC11284349 DOI: 10.1039/d4ra03233k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Accepted: 07/16/2024] [Indexed: 07/31/2024] Open
Abstract
Finding novel energetic materials with good comprehensive performance has always been challenging because of the low efficiency in conventional trial and error experimental procedure. In this paper, we established a deep learning model with high prediction accuracy using embedded features in Directed Message Passing Neural Networks. The model combined with high-throughput screening was shown to facilitate rapid discovery of fused [5,5] biheterocyclic energetic materials with high energy and excellent thermal stability. Density Functional Theory (DFT) calculations proved that the performances of the targeting molecules are consistent with the predicted results from the deep learning model. Furthermore, 6,7-trinitro-3H-pyrrolo[1,2-b][1,2,4]triazo-5-amine with both good detonation properties and thermal stability was screened out, whose crystal structure and intermolecular interactions were also analyzed.
Collapse
Affiliation(s)
- Youhai Liu
- School of Chemical Engineering and Technology, Xi'an Jiaotong University Xi'an 710049 China
| | - Fusheng Yang
- School of Chemical Engineering and Technology, Xi'an Jiaotong University Xi'an 710049 China
| | - Wenquan Zhang
- Research Center of Energetic Material Genome Science, Institute of Chemical Materials, China Academy of Engineering Physics (CAEP) Mianyang 621900 P. R. China
| | - Honglei Xia
- Research Center of Energetic Material Genome Science, Institute of Chemical Materials, China Academy of Engineering Physics (CAEP) Mianyang 621900 P. R. China
| | - Zhen Wu
- School of Chemical Engineering and Technology, Xi'an Jiaotong University Xi'an 710049 China
| | - Zaoxiao Zhang
- School of Chemical Engineering and Technology, Xi'an Jiaotong University Xi'an 710049 China
| |
Collapse
|
9
|
Luan Y, Li X, Kong D, Li W, Li W, Zhang Q, Pang A. Development and uniqueness test of highly selective atomic topological indices based on the number of attached hydrogen atoms. J Mol Graph Model 2024; 129:108752. [PMID: 38479237 DOI: 10.1016/j.jmgm.2024.108752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Accepted: 02/27/2024] [Indexed: 04/15/2024]
Abstract
On the basis of the atomic graph-theoretical index - aEAID (atomic Extended Adjacency matrix IDentification) and molecular adjacent topological index - ATID (Adjacent Topological IDentification) suggested by one of the authors (Zhang Q), a highly selective atomic topological index - aATID (atomic Adjacent Topological IDentification) index was suggested to identify the equivalent atoms in this study. The aATID index of an atom was derived from the number of the attached hydrogen atoms of the atom but omitting bond types. In this case, the suggested index can be used to identify equivalent atoms in chemistry but perhaps not equivalent in the molecular graph. To test the uniqueness of aATID indices, the virtual atomic data sets were derived from alkanes containing 15-20 carbon atoms and the isomers of Octogen, as well as a real data set was derived from the NCI database. Only four pairs of atoms from alkanes containing 20 carbons can't be discriminated by aATID, that is, four pairs of degenerates were found for this data set. To solve this problem, the aATID index was modified by introducing distance factors between atoms, and the 2-aATID index was suggested. Its uniqueness was examined by 5,939,902 atoms derived from alkanes containing 20 carbons and further 16,166,984 atoms from alkanes of 21 carbons, and no degenerates were found. In addition, another large real data set of 16,650,688 atoms derived from the PubChem database was also used to test the uniqueness of both aATID and 2-aATID. As a result, each atom was successfully discriminated by any of the two indices. Finally, the suggested aATID index was applied to the identification of duplicate atoms as data pretreatment for QSPR (Quantitative Structure-Property Relationships) studies.
Collapse
Affiliation(s)
- Yue Luan
- Henan Engineering Research Center of Industrial Circulating Water Treatment, Henan Joint International Research Laboratory of Environmental Pollution Control Materials, Henan University, Kaifeng, 475004, China
| | - Xianlan Li
- Henan Engineering Research Center of Industrial Circulating Water Treatment, Henan Joint International Research Laboratory of Environmental Pollution Control Materials, Henan University, Kaifeng, 475004, China
| | - Dingling Kong
- Henan Engineering Research Center of Industrial Circulating Water Treatment, Henan Joint International Research Laboratory of Environmental Pollution Control Materials, Henan University, Kaifeng, 475004, China
| | - Wanli Li
- Henan Engineering Research Center of Industrial Circulating Water Treatment, Henan Joint International Research Laboratory of Environmental Pollution Control Materials, Henan University, Kaifeng, 475004, China
| | - Wei Li
- Science and Technology on Aerospace Chemical Power Laboratory, Hubei Institute of Aerospace Chemotechnology, Xiangyang, 441003, Hubei, China
| | - Qingyou Zhang
- Henan Engineering Research Center of Industrial Circulating Water Treatment, Henan Joint International Research Laboratory of Environmental Pollution Control Materials, Henan University, Kaifeng, 475004, China.
| | - Aimin Pang
- Science and Technology on Aerospace Chemical Power Laboratory, Hubei Institute of Aerospace Chemotechnology, Xiangyang, 441003, Hubei, China.
| |
Collapse
|
10
|
King-Smith E. Transfer learning for a foundational chemistry model. Chem Sci 2024; 15:5143-5151. [PMID: 38577363 PMCID: PMC10988575 DOI: 10.1039/d3sc04928k] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 11/15/2023] [Indexed: 04/06/2024] Open
Abstract
Data-driven chemistry has garnered much interest concurrent with improvements in hardware and the development of new machine learning models. However, obtaining sufficiently large, accurate datasets of a desired chemical outcome for data-driven chemistry remains a challenge. The community has made significant efforts to democratize and curate available information for more facile machine learning applications, but the limiting factor is usually the laborious nature of generating large-scale data. Transfer learning has been noted in certain applications to alleviate some of the data burden, but this protocol is typically carried out on a case-by-case basis, with the transfer learning task expertly chosen to fit the finetuning. Herein, I develop a machine learning framework capable of accurate chemistry-relevant prediction amid general sources of low data. First, a chemical "foundational model" is trained using a dataset of ∼1 million experimental organic crystal structures. A task specific module is then stacked atop this foundational model and subjected to finetuning. This approach achieves state-of-the-art performance on a diverse set of tasks: toxicity prediction, yield prediction, and odor prediction.
Collapse
|
11
|
Zhang D, Chu Q, Chen D. Predicting the enthalpy of formation of energetic molecules via conventional machine learning and GNN. Phys Chem Chem Phys 2024; 26:7029-7041. [PMID: 38345363 DOI: 10.1039/d3cp05490j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]
Abstract
Machine learning (ML) provides a promising method for efficiently and accurately predicting molecular properties. Using ML models to predict the enthalpy of formation of energetic molecules helps in fast screening of potential high-energy molecules, thereby accelerating the design of energetic materials. A persistent challenge is to determine the optimal featurization methods for molecular representation and use an appropriate ML model. Thus, in our study, we evaluate various featurization methods (CDS, ECFP, SOAP, GNF) and ML models (RF, MLP, GCN, MPNN), dividing them into two groups: conventional ML models and GNN models, to predict the enthalpy of formation of potential high-energy molecules. Our results demonstrate that CDS and SOAP have advantages over the ECFP, while the GNFs in GCN and MPNN models perform better. Furthermore, the MPNN model performs best among all models with a root mean square error (RMSE) as low as 8.42 kcal mol-1, surpassing even the best performing CDS-MLP model in conventional ML models. Overall, this study provides a benchmark for ML in predicting enthalpy of formation and emphasizes the tremendous potential of GNN in property prediction.
Collapse
Affiliation(s)
- Di Zhang
- State Key Laboratory of Explosion Science and Safety Protection, Beijing 100081, China.
| | - Qingzhao Chu
- State Key Laboratory of Explosion Science and Safety Protection, Beijing 100081, China.
| | - Dongping Chen
- State Key Laboratory of Explosion Science and Safety Protection, Beijing 100081, China.
| |
Collapse
|
12
|
Malashin IP, Tynchenko VS, Nelyub VA, Borodulin AS, Gantimurov AP. Estimation and Prediction of the Polymers' Physical Characteristics Using the Machine Learning Models. Polymers (Basel) 2023; 16:115. [PMID: 38201778 PMCID: PMC10780762 DOI: 10.3390/polym16010115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2023] [Revised: 12/23/2023] [Accepted: 12/27/2023] [Indexed: 01/12/2024] Open
Abstract
This article investigates the utility of machine learning (ML) methods for predicting and analyzing the diverse physical characteristics of polymers. Leveraging a rich dataset of polymers' characteristics, the study encompasses an extensive range of polymer properties, spanning compressive and tensile strength to thermal and electrical behaviors. Using various regression methods like Ensemble, Tree-based, Regularization, and Distance-based, the research undergoes thorough evaluation using the most common quality metrics. As a result of a series of experimental studies on the selection of effective model parameters, those that provide a high-quality solution to the stated problem were found. The best results were achieved by Random Forest with the highest R2 scores of 0.71, 0.73, and 0.88 for glass transition, thermal decomposition, and melting temperatures, respectively. The outcomes are intricately compared, providing valuable insights into the efficiency of distinct ML approaches in predicting polymer properties. Unknown values for each characteristic were predicted, and a method validation was performed by training on the predicted values, comparing the results with the specified variance values of each characteristic. The research not only advances our comprehension of polymer physics but also contributes to informed model selection and optimization for materials science applications.
Collapse
Affiliation(s)
- Ivan Pavlovich Malashin
- Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia; (V.A.N.); (A.S.B.); (A.P.G.)
| | - Vadim Sergeevich Tynchenko
- Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia; (V.A.N.); (A.S.B.); (A.P.G.)
- Information-Control Systems Department, Institute of Computer Science and Telecommunications, Reshetnev Siberian State University of Science and Technology, 660037 Krasnoyarsk, Russia
- Department of Technological Machines and Equipment of Oil and Gas Complex, School of Petroleum and Natural Gas Engineering, Siberian Federal University, 660041 Krasnoyarsk, Russia
| | - Vladimir Aleksandrovich Nelyub
- Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia; (V.A.N.); (A.S.B.); (A.P.G.)
| | - Aleksei Sergeevich Borodulin
- Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia; (V.A.N.); (A.S.B.); (A.P.G.)
| | - Andrei Pavlovich Gantimurov
- Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, 105005 Moscow, Russia; (V.A.N.); (A.S.B.); (A.P.G.)
| |
Collapse
|
13
|
Li J, Wu N, Zhang J, Wu HH, Pan K, Wang Y, Liu G, Liu X, Yao Z, Zhang Q. Machine Learning-Assisted Low-Dimensional Electrocatalysts Design for Hydrogen Evolution Reaction. NANO-MICRO LETTERS 2023; 15:227. [PMID: 37831203 PMCID: PMC10575847 DOI: 10.1007/s40820-023-01192-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 08/10/2023] [Indexed: 10/14/2023]
Abstract
Efficient electrocatalysts are crucial for hydrogen generation from electrolyzing water. Nevertheless, the conventional "trial and error" method for producing advanced electrocatalysts is not only cost-ineffective but also time-consuming and labor-intensive. Fortunately, the advancement of machine learning brings new opportunities for electrocatalysts discovery and design. By analyzing experimental and theoretical data, machine learning can effectively predict their hydrogen evolution reaction (HER) performance. This review summarizes recent developments in machine learning for low-dimensional electrocatalysts, including zero-dimension nanoparticles and nanoclusters, one-dimensional nanotubes and nanowires, two-dimensional nanosheets, as well as other electrocatalysts. In particular, the effects of descriptors and algorithms on screening low-dimensional electrocatalysts and investigating their HER performance are highlighted. Finally, the future directions and perspectives for machine learning in electrocatalysis are discussed, emphasizing the potential for machine learning to accelerate electrocatalyst discovery, optimize their performance, and provide new insights into electrocatalytic mechanisms. Overall, this work offers an in-depth understanding of the current state of machine learning in electrocatalysis and its potential for future research.
Collapse
Affiliation(s)
- Jin Li
- College of Chemistry and Chemical Engineering, and Henan Key Laboratory of Function-Oriented Porous Materials, Luoyang Normal University, Luoyang, 471934, People's Republic of China
| | - Naiteng Wu
- College of Chemistry and Chemical Engineering, and Henan Key Laboratory of Function-Oriented Porous Materials, Luoyang Normal University, Luoyang, 471934, People's Republic of China
| | - Jian Zhang
- New Energy Technology Engineering Lab of Jiangsu Province, College of Science, Nanjing University of Posts and Telecommunications (NUPT), Nanjing, 210023, People's Republic of China
| | - Hong-Hui Wu
- School of Materials Science and Engineering, University of Science and Technology Beijing, Beijing, 100083, People's Republic of China.
- Department of Chemistry, University of Nebraska-Lincoln, Lincoln, NE, 8588, USA.
| | - Kunming Pan
- Henan Key Laboratory of High-Temperature Structural and Functional Materials, National Joint Engineering Research Center for Abrasion Control and Molding of Metal Materials, Henan University of Science and Technology, Luoyang, 471003, People's Republic of China
| | - Yingxue Wang
- National Engineering Laboratory for Risk Perception and Prevention, Beijing, 100041, People's Republic of China.
| | - Guilong Liu
- College of Chemistry and Chemical Engineering, and Henan Key Laboratory of Function-Oriented Porous Materials, Luoyang Normal University, Luoyang, 471934, People's Republic of China
| | - Xianming Liu
- College of Chemistry and Chemical Engineering, and Henan Key Laboratory of Function-Oriented Porous Materials, Luoyang Normal University, Luoyang, 471934, People's Republic of China.
| | - Zhenpeng Yao
- Center of Hydrogen Science, Shanghai Jiao Tong University, Shanghai, 200000, People's Republic of China
- State Key Laboratory of Metal Matrix Composites, School of Materials Science and Engineering, Shanghai Jiao Tong University, Shanghai, 200000, People's Republic of China
| | - Qiaobao Zhang
- State Key Laboratory of Physical Chemistry of Solid Surfaces, College of Materials, Xiamen University, Xiamen, 361005, People's Republic of China.
| |
Collapse
|
14
|
Jin JX, Ren GP, Hu J, Liu Y, Gao Y, Wu KJ, He Y. Force field-inspired transformer network assisted crystal density prediction for energetic materials. J Cheminform 2023; 15:65. [PMID: 37468954 DOI: 10.1186/s13321-023-00736-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Accepted: 07/12/2023] [Indexed: 07/21/2023] Open
Abstract
Machine learning has great potential in predicting chemical information with greater precision than traditional methods. Graph neural networks (GNNs) have become increasingly popular in recent years, as they can automatically learn the features of the molecule from the graph, significantly reducing the time needed to find and build molecular descriptors. However, the application of machine learning to energetic materials property prediction is still in the initial stage due to insufficient data. In this work, we first curated a dataset of 12,072 compounds containing CHON elements, which are traditionally regarded as main composition elements of energetic materials, from the Cambridge Structural Database, then we implemented a refinement to our force field-inspired neural network (FFiNet), through the adoption of a Transformer encoder, resulting in force field-inspired Transformer network (FFiTrNet). After the improvement, our model outperforms other machine learning-based and GNNs-based models and shows its powerful predictive capabilities especially for high-density materials. Our model also shows its capability in predicting the crystal density of potential energetic materials dataset (i.e. Huang & Massa dataset), which will be helpful in practical high-throughput screening of energetic materials.
Collapse
Affiliation(s)
- Jun-Xuan Jin
- Zhejiang Provincial Key Laboratory of Advanced Chemical Engineering Manufacture Technology, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, China
- Institute of Zhejiang University-Quzhou, Quzhou, 324000, China
| | - Gao-Peng Ren
- Zhejiang Provincial Key Laboratory of Advanced Chemical Engineering Manufacture Technology, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, China
- Institute of Zhejiang University-Quzhou, Quzhou, 324000, China
| | - Jianjian Hu
- Xi'an Modern Chemistry Research Institute, Xi'an, 710065, China
| | - Yingzhe Liu
- Xi'an Modern Chemistry Research Institute, Xi'an, 710065, China
| | - Yunhu Gao
- Department of Engineering, University of Cambridge, Cambridge, CB2 1PZ, UK
| | - Ke-Jun Wu
- Zhejiang Provincial Key Laboratory of Advanced Chemical Engineering Manufacture Technology, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, China.
- Institute of Zhejiang University-Quzhou, Quzhou, 324000, China.
| | - Yuchen He
- State Key Laboratory of Industrial Control Technology, College of Control Science and Engineering, Zhejiang University, Hangzhou, 310027, China.
| |
Collapse
|
15
|
Shirokii N, Din Y, Petrov I, Seregin Y, Sirotenko S, Razlivina J, Serov N, Vinogradov V. Quantitative Prediction of Inorganic Nanomaterial Cellular Toxicity via Machine Learning. SMALL (WEINHEIM AN DER BERGSTRASSE, GERMANY) 2023; 19:e2207106. [PMID: 36772908 DOI: 10.1002/smll.202207106] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 01/09/2023] [Indexed: 05/11/2023]
Abstract
Organic chemistry has seen colossal progress due to machine learning (ML). However, the translation of artificial intelligence (AI) into materials science is challenging, where biological behavior prediction becomes even more complicated. Nanotoxicity is a critical parameter that describes their interaction with the living organisms screened in every bio-related research. To prevent excessive experiments, such properties have to be pre-evaluated. Several existing ML models partially fulfill the gap by predicting whether a nanomaterial is toxic or not. Yet, this binary categorization neglects the concentration dependencies crucial for experimental scientists. Here, an ML-based approach is proposed to the quantitative prediction of inorganic nanomaterial cytotoxicity achieving the precision expressed by 10-fold cross-validation (CV) Q2 = 0.86 with the root mean squared error (RMSE) of 12.2% obtained by the correlation-based feature selection and grid search-based model hyperparameters optimization. To provide further model flexibility, quantitative atom property-based nanomaterial descriptors are introduced allowing the model to extrapolate on unseen samples. Feature importance is calculated to find an interpretable model with optimal decision-making. These findings allow experimental scientists to perform primary in silico candidate screening and minimize the number of excessive, labor-intensive experiments enabling the rapid development of nanomaterials for medicinal purposes.
Collapse
Affiliation(s)
- Nikolai Shirokii
- International Institute "Solution Chemistry of Advanced Materials and Technologies", ITMO University, 191002, Saint-Petersburg, Russian Federation
| | - Yevgeniya Din
- International Institute "Solution Chemistry of Advanced Materials and Technologies", ITMO University, 191002, Saint-Petersburg, Russian Federation
| | - Ilya Petrov
- International Institute "Solution Chemistry of Advanced Materials and Technologies", ITMO University, 191002, Saint-Petersburg, Russian Federation
| | - Yurii Seregin
- International Institute "Solution Chemistry of Advanced Materials and Technologies", ITMO University, 191002, Saint-Petersburg, Russian Federation
| | - Sofia Sirotenko
- International Institute "Solution Chemistry of Advanced Materials and Technologies", ITMO University, 191002, Saint-Petersburg, Russian Federation
| | - Julia Razlivina
- International Institute "Solution Chemistry of Advanced Materials and Technologies", ITMO University, 191002, Saint-Petersburg, Russian Federation
| | - Nikita Serov
- Advanced Engineering School, Almetyevsk State Oil Institute, Almetyevsk, Russia
| | - Vladimir Vinogradov
- International Institute "Solution Chemistry of Advanced Materials and Technologies", ITMO University, 191002, Saint-Petersburg, Russian Federation
| |
Collapse
|
16
|
Zang X, Zhou X, Bian H, Jin W, Pan X, Jiang J, Koroleva MY, Shen R. Prediction and Construction of Energetic Materials Based on Machine Learning Methods. Molecules 2022; 28:322. [PMID: 36615516 PMCID: PMC9821915 DOI: 10.3390/molecules28010322] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 12/18/2022] [Accepted: 12/28/2022] [Indexed: 01/03/2023] Open
Abstract
Energetic materials (EMs) are the core materials of weapons and equipment. Achieving precise molecular design and efficient green synthesis of EMs has long been one of the primary concerns of researchers around the world. Traditionally, advanced materials were discovered through a trial-and-error processes, which required long research and development (R&D) cycles and high costs. In recent years, the machine learning (ML) method has matured into a tool that compliments and aids experimental studies for predicting and designing advanced EMs. This paper reviews the critical process of ML methods to discover and predict EMs, including data preparation, feature extraction, model construction, and model performance evaluation. The main ideas and basic steps of applying ML methods are analyzed and outlined. The state-of-the-art research about ML applications in property prediction and inverse material design of EMs is further summarized. Finally, the existing challenges and the strategies for coping with challenges in the further applications of the ML methods are proposed.
Collapse
Affiliation(s)
- Xiaowei Zang
- College of Safety Science and Engineering, Nanjing Tech University, Nanjing 211816, China
| | - Xiang Zhou
- School of Chemistry and Chemical Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Haitao Bian
- College of Safety Science and Engineering, Nanjing Tech University, Nanjing 211816, China
| | - Weiping Jin
- Jiangxi Xinyu Guoke Technology Co., Ltd., Xinyu 338018, China
| | - Xuhai Pan
- College of Safety Science and Engineering, Nanjing Tech University, Nanjing 211816, China
| | - Juncheng Jiang
- College of Safety Science and Engineering, Nanjing Tech University, Nanjing 211816, China
| | - M. Yu. Koroleva
- Institute of Modern Energetics and Nanomaterials, D. Mendeleev University of Chemical Technology of Russia, Moscow 125047, Russia
| | - Ruiqi Shen
- School of Chemistry and Chemical Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
- Micro-Nano Energetic Devices Key Laboratory of MIIT, Nanjing 210094, China
- Institute of Space Propulsion, Nanjing University of Science and Technology, Nanjing 210094, China
| |
Collapse
|
17
|
Lansford JL, Barnes BC, Rice BM, Jensen KF. Building Chemical Property Models for Energetic Materials from Small Datasets Using a Transfer Learning Approach. J Chem Inf Model 2022; 62:5397-5410. [PMID: 36240441 DOI: 10.1021/acs.jcim.2c00841] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
For many experimentally measured chemical properties that cannot be directly computed from first-principles, the existing physics-based models do not extrapolate well to out-of-sample molecules, and experimental datasets themselves are too small for traditional machine learning (ML) approaches. To overcome these limitations, we apply a transfer learning approach, whereby we simultaneously train a multi-target regression model on a small number of molecules with experimentally measured values and a large number of molecules with related computed properties. We demonstrate this methodology on predicting the experimentally measured impact sensitivity of energetic crystals, finding that both characteristics of the computed dataset and model architecture are important to prediction accuracy of the small experimental dataset. Our directed-message passing neural network (D-MPNN) ML model using transfer learning outperforms direct-ML and physics-based models on a diverse test set, and the new methods described here are widely applicable to modeling many other structure-property relationships.
Collapse
Affiliation(s)
- Joshua L Lansford
- U.S. Army Combat Capabilities Development Command (DEVCOM) Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States.,Department of Chemical Engineering, MIT, Cambridge, Massachusetts 02139, United States
| | - Brian C Barnes
- U.S. Army Combat Capabilities Development Command (DEVCOM) Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| | - Betsy M Rice
- U.S. Army Combat Capabilities Development Command (DEVCOM) Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| | - Klavs F Jensen
- Department of Chemical Engineering, MIT, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
18
|
Antoniuk ER, Li P, Kailkhura B, Hiszpanski AM. Representing Polymers as Periodic Graphs with Learned Descriptors for Accurate Polymer Property Predictions. J Chem Inf Model 2022; 62:5435-5445. [PMID: 36315033 DOI: 10.1021/acs.jcim.2c00875] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Accurately predicting new polymers' properties with machine learning models apriori to synthesis has potential to significantly accelerate new polymers' discovery and development. However, accurately and efficiently capturing polymers' complex, periodic structures in machine learning models remains a grand challenge for the polymer cheminformatics community. Specifically, there has yet to be an ideal solution for the problems of how to capture the periodicity of polymers, as well as how to optimally develop polymer descriptors without requiring human-based feature design. In this work, we tackle these problems by utilizing a periodic polymer graph representation that accounts for polymers' periodicity and coupling it with a message-passing neural network that leverages the power of graph deep learning to automatically learn chemically relevant polymer descriptors. Remarkably, this approach achieves state-of-the-art performance on 8 out of 10 distinct polymer property prediction tasks. These results highlight the advancement in predictive capability that is possible through learning descriptors that are specifically optimized for capturing the unique chemical structure of polymers.
Collapse
Affiliation(s)
- Evan R Antoniuk
- Materials Science Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California94550-5507, United States
| | - Peggy Li
- Global Security Computing Applications Division, Computing Directorate, Lawrence Livermore National Laboratory, Livermore, California94550-5507, United States
| | - Bhavya Kailkhura
- Machine Intelligence Group/Center for Applied Scientific Computing, Computing Directorate, Lawrence Livermore National Laboratory, Livermore, California94550-5507, United States
| | - Anna M Hiszpanski
- Materials Science Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California94550-5507, United States
| |
Collapse
|
19
|
Zhang Z, Cheng M, Xiao X, Bi K, Song T, Hu KQ, Dai Y, Zhou L, Liu C, Ji X, Shi WQ. Machine-Learning-Guided Identification of Coordination Polymer Ligands for Crystallizing Separation of Cs/Sr. ACS APPLIED MATERIALS & INTERFACES 2022; 14:33076-33084. [PMID: 35801670 DOI: 10.1021/acsami.2c05272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Separation of Cs/Sr is one of many coordination-chemistry-centered processes in the grand scheme of spent nuclear fuel reprocessing, a critical link for a sustainable nuclear energy industry. To deploy a crystallizing Cs/Sr separation technology, we planned to systematically screen and identify candidate ligands that can efficiently and selectively bind to Sr2+ and form coordination polymers. Therefore, we mined the Cambridge Structural Database for characteristic structural information and developed a machine-learning-guided methodology for ligand evaluation. The optimized machine-learning model, correlating the molecular structures of the ligands with the predicted coordinative properties, generated a ranking list of potential compounds for Cs/Sr selective crystallization. The Sr2+ sequestration capability and selectivity over Cs+ of the promising ligands identified (squaric acid and chloranilic acid) were subsequently confirmed experimentally, with commendable performances, corroborating the artificial-intelligence-guided strategy.
Collapse
Affiliation(s)
- Zhiyuan Zhang
- School of Chemical Engineering, Sichuan University, Chengdu 610065, People's Republic of China
| | - Min Cheng
- School of Chemical Engineering, Sichuan University, Chengdu 610065, People's Republic of China
| | - Xinyi Xiao
- School of Chemical Engineering, Sichuan University, Chengdu 610065, People's Republic of China
| | - Kexin Bi
- School of Chemical Engineering, Sichuan University, Chengdu 610065, People's Republic of China
| | - Ting Song
- School of Chemical Engineering, Sichuan University, Chengdu 610065, People's Republic of China
| | - Kong-Qiu Hu
- Laboratory of Nuclear Energy Chemistry, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, People's Republic of China
| | - Yiyang Dai
- School of Chemical Engineering, Sichuan University, Chengdu 610065, People's Republic of China
| | - Li Zhou
- School of Chemical Engineering, Sichuan University, Chengdu 610065, People's Republic of China
| | - Chong Liu
- School of Chemical Engineering, Sichuan University, Chengdu 610065, People's Republic of China
| | - Xu Ji
- School of Chemical Engineering, Sichuan University, Chengdu 610065, People's Republic of China
| | - Wei-Qun Shi
- Laboratory of Nuclear Energy Chemistry, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, People's Republic of China
| |
Collapse
|
20
|
Bürgi HB. Crystal structures. ACTA CRYSTALLOGRAPHICA SECTION B, STRUCTURAL SCIENCE, CRYSTAL ENGINEERING AND MATERIALS 2022; 78:283-289. [PMID: 35695099 DOI: 10.1107/s205252062200292x] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 03/16/2022] [Indexed: 06/15/2023]
Abstract
A personal view is offered on various solved and open problems related to crystal structures: the present state of reconstructing the crystal electron density from X-ray diffraction data; characterization of atomic and molecular motion from a combination of atomic displacement parameters and quantum chemical calculations; Bragg diffraction and diffuse scattering: twins, but different; models of real (as opposed to ideal) crystal structures from diffuse scattering; exploiting unexplored neighbourhoods of crystallography to mathematics, physics and chemistry.
Collapse
Affiliation(s)
- Hans Beat Bürgi
- Department of Chemistry, Biochemistry and Pharmacy, University of Berne, Freiestrasse 12, Bern, CH-3012, Switzerland
| |
Collapse
|
21
|
Wang R, Liu J, He X, Xie W, Zhang C. Decoding hexanitrobenzene (HNB) and 1,3,5-triamino-2,4,6-trinitrobenzene (TATB) as two distinctive energetic nitrobenzene compounds by machine learning. Phys Chem Chem Phys 2022; 24:9875-9884. [PMID: 35415730 DOI: 10.1039/d2cp00439a] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Energetic materials (EMs) are a group of special energy materials, and it is generally full of safety risks and generally costs much to create new EMs. Thus, machine learning (ML)-aided discovery becomes highly desired for EMs, as ML is good at risk and cost reduction. This work decodes hexanitrobenzene (HNB) and 1,3,5-triamino-2,4,6-trinitrobenzene (TATB) as two distinctive energetic nitrobenzene compounds by ML, in combination with theoretical calculations. Based on a series of highly accurate models of density, heat of formation, bond dissociation energy and molecular flatness, the ML predictions show that HNB is the most energetic among ∼370 000 000 single benzene ring-containing compounds, while TATB possesses a moderate energy content and very high safety, as determined experimentally. This work exhibits the significant power of ML and presents an instructive procedure for using it in the field of EMs. The ML-aided design and highly efficient synthesis and fabrication combined strategy is expected to accelerate the discovery of new EMs.
Collapse
Affiliation(s)
- Rong Wang
- Institute of Chemical Materials, China Academy of Engineering Physics (CAEP), PO Box 919-311, Mianyang, Sichuan 621900, China.
| | - Jian Liu
- Institute of Chemical Materials, China Academy of Engineering Physics (CAEP), PO Box 919-311, Mianyang, Sichuan 621900, China.
| | - Xudong He
- Institute of Chemical Materials, China Academy of Engineering Physics (CAEP), PO Box 919-311, Mianyang, Sichuan 621900, China.
| | - Weiyu Xie
- Institute of Chemical Materials, China Academy of Engineering Physics (CAEP), PO Box 919-311, Mianyang, Sichuan 621900, China.
| | - Chaoyang Zhang
- Institute of Chemical Materials, China Academy of Engineering Physics (CAEP), PO Box 919-311, Mianyang, Sichuan 621900, China. .,Beijing Computational Science Research Center, Beijing 100048, China.
| |
Collapse
|