1
|
Yang H, Chen R, Dai L, Ren B, Yang F, Xu YJ, Li Q. Construction of a reaction-based fluorescent sensor for tandem detection of Cu 2+ and glutathione in wine. Food Chem 2025; 464:141632. [PMID: 39423546 DOI: 10.1016/j.foodchem.2024.141632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Revised: 10/06/2024] [Accepted: 10/11/2024] [Indexed: 10/21/2024]
Abstract
The purpose of this study was to develop a novel reaction-based fluorescent sensor for the detection of Cu2+ and glutathione in real wine samples. The sensor, tris-(2-pyridyl)-methylamine rhodol derivative, was synthesized and validated for the tandem and selective detection of both Cu2+ and glutathione. The sensor exhibited a strong linear correlation between fluorescence intensity and Cu2+ concentration ranging from 100 to 900 nM, while the in situ generated Cu2+ ensemble selectively detected glutathione with a robust linear response from 3 to 30 μM. The detection limits for Cu2+ and glutathione were as low as 28 nM and 0.60 μM, respectively. Additionally, the sensor enabled quantitative detection of Cu2+ and glutathione in real wine samples. This work provides the first reaction-based fluorescence sensor with an "on-off-on" fluorescence response for the tandem detection of Cu2+ and glutathione in wine, offering potential applications in food and beverage quality control.
Collapse
Affiliation(s)
- Han Yang
- College of Chemistry and Material Science, Sichuan Normal University, Chengdu, Sichuan 610066, China
| | - Renqiang Chen
- College of Chemistry and Material Science, Sichuan Normal University, Chengdu, Sichuan 610066, China
| | - Linjun Dai
- College of Chemistry and Material Science, Sichuan Normal University, Chengdu, Sichuan 610066, China
| | - Boquan Ren
- College of Chemistry and Material Science, Sichuan Normal University, Chengdu, Sichuan 610066, China
| | - Feng Yang
- College of Chemistry and Material Science, Sichuan Normal University, Chengdu, Sichuan 610066, China
| | - Yan-Jun Xu
- College of Chemistry and Material Science, Sichuan Normal University, Chengdu, Sichuan 610066, China
| | - Qing Li
- College of Chemistry and Material Science, Sichuan Normal University, Chengdu, Sichuan 610066, China.
| |
Collapse
|
2
|
Chu DBK, González-Narváez DA, Meyer R, Nandy A, Kulik HJ. Ligand Many-Body Expansion as a General Approach for Accelerating Transition Metal Complex Discovery. J Chem Inf Model 2024; 64:9397-9412. [PMID: 39606954 DOI: 10.1021/acs.jcim.4c01728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
Methods that accelerate the evaluation of molecular properties are essential for chemical discovery. While some degree of ligand additivity has been established for transition metal complexes, it is underutilized in asymmetric complexes, such as the square pyramidal coordination geometries highly relevant to catalysis. To develop predictive methods beyond simple additivity, we apply a many-body expansion to octahedral and square pyramidal complexes and introduce a correction based on adjacent ligands (i.e., the cis interaction model). We first test the cis interaction model on adiabatic spin-splitting energies of octahedral Fe(II) complexes, predicting DFT-calculated values of unseen binary complexes to within an average error of 1.4 kcal/mol. Uncertainty analysis reveals the optimal basis, comprising the homoleptic and mer symmetric complexes. We next show that the cis model (i.e., the cis interaction model solved for the optimal basis) infers both DFT- and CCSD(T)-calculated model catalytic reaction energies to within 1 kcal/mol on average. The cis model predicts low-symmetry complexes with reaction energies outside the range of binary complex reaction energies. We observe that trans interactions are unnecessary for most monodentate systems but can be important for some combinations of ligands, such as complexes containing a mixture of bidentate and monodentate ligands. Finally, we demonstrate that the cis model may be combined with Δ-learning to predict CCSD(T) reaction energies from exhaustively calculated DFT reaction energies and the same fraction of CCSD(T) reaction energies needed for the cis model, achieving around 30% of the error from using the CCSD(T) reaction energies in the cis model alone.
Collapse
Affiliation(s)
- Daniel B K Chu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - David A González-Narváez
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Ralf Meyer
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Aditya Nandy
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
3
|
Shakiba M, Philips AB, Autschbach J, Akimov AV. Machine Learning Mapping Approach for Computing Spin Relaxation Dynamics. J Phys Chem Lett 2024:153-162. [PMID: 39707977 DOI: 10.1021/acs.jpclett.4c03293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2024]
Abstract
In this work, a machine learning mapping approach for predicting the properties of atomistic systems is reported. Within this approach, the atomic orbital overlap, density, or Kohn-Sham (KS) Fock matrix elements obtained at a low level of theory such as extended tight-binding have been used as input features to predict the electric field gradient (EFG) tensors at a higher level of theory such as those obtained with hybrid functionals. It is shown that the machine-learning-predicted EFG tensors can be used to compute spin relaxation rates of several ions in aqueous solutions. From only a fraction of data used in direct calculation, one can predict the quadrupolar isotropic spin relaxation rates with good accuracy, achieving relative errors between about 2-8% for different ions.
Collapse
Affiliation(s)
- Mohammad Shakiba
- Department of Chemistry, University at Buffalo, The State University of New York, Buffalo, New York 14260, United States
| | - Adam B Philips
- Department of Chemistry, University at Buffalo, The State University of New York, Buffalo, New York 14260, United States
| | - Jochen Autschbach
- Department of Chemistry, University at Buffalo, The State University of New York, Buffalo, New York 14260, United States
| | - Alexey V Akimov
- Department of Chemistry, University at Buffalo, The State University of New York, Buffalo, New York 14260, United States
| |
Collapse
|
4
|
Shang X, Han Y, Lian X, Ye S, Ao L, Sun B, Liu R, Zhen P, Zhang Y, Jia Y, Dong W, Sun X, Cui F. Extraction of short-chain fatty acid ethyl Ester in Baijiu using covalent organic framework-based magnetic nanoparticles: Theoretical screening and experimental validation. Food Chem 2024; 468:142494. [PMID: 39700812 DOI: 10.1016/j.foodchem.2024.142494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2024] [Revised: 11/23/2024] [Accepted: 12/13/2024] [Indexed: 12/21/2024]
Abstract
Short-chain fatty acid ethyl esters (SFAEEs) are critical aroma compounds in Baijiu, and their wider concentration range can lead to differences in the quality grade of Baijiu. Efficiently designing an SFAEEs adsorbent before instrument analysis remains challenging. In this work, nine functionalized covalent organic frameworks (COFs) with different postmodification groups were designed for targeting SFAEEs. Based on interaction energy as the evaluation criterion, COFs modified with 5-Mercapto-1-methyltetrazole (MMTZ) had been identified through density functional theory screening. Using imine COFs and MMTZ, novel magnetic nanoparticles (Fe3O4@COFs@MMTZ) were prepared and used to develop the magnetic solid-phase extraction of SFAEEs from Baijiu. The adsorption mechanism of Fe3O4@COFs@MMTZ was analyzed using wave function analysis, revealing that adsorption occurred via vdW interaction, CH···π interaction, and hydrogen bonding. This study provides a new concept for the rapid detection of SFAEEs and theoretical support for the scientific construction of quality control during Baijiu production.
Collapse
Affiliation(s)
- Xiaolong Shang
- Beijing Laboratory of Food Quality and Safety, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Brewing Molecular Engineering of China Light Industry, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Geriatric Nutrition and Health Ministry of Education, Beijing Technology and Business University, Beijing 100048, China
| | - Ying Han
- Shanxi Xinghuacun Fenjiu Distillery Co., Ltd., Fenyang 032205, China
| | - Xudong Lian
- Beijing Laboratory of Food Quality and Safety, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Brewing Molecular Engineering of China Light Industry, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Geriatric Nutrition and Health Ministry of Education, Beijing Technology and Business University, Beijing 100048, China
| | - Siting Ye
- Beijing Laboratory of Food Quality and Safety, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Brewing Molecular Engineering of China Light Industry, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Geriatric Nutrition and Health Ministry of Education, Beijing Technology and Business University, Beijing 100048, China
| | - Ling Ao
- Luzhoulaojiao Distillery Co., Ltd., Luzhou 646000, China
| | - Baoguo Sun
- Beijing Laboratory of Food Quality and Safety, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Brewing Molecular Engineering of China Light Industry, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Geriatric Nutrition and Health Ministry of Education, Beijing Technology and Business University, Beijing 100048, China
| | - Rong Liu
- Shanxi Xinghuacun Fenjiu Distillery Co., Ltd., Fenyang 032205, China
| | - Pan Zhen
- Shanxi Xinghuacun Fenjiu Distillery Co., Ltd., Fenyang 032205, China
| | - Yongqing Zhang
- Beijing Laboratory of Food Quality and Safety, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Brewing Molecular Engineering of China Light Industry, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Geriatric Nutrition and Health Ministry of Education, Beijing Technology and Business University, Beijing 100048, China
| | - Yintao Jia
- Beijing Laboratory of Food Quality and Safety, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Brewing Molecular Engineering of China Light Industry, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Geriatric Nutrition and Health Ministry of Education, Beijing Technology and Business University, Beijing 100048, China
| | - Wei Dong
- Beijing Laboratory of Food Quality and Safety, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Brewing Molecular Engineering of China Light Industry, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Geriatric Nutrition and Health Ministry of Education, Beijing Technology and Business University, Beijing 100048, China.
| | - Xiaotao Sun
- Beijing Laboratory of Food Quality and Safety, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Brewing Molecular Engineering of China Light Industry, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Geriatric Nutrition and Health Ministry of Education, Beijing Technology and Business University, Beijing 100048, China.
| | - Fan Cui
- Shanxi Xinghuacun Fenjiu Distillery Co., Ltd., Fenyang 032205, China
| |
Collapse
|
5
|
Verma NK, Raghav N. Molecular modeling of cellulose tosylate immobilized α-amylases: An in silico case study through MD simulation and refinement. Int J Biol Macromol 2024:138808. [PMID: 39694388 DOI: 10.1016/j.ijbiomac.2024.138808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2024] [Revised: 11/24/2024] [Accepted: 12/14/2024] [Indexed: 12/20/2024]
Abstract
The use of enzymes as catalysts in industrial processes has been studied, and they offer more ecologically friendly options for chemical reactions. In the current work, we investigated the potential of molecular modeling to solve the ordinarily difficult problem of identifying the amino acids involved in the covalent mode of immobilization by in silico investigations. The immobilized α-Amylase on Cellulose tosylate (henceforth referred to as Celltos) shows extra peaks of OH and NH2, CN, SO, C-O-C, and CS. Celltos exhibits distinct ether, imine, and CS peaks, indicating the potential contribution of α-Amylase's hydroxyl, amino, and thiol groups towards immobilization with cellulose's tosylate group. The native amylase was processed for Molecular Dynamics simulation. The simulated amylase was found to be the root mean squarely deviated to 1.16 Å. Autodock Vina, GOLD, SwissDock, and iGemdock generate output averages of 6.164, 6.549, 9.313 & 137.811 and 5.903, 7.656, 9.752 & 132.218 for an unrefined and refined dataset, respectively. The catalytic site intactness values for unrefined and refined SAT9, SAT13, and LAT21 were 83.3 %, 100 %, 100 %, and 8.33 %, 0 %, and 0 %, respectively. Our findings were additionally confirmed by bond distance similarity computations.
Collapse
Affiliation(s)
| | - Neera Raghav
- Chemistry Department, Kurukshetra University, Kurukshetra 136119, Haryana, India.
| |
Collapse
|
6
|
Maennel H, Unke OT, Müller KR. Complete and Efficient Covariants for Three-Dimensional Point Configurations with Application to Learning Molecular Quantum Properties. J Phys Chem Lett 2024:12513-12519. [PMID: 39670428 DOI: 10.1021/acs.jpclett.4c02376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2024]
Abstract
When physical properties of molecules are being modeled with machine learning, it is desirable to incorporate SO(3)-covariance. While such models based on low body order features are not complete, we formulate and prove general completeness properties for higher order methods and show that 6k - 5 of these features are enough for up to k atoms. We also find that the Clebsch-Gordan operations commonly used in these methods can be replaced by matrix multiplications without sacrificing completeness, lowering the scaling from O(l6) to O(l3) in the degree of the features. We apply this to quantum chemistry, but the proposed methods are generally applicable for problems involving three-dimensional point configurations.
Collapse
Affiliation(s)
- Hartmut Maennel
- Google DeepMind Zürich, Brandschenkestraße 110, 8002 Zürich, Switzerland
| | - Oliver T Unke
- Google DeepMind Berlin, Tucholskystraße 2, 10117 Berlin, Germany
| | - Klaus-Robert Müller
- Google DeepMind, https://deepmind.google/
- TU Berlin, Machine Learning Group, Marchstraße 23, 10587 Berlin, Germany
- Berlin Institute for the Foundation of Learning and Data, Ernst-Reuter-Platz 7, 10587 Berlin, Germany
- Max Planck Institute for Informatics Saarbrücken, Saarland Informatics Campus, Building E1 4, 66123 Sarbrücken, Germany
- Department of Artificial Intelligence, Korea University, Seoul 136-713, Korea
| |
Collapse
|
7
|
Zhang W, Zhang G, Ma J, Xie Z, Gao Z, Yu K, Peng L. The Role of Transition Metal Versus Coordination Mode in Single-Atom Catalyst for Electrocatalytic Sulfur Reduction Reaction. ACS APPLIED MATERIALS & INTERFACES 2024; 16:66981-66990. [PMID: 38830270 DOI: 10.1021/acsami.4c01811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2024]
Abstract
Electrocatalytic sulfur reduction reaction (SRR) is emerging as an effective strategy to combat the polysulfide shuttling effect, which remains a critical factor impeding the practical application of the Li-S battery. Single-atom catalyst (SAC), one of the most studied catalytic materials, has shown considerable potential in addressing the polysulfide shuttling effect in a Li-S battery. However, the role played by transition metal vs coordination mode in electrocatalytic SRR is trial-and-error, and the general understanding that guides the synthesis of the specific SAC with desired property remains elusive. Herein, we use first-principles calculations and machine learning to screen a comprehensive data set of graphene-based SACs with different transition metals, heteroatom doping, and coordination modes. The results reveal that the type of transition metal plays the decisive role in SAC for electrocatalytic SRR, rather than the coordination mode. Specifically, the 3d transition metals exhibit admirable electrocatalytic SRR activity for all of the coordination modes. Compared with the reported N3C1 and N4 coordinated graphene-based SACs covering 3d, 4d, and 5d transition metals, the proposed para-MnO2C2 and para-FeN2C2 possess significant advantages on the electrocatalytic SRR, including a considerably low overpotential down to 1 mV and reduced Li2S decomposition energy barrier, both suggesting an accelerated conversion process among the polysulfides. This study may clarify some understanding of the role played by transition metal vs coordination mode for SAC materials with specific structure and desired catalytic properties toward electrocatalytic SRR and beyond.
Collapse
Affiliation(s)
- Wentao Zhang
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, P. R. China
- School of Materials Science and Engineering, Tsinghua University, Beijing 100084, P. R. China
| | - Gaoshang Zhang
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, P. R. China
- School of Materials Science and Engineering, Tsinghua University, Beijing 100084, P. R. China
| | - Jiabin Ma
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, P. R. China
- School of Materials Science and Engineering, Tsinghua University, Beijing 100084, P. R. China
| | - Zhaotian Xie
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, P. R. China
- School of Materials Science and Engineering, Tsinghua University, Beijing 100084, P. R. China
| | - Ziyao Gao
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, P. R. China
- School of Materials Science and Engineering, Tsinghua University, Beijing 100084, P. R. China
| | - Kuang Yu
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, P. R. China
- School of Materials Science and Engineering, Tsinghua University, Beijing 100084, P. R. China
| | - Lele Peng
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, P. R. China
- School of Materials Science and Engineering, Tsinghua University, Beijing 100084, P. R. China
| |
Collapse
|
8
|
Li P, Luo S, Lin Y, Xiao J, Xia X, Liu X, Wang L, He X. Fundamentals of the recycling of spent lithium-ion batteries. Chem Soc Rev 2024; 53:11967-12013. [PMID: 39471089 DOI: 10.1039/d4cs00362d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2024]
Abstract
This review discusses the critical role of fundamentals of battery recycling in addressing the challenges posed by the increasing number of spent lithium-ion batteries (LIBs) due to the widespread use of electric vehicles and portable electronics, by providing the theoretical basis and technical support for recycling spent LIBs, including battery classification, ultrasonic flaw detection, pretreatment (e.g., discharging, mechanical crushing, and physical separation), electrolyte recovery, direct regeneration, and theoretical calculations and simulations. Physical chemistry principles are essential for achieving effective separation of different components through methods like screening, magnetic separation, and flotation. Electrolyte recovery involves separation and purification of electrolytes through advanced physical and chemical techniques. Direct regeneration technology restores the structure of electrode materials at the microscopic scale, requiring precise control of the physical state and crystal structure of the material. Physical processes such as phase changes, solubility, and diffusion are fundamental to techniques like solid-state sintering, eutectic-salt treatment, and hydrothermal methods. Theoretical calculations and simulations help predict the behaviour of materials during recycling, guiding process optimization. This review provides insights into understanding and improving the recycling process, emphasizing the central role of physical chemistry principles in addressing environmental and energy issues. It is valuable for promoting innovation in spent LIB recycling processes and is expected to stimulate interest among researchers and manufacturers.
Collapse
Affiliation(s)
- Pengwei Li
- School of Materials Science and Engineering, Northeastern University, Shenyang 110819, China.
| | - Shaohua Luo
- School of Materials Science and Engineering, Northeastern University, Shenyang 110819, China.
- School of Resources and Materials, Northeastern University at Qinhuangdao, Qinhuangdao, 066004, P. R. China
| | - Yicheng Lin
- School of Materials Science and Engineering, Northeastern University, Shenyang 110819, China.
| | - Jiefeng Xiao
- Department of Environmental Science and Engineering, Huaqiao University, Xiamen 361021, China
| | - Xiaoning Xia
- Department of Materials and Production, Aalborg University, Aalborg, 9220, Denmark
| | - Xin Liu
- School of Materials Science and Engineering, Northeastern University, Shenyang 110819, China.
- School of Resources and Materials, Northeastern University at Qinhuangdao, Qinhuangdao, 066004, P. R. China
| | - Li Wang
- Institute of Nuclear and New Energy Technology, Tsinghua University, Beijing 100084, China.
| | - Xiangming He
- Institute of Nuclear and New Energy Technology, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
9
|
Li DZ, Gong XQ. Challenges with Literature-Derived Data in Machine Learning for Yield Prediction: A Case Study on Pd-Catalyzed Carbonylation Reactions. J Phys Chem A 2024; 128:10423-10430. [PMID: 39565904 DOI: 10.1021/acs.jpca.4c05489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2024]
Abstract
The application of machine learning (ML) to predict reaction yields has shown remarkable accuracy when based on high-throughput computational and experimental data. However, the accuracy significantly diminishes when leveraging literature-derived data, highlighting a gap in the predictive capability of the current ML models. This study, focusing on Pd-catalyzed carbonylation reactions, reveals that even with a data set of 2512 reactions, the best-performing model reaches only an R2 of 0.51. Further investigations show that the models' effectiveness is predominantly confined to predictions within narrow subsets of data, closely related and from the same literature sources, rather than across the broader, heterogeneous data sets available in the literature. The reliance on data similarity, coupled with small sample sizes from the same sources, makes the model highly sensitive to inherent fluctuations typical of small data sets, adversely impacting stability, accuracy, and generalizability. The findings underscore the inherent limitations of current ML techniques in leveraging literature-derived data for predicting chemical reaction yields, highlighting the need for more sophisticated approaches to handle the complexity and diversity of chemical data.
Collapse
Affiliation(s)
- Dong-Zhi Li
- Centre for Computational Chemistry, School of Chemistry and Molecular Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Xue-Qing Gong
- Centre for Computational Chemistry, School of Chemistry and Molecular Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
- School of Chemistry and Chemical Engineering, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China
| |
Collapse
|
10
|
Uceda RG, Gijón A, Míguez‐Lago S, Cruz CM, Blanco V, Fernández‐Álvarez F, Álvarez de Cienfuegos L, Molina‐Solana M, Gómez‐Romero J, Miguel D, Mota AJ, Cuerva JM. Can Deep Learning Search for Exceptional Chiroptical Properties? The Halogenated [6]Helicene Case. Angew Chem Int Ed Engl 2024; 63:e202409998. [PMID: 39329214 PMCID: PMC11586703 DOI: 10.1002/anie.202409998] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Revised: 09/11/2024] [Accepted: 09/24/2024] [Indexed: 09/28/2024]
Abstract
The relationship between chemical structure and chiroptical properties is not always clearly understood. Nowadays, efforts to develop new systems with enhanced optical properties follow the trial-error method. A large number of data would allow us to obtain more robust conclusions and guide research toward molecules with practical applications. In this sense, in this work we predict the chiroptical properties of millions of halogenated [6]helicenes in terms of the rotatory strength (R). We have used DFT calculations to randomly create derivatives including from 1 to 16 halogen atoms, that were then used as a data set to train different deep neural network models. These models allow us to i) predict the Rmax for any halogenated [6]helicene with a very low computational cost, and ii) to understand the physical reasons that favour some substitutions over others. Finally, we synthesized derivatives with higher predicted Rmax obtaining excellent correlation among the values obtained experimentally and the predicted ones.
Collapse
Affiliation(s)
- Rafael G. Uceda
- Departamento de Química Orgánica, Unidad de Excelencia de Química Aplicada a la Biomedicina y Medioambiente (UEQ)Universidad de Granada (UGR), Facultad de CienciasC. U. Fuentenueva18071GranadaSpain
| | - Alfonso Gijón
- Departamento de Ciencias de la Computación e Inteligencia Artificial, UGRE.T.S. de Ingenierías Informática y de TelecomunicaciónC/ Periodista Daniel Saucedo Aranda S/N18071GranadaSpain
| | - Sandra Míguez‐Lago
- Departamento de Química Orgánica, Unidad de Excelencia de Química Aplicada a la Biomedicina y Medioambiente (UEQ)Universidad de Granada (UGR), Facultad de CienciasC. U. Fuentenueva18071GranadaSpain
| | - Carlos M. Cruz
- Departamento de Química Orgánica, Unidad de Excelencia de Química Aplicada a la Biomedicina y Medioambiente (UEQ)Universidad de Granada (UGR), Facultad de CienciasC. U. Fuentenueva18071GranadaSpain
| | - Víctor Blanco
- Departamento de Química Orgánica, Unidad de Excelencia de Química Aplicada a la Biomedicina y Medioambiente (UEQ)Universidad de Granada (UGR), Facultad de CienciasC. U. Fuentenueva18071GranadaSpain
| | - Fátima Fernández‐Álvarez
- Departamento de Química Orgánica, Unidad de Excelencia de Química Aplicada a la Biomedicina y Medioambiente (UEQ)Universidad de Granada (UGR), Facultad de CienciasC. U. Fuentenueva18071GranadaSpain
| | - Luis Álvarez de Cienfuegos
- Departamento de Química Orgánica, Unidad de Excelencia de Química Aplicada a la Biomedicina y Medioambiente (UEQ)Universidad de Granada (UGR), Facultad de CienciasC. U. Fuentenueva18071GranadaSpain
- Instituto de Investigación BiosanitariaAvda. Madrid, 1518016GranadaSpain
| | - Miguel Molina‐Solana
- Departamento de Ciencias de la Computación e Inteligencia Artificial, UGRE.T.S. de Ingenierías Informática y de TelecomunicaciónC/ Periodista Daniel Saucedo Aranda S/N18071GranadaSpain
| | - Juan Gómez‐Romero
- Departamento de Ciencias de la Computación e Inteligencia Artificial, UGRE.T.S. de Ingenierías Informática y de TelecomunicaciónC/ Periodista Daniel Saucedo Aranda S/N18071GranadaSpain
| | - Delia Miguel
- Departamento de Fisicoquímica, UEQ, UGRFacultad de FarmaciaAvda. Profesor Clavera s/nC. U. Cartuja18071GranadaSpain
| | - Antonio J. Mota
- Departamento de Química Inorgánica, UEQ, UGRFacultad de CienciasC. U. Fuentenueva18071GranadaSpain
| | - Juan M. Cuerva
- Departamento de Química Orgánica, Unidad de Excelencia de Química Aplicada a la Biomedicina y Medioambiente (UEQ)Universidad de Granada (UGR), Facultad de CienciasC. U. Fuentenueva18071GranadaSpain
| |
Collapse
|
11
|
Rezaee P, Rezaee S, Maaza M, Arab SS. Screening of BindingDB database ligands against EGFR, HER2, Estrogen, Progesterone and NF-κB receptors based on machine learning and molecular docking. Comput Biol Med 2024; 183:109279. [PMID: 39461104 DOI: 10.1016/j.compbiomed.2024.109279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Revised: 09/24/2024] [Accepted: 10/14/2024] [Indexed: 10/29/2024]
Abstract
Breast cancer, the second most prevalent cancer among women worldwide, necessitates the exploration of novel therapeutic approaches. To target the four subgroups of breast cancer "hormone receptor-positive and HER2-negative, hormone receptor-positive and HER2-positive, hormone receptor-negative and HER2-positive, and hormone receptor-negative and HER2-negative" it is crucial to inhibit specific targets such as EGFR, HER2, ER, NF-κB, and PR. In this study, we evaluated various methods for binary and multiclass classification. Among them, the GA-SVM-SVM:GA-SVM-SVM model was selected with an accuracy of 0.74, an F1-score of 0.73, and an AUC of 0.92 for virtual screening of ligands from the BindingDB database. This model successfully identified 4454, 803, 438, and 378 ligands with over 90% precision in both active/inactive and target prediction for the classes of EGFR+HER2, ER, NF-κB, and PR, respectively, from the BindingDB database. Based on to the selected ligands, we created a dendrogram that categorizes different ligands based on their targets. This dendrogram aims to facilitate the exploration of chemical space for various therapeutic targets. Ligands that surpassed a 90% threshold in the product of activity probability and correct target selection probability were chosen for further investigation using molecular docking. The binding energy range for these ligands against their respective targets was calculated to be between -15 and -5 kcal/mol. Finally, based on general and common rules in medicinal chemistry, we selected 2, 3, 3, and 8 new ligands with high priority for further studies in the EGFR+HER2, ER, NF-κB, and PR classes, respectively.
Collapse
Affiliation(s)
- Parham Rezaee
- Department of Biophysics, School of Biological Sciences, Tarbiat Modares University, Tehran, Iran; UNESCO-UNISA-iTLABS Africa Chair in Nanoscience and Nanotechnology (U2ACN2), College of Graduate Studies, University of South Africa (UNISA), Pretoria, South Africa
| | - Shahab Rezaee
- Department of Biophysics, School of Biological Sciences, Tarbiat Modares University, Tehran, Iran
| | - Malik Maaza
- UNESCO-UNISA-iTLABS Africa Chair in Nanoscience and Nanotechnology (U2ACN2), College of Graduate Studies, University of South Africa (UNISA), Pretoria, South Africa
| | - Seyed Shahriar Arab
- Department of Pediatrics, University of California, La Jolla, San Diego, 92093, CA, USA.
| |
Collapse
|
12
|
Häfner M, Bianchini M. Exploring Cationic Substitutions in the Solid Electrolyte NaAlCl 4 with Density Functional Theory. THE JOURNAL OF PHYSICAL CHEMISTRY. C, NANOMATERIALS AND INTERFACES 2024; 128:19978-19988. [PMID: 39634025 PMCID: PMC11613591 DOI: 10.1021/acs.jpcc.4c05559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/27/2024] [Revised: 10/08/2024] [Accepted: 10/18/2024] [Indexed: 12/07/2024]
Abstract
NaAlCl4 is an established solid electrolyte in high-temperature Na-based battery systems, but its ionic conductivity is not sufficiently high for room-temperature applications. We employ density functional theory and thermodynamic corrections to evaluate the efficacy of various elements for substitution, utilizing on-the-fly machine-learned potentials to accelerate the required phonon calculations by 1 order of magnitude at a minor error of -0.7 ± 1.0 meV/atom. All investigated isovalent substitutions are favorable within 4 meV/atom, with potassium and silver as substitutes for sodium and gallium as a substitute for aluminum. The most promising aliovalent substitution was identified for Zn on the tieline between NaAlCl4 and Na2ZnCl4. The structure of latter, with aluminum ions replacing zinc, yields a structure with separate layers for the differently charged cations and vacancies for potential Na conduction. Our investigation may pave the way for more reliable discovery of new Na conductors by inclusion of thermodynamic properties.
Collapse
Affiliation(s)
- Michael Häfner
- Faculty
of Biology, Chemistry and Earth Sciences, Universität Bayreuth, Universitätsstrasse 30, 95447 Bayreuth, Germany
- Bavarian
Center for Battery Technology (BayBatt), Universität Bayreuth, Weiherstrasse 26, 95448 Bayreuth, Germany
| | - Matteo Bianchini
- Faculty
of Biology, Chemistry and Earth Sciences, Universität Bayreuth, Universitätsstrasse 30, 95447 Bayreuth, Germany
- Bavarian
Center for Battery Technology (BayBatt), Universität Bayreuth, Weiherstrasse 26, 95448 Bayreuth, Germany
| |
Collapse
|
13
|
Tan EX, Zhong QZ, Ting Chen JR, Leong YX, Leon GK, Tran CT, Phang IY, Ling XY. Surface-Enhanced Raman Scattering-Based Multimodal Techniques: Advances and Perspectives. ACS NANO 2024; 18:32315-32334. [PMID: 39530425 DOI: 10.1021/acsnano.4c12996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2024]
Abstract
Surface-enhanced Raman scattering (SERS) spectroscopy is a versatile molecular fingerprinting technique with rapid signal readout, high aqueous compatibility, and portability. To translate SERS for real-world applications, it is pertinent to overcome inherent challenges, including high sample variability and heterogeneity, matrix effects, and nonlinear SERS signal responses of different analytes in complex (bio)chemical matrices with numerous interfering species. In this perspective, we highlight emerging SERS-based multimodal techniques to address the key roadblocks to improving the sensitivity, specificity, and reliability of (bio)chemical detection, bioimaging, theragnosis, and theragnostic. SERS-based multimodal techniques can be broadly categorized into two categories: (1) complementary methods or systems that work together to achieve a common goal where each method compensates for the weaknesses of the other to culminate in a single enhanced outcome or (2) orthogonal techniques that are independent and provide separate but corroborating results simultaneously without interfering with each other. These multimodal techniques maximize information gained from a single experiment to achieve enhanced qualitative or quantitative analysis and broaden the range of detectable analytes from small molecules to tissues. Finally, we discuss emerging directions in multimodal platform design, instrument integration, and data analytics that aim to push the analytical limits of holistic detection.
Collapse
Affiliation(s)
- Emily Xi Tan
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, 21 Nanyang Link, 637371 Singapore
| | - Qi-Zhi Zhong
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, 21 Nanyang Link, 637371 Singapore
| | - Jaslyn Ru Ting Chen
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, 21 Nanyang Link, 637371 Singapore
| | - Yong Xiang Leong
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, 21 Nanyang Link, 637371 Singapore
| | - Guo Kang Leon
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, 21 Nanyang Link, 637371 Singapore
| | - Cam Tu Tran
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, 21 Nanyang Link, 637371 Singapore
| | - In Yee Phang
- Key Laboratory of Synthetic and Biological Colloids, Ministry of Education, International Joint Research Laboratory for Nano Energy Composites, School of Chemical and Material Engineering, Jiangnan University, Wuxi 214122, P. R. China
| | - Xing Yi Ling
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, 21 Nanyang Link, 637371 Singapore
- Key Laboratory of Synthetic and Biological Colloids, Ministry of Education, International Joint Research Laboratory for Nano Energy Composites, School of Chemical and Material Engineering, Jiangnan University, Wuxi 214122, P. R. China
- Lee Kong Chian School of Medicine, Nanyang Technological University, 59 Nanyang Drive, 636921 Singapore
| |
Collapse
|
14
|
Weng J, Cui H, Zheng D, Zhou Z, Zhang D, Chu H, Wang A, Li G. A Multipole-Based Reactive Force Field for Hydrocarbons. J Chem Theory Comput 2024; 20:10045-10058. [PMID: 39497480 DOI: 10.1021/acs.jctc.4c01285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2024]
Abstract
The computational complexity of quantum chemistry methods has prompted the development of reactive force fields, facilitating practical applications of molecular dynamics simulations for large-scale reactive systems. Current reactive force fields typically employ intricate corrections based on prior chemical knowledge, which severely impedes their further advancement. This study presents a new atomic multipole-based reactive model with bond free (OPERATOR). The force field is constructed on a simple, physically motivated model within the AMOEBA framework that closely resembles the physical representation of the chemical reaction processes. In the force field, the atomic multipoles are generated dynamically according to the atomic environments, aiming to effectively capture significant changes in the electrostatic environments during chemical reactions. Subsequently, atomic multipole-based charge penetration, polarization, and charge transfer effects are incorporated into the force field to describe the complex electrostatic interactions in the system. The force field also includes van der Waals interactions and three-body potentials. In addition, to extend these nonreactive interactions to chemical reactions, the atom distribution multipole moments are used to characterize different chemical environments. The force field has been optimized using the dataset of potential energy surfaces (PESs) of hydrocarbons derived from DFT results of millions of conformations with six degrees of freedom (DOFs). The results demonstrate that the new force field effectively replicates both the monopoles and the energies. In comparison to ReaxFF, the new force field exhibits comparable or superior performance. Furthermore, molecular dynamics simulations of n-heptane decomposition effectively reproduce the primary products and reactions observed in the experiments. Given the simplicity and physically motivated nature of the model, it is expected that the new force field will be utilized in future studies to investigate chemical reaction mechanisms involving more elements.
Collapse
Affiliation(s)
- Junben Weng
- Laboratory of Molecular Modeling and Design, State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Hongqiang Cui
- Laboratory of Molecular Modeling and Design, State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Da Zheng
- Laboratory of Molecular Modeling and Design, State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhenhao Zhou
- Laboratory of Molecular Modeling and Design, State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Dinglin Zhang
- Laboratory of Molecular Modeling and Design, State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- Interdisciplinary Research Center for Biology and Chemistry, Liaoning Normal University, Dalian 116029, China
| | - Huiying Chu
- Laboratory of Molecular Modeling and Design, State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- Interdisciplinary Research Center for Biology and Chemistry, Liaoning Normal University, Dalian 116029, China
| | - Anhui Wang
- Laboratory of Molecular Modeling and Design, State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- Interdisciplinary Research Center for Biology and Chemistry, Liaoning Normal University, Dalian 116029, China
| | - Guohui Li
- Laboratory of Molecular Modeling and Design, State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- Interdisciplinary Research Center for Biology and Chemistry, Liaoning Normal University, Dalian 116029, China
| |
Collapse
|
15
|
Si Z, Liu D, Nie W, Hu J, Wang C, Jiang T, Yu H, Fu Y. Data-Based Prediction of Redox Potentials via Introducing Chemical Features into the Transformer Architecture. J Chem Inf Model 2024; 64:8453-8463. [PMID: 39513760 DOI: 10.1021/acs.jcim.4c01299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2024]
Abstract
Rapid and accurate prediction of basic physicochemical parameters of molecules will greatly accelerate the target-orientated design of novel reactions and materials but has been long challenging. Herein, a chemical language model-based deep learning method, TransChem, has been developed for the prediction of redox potentials of organic molecules. Embedding an effective molecular characterization (combining spatial and electronic features), a nonlinear molecular messaging approach (Mol-Attention), and a perturbation learning method, TransChem, shows high accuracy in predicting the redox potential of organic radicals comprising over 100,000 data (R2 > 0.97, MAE <0.09 V) and is generalized to the smaller 2,1,3-benzothiadiazole data set (<3000 data points) and electron affinity data set (660 data) with low MAE of 0.07 V and 0.18 eV, respectively. In this context, a self-developed data set, i.e., the oxidation potential (OP) of a full-space disubstituted phenol data set (OPP-data set, total set: 74,529), has been predicted by TransChem with a high-throughput, and active learning strategy. The rapid and reliable prediction of OP could hopefully accelerate the screening of plausible reagents in highly selective cross-coupling of phenol derivatives. This study presents an important attempt to guide language modeling with chemical knowledge, while TransChem demonstrates state-of-the-art (SOTA) predictive performance on redox potential prediction benchmark data sets for its better understanding of molecular design and conformational relationships.
Collapse
Affiliation(s)
- Zhan Si
- Department of Chemistry and Centre for Atomic Engineering of Advanced Materials, Anhui Province Key Laboratory of Chemistry for Inorganic/Organic Hybrid Functionalized Materials, Anhui University, Hefei 230601, China
| | - Deguang Liu
- Key Laboratory of Precision and Intelligent Chemistry, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, University of Science and Technology of China, Hefei 230026, China
| | - Wan Nie
- Department of Computer Science, City University of Hong Kong, Hong Kong 999077, China
| | - Jingjing Hu
- Department of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China
| | - Chen Wang
- Department of Chemistry and Centre for Atomic Engineering of Advanced Materials, Anhui Province Key Laboratory of Chemistry for Inorganic/Organic Hybrid Functionalized Materials, Anhui University, Hefei 230601, China
| | - Tingting Jiang
- Department of Chemistry and Centre for Atomic Engineering of Advanced Materials, Anhui Province Key Laboratory of Chemistry for Inorganic/Organic Hybrid Functionalized Materials, Anhui University, Hefei 230601, China
| | - Haizhu Yu
- Department of Chemistry and Centre for Atomic Engineering of Advanced Materials, Anhui Province Key Laboratory of Chemistry for Inorganic/Organic Hybrid Functionalized Materials, Anhui University, Hefei 230601, China
| | - Yao Fu
- Key Laboratory of Precision and Intelligent Chemistry, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, University of Science and Technology of China, Hefei 230026, China
| |
Collapse
|
16
|
Brough HA, Cheneler D, Hardy JG. Progress in Multiscale Modeling of Silk Materials. Biomacromolecules 2024; 25:6987-7014. [PMID: 39438248 PMCID: PMC11558682 DOI: 10.1021/acs.biomac.4c01122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Revised: 09/28/2024] [Accepted: 10/03/2024] [Indexed: 10/25/2024]
Abstract
As a result of their hierarchical structure and biological processing, silk fibers rank among nature's most remarkable materials. The biocompatibility of silk-based materials and the exceptional mechanical properties of certain fibers has inspired the use of silk in numerous technical and medical applications. In recent years, computational modeling has clarified the relationship between the molecular architecture and emergent properties of silk fibers and has demonstrated predictive power in studies on novel biomaterials. Here, we review advances in modeling the structure and properties of natural and synthetic silk-based materials, from early structural studies of silkworm cocoon fibers to cutting-edge atomistic simulations of spider silk nanofibrils and the recent use of machine learning models. We explore applications of modeling across length scales: from quantum mechanical studies on model peptides, to atomistic and coarse-grained molecular dynamics simulations of silk proteins, to finite element analysis of spider webs. As computational power and algorithmic efficiency continue to advance, we expect multiscale modeling to become an indispensable tool for understanding nature's most impressive fibers and developing bioinspired functional materials.
Collapse
Affiliation(s)
- Harry
D. A. Brough
- Department
of Chemistry, Lancaster University, Lancaster LA1 4YB, United Kingdom
| | - David Cheneler
- School
of Engineering, Lancaster University, Lancaster LA1 4YW, United Kingdom
- Materials
Science Lancaster, Lancaster University, Lancaster, LA1 4YW, United Kingdom
| | - John G. Hardy
- Department
of Chemistry, Lancaster University, Lancaster LA1 4YB, United Kingdom
- Materials
Science Lancaster, Lancaster University, Lancaster, LA1 4YW, United Kingdom
| |
Collapse
|
17
|
Shaban Tameh M, Coropceanu V, Purcell TAR, Brédas JL. Prediction of the Infrared Absorbance Intensities and Frequencies of Hydrocarbons: A Message Passing Neural Network Approach. J Phys Chem A 2024; 128:9695-9706. [PMID: 39466724 DOI: 10.1021/acs.jpca.4c06745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/30/2024]
Abstract
Accurately and efficiently predicting the infrared (IR) spectra of a molecule can provide insights into the structure-properties relationships of molecular species, which has led to a proliferation of machine learning tools designed for this purpose. However, earlier studies have focused primarily on obtaining normalized IR spectra, which limits their potential for a comprehensive analysis of molecular behavior in the IR range. For instance, to fully understand and predict the optical properties, such as the transparency characteristics, it is necessary to predict the molar absorptivity IR spectra instead. Here, we propose a graph-based communicative message passing neural network algorithm that can predict both the peak positions and absolute intensities corresponding to density functional theory calculated molar absorptivities in the IR domain. By modifying existing spectral loss functions, we show that our method is able to predict with DFT-accuracy level the IR molar absorptivities of a series of hydrocarbons containing up to ten carbon atoms and apply the model to a set of larger molecules. We also compare the predicted spectra with those generated by the direct message passing neural network. The results suggest that both algorithms demonstrate similar predictive capabilities for hydrocarbons, indicating that either model could be effectively used in future research on spectral prediction for such systems.
Collapse
Affiliation(s)
- Maliheh Shaban Tameh
- Department of Chemistry and Biochemistry, The University of Arizona, Tucson, Arizona 85721-0041, United States
| | - Veaceslav Coropceanu
- Department of Chemistry and Biochemistry, The University of Arizona, Tucson, Arizona 85721-0041, United States
| | - Thomas A R Purcell
- Department of Chemistry and Biochemistry, The University of Arizona, Tucson, Arizona 85721-0041, United States
| | - Jean-Luc Brédas
- Department of Chemistry and Biochemistry, The University of Arizona, Tucson, Arizona 85721-0041, United States
| |
Collapse
|
18
|
Bu Z, Xue Y, Zhao X, Liu G, An Y, Zhou H, Chen J. Exploring the Crystal Structure and Electronic Properties of γ-Al 2O 3: Machine Learning Drives Future Material Innovations. ACS APPLIED MATERIALS & INTERFACES 2024; 16:60458-60471. [PMID: 39444300 DOI: 10.1021/acsami.4c10774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2024]
Abstract
For decades, researchers have struggled to determine the precise crystal structure of γ-Al2O3 due to its atomic-level disorder and the challenges associated with obtaining high-purity, high-crystallinity γ-Al2O3 in laboratory settings. This study investigates the crystal structure and electronic properties of γ-Al2O3 coatings under the influence of an external electric field, integrating machine learning with density functional theory (DFT). A potential 160-atom supercell structure was identified from over 600,000 γ-Al2O3 configurations and confirmed through high-resolution transmission electron microscopy and selected area electron diffraction. The findings indicate that γ-Al2O3 deviates from the conventional spinel structure, suggesting that octahedral vacancies can reduce the system's energy. Under an external electric field, the material's band structure and density of states (DOS) undergo significant changes: the bandgap narrows from 3.996 to 0 eV, resulting in metallic behavior, while the projected density of states (PDOS) exhibits peak broadening and splitting of oxygen atom PDOS below the Fermi level. These alterations elucidate the variations in the electrical conductivity of alumina coatings under an electric field. These findings clarify the mechanisms of γ-Al2O3's electronic property modulation and offer insights into its covalent and ionic mixed bonding as a wide-bandgap semiconductor. This discovery is essential for understanding dielectric breakdown in insulating materials.
Collapse
Affiliation(s)
- Zhenyu Bu
- State Key Laboratory of Solid Lubrication, Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences, Lanzhou 730000, China
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yun Xue
- State Key Laboratory of Solid Lubrication, Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences, Lanzhou 730000, China
| | - Xiaoqin Zhao
- State Key Laboratory of Solid Lubrication, Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences, Lanzhou 730000, China
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Guang Liu
- Inner Mongolia Metal Materials Research Institute, Ningbo 315103, China
| | - Yulong An
- State Key Laboratory of Solid Lubrication, Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences, Lanzhou 730000, China
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Huidi Zhou
- State Key Laboratory of Solid Lubrication, Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences, Lanzhou 730000, China
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jianmin Chen
- State Key Laboratory of Solid Lubrication, Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences, Lanzhou 730000, China
- Center of Materials Science and Optoelectronics Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
19
|
Wang L, Li N, Cao M, Zhu Y, Xiong X, Li L, Zhu T, Pei H. Predicting DNA Reactions with a Quantum Chemistry-Based Deep Learning Model. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2409880. [PMID: 39297371 PMCID: PMC11558088 DOI: 10.1002/advs.202409880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/19/2024] [Indexed: 11/14/2024]
Abstract
In this study, a deep learning model based on quantum chemistry is introduced to enhance the accuracy and efficiency of predicting DNA reaction parameters. By integrating quantum chemical calculations with self-designed descriptor matrices, the model offers a comprehensive description of energy variations and considers a broad range of relevant factors. To overcome the challenge of limited labeled data, an active learning method is employed. The results demonstrate that this model outperforms existing methods in predicting DNA hybridization free energies and strand displacement rate constants, thus advancing the understanding of DNA molecular interactions, and aiding in the precise design and optimization of DNA-based systems.
Collapse
Affiliation(s)
- Likun Wang
- Shanghai Key Laboratory of Green Chemistry and Chemical ProcessesShanghai Engineering Research Center of Molecular Therapeutics and New Drug DevelopmentSchool of Chemistry and Molecular EngineeringEast China Normal UniversityShanghai200241China
| | - Na Li
- Shanghai Key Laboratory of Green Chemistry and Chemical ProcessesShanghai Engineering Research Center of Molecular Therapeutics and New Drug DevelopmentSchool of Chemistry and Molecular EngineeringEast China Normal UniversityShanghai200241China
| | - Mengyao Cao
- Shanghai Key Laboratory of Green Chemistry and Chemical ProcessesShanghai Engineering Research Center of Molecular Therapeutics and New Drug DevelopmentSchool of Chemistry and Molecular EngineeringEast China Normal UniversityShanghai200241China
| | - Yun Zhu
- Shanghai Key Laboratory of Green Chemistry and Chemical ProcessesShanghai Engineering Research Center of Molecular Therapeutics and New Drug DevelopmentSchool of Chemistry and Molecular EngineeringEast China Normal UniversityShanghai200241China
| | - Xiewei Xiong
- Shanghai Key Laboratory of Green Chemistry and Chemical ProcessesShanghai Engineering Research Center of Molecular Therapeutics and New Drug DevelopmentSchool of Chemistry and Molecular EngineeringEast China Normal UniversityShanghai200241China
| | - Li Li
- Shanghai Key Laboratory of Green Chemistry and Chemical ProcessesShanghai Engineering Research Center of Molecular Therapeutics and New Drug DevelopmentSchool of Chemistry and Molecular EngineeringEast China Normal UniversityShanghai200241China
| | - Tong Zhu
- Shanghai Key Laboratory of Green Chemistry and Chemical ProcessesShanghai Engineering Research Center of Molecular Therapeutics and New Drug DevelopmentSchool of Chemistry and Molecular EngineeringEast China Normal UniversityShanghai200241China
- Shanghai Innovation InstituteShanghai200003China
- Institute for Advanced Algorithms ResearchShanghai200062China
| | - Hao Pei
- Shanghai Key Laboratory of Green Chemistry and Chemical ProcessesShanghai Engineering Research Center of Molecular Therapeutics and New Drug DevelopmentSchool of Chemistry and Molecular EngineeringEast China Normal UniversityShanghai200241China
| |
Collapse
|
20
|
David R, de la Puente M, Gomez A, Anton O, Stirnemann G, Laage D. ArcaNN: automated enhanced sampling generation of training sets for chemically reactive machine learning interatomic potentials. DIGITAL DISCOVERY 2024:d4dd00209a. [PMID: 39553851 PMCID: PMC11563209 DOI: 10.1039/d4dd00209a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2024] [Accepted: 10/21/2024] [Indexed: 11/19/2024]
Abstract
The emergence of artificial intelligence is profoundly impacting computational chemistry, particularly through machine-learning interatomic potentials (MLIPs). Unlike traditional potential energy surface representations, MLIPs overcome the conventional computational scaling limitations by offering an effective combination of accuracy and efficiency for calculating atomic energies and forces to be used in molecular simulations. These MLIPs have significantly enhanced molecular simulations across various applications, including large-scale simulations of materials, interfaces, chemical reactions, and beyond. Despite these advances, the construction of training datasets-a critical component for the accuracy of MLIPs-has not received proportional attention, especially in the context of chemical reactivity, which depends on rare barrier-crossing events that are not easily included in the datasets. Here we address this gap by introducing ArcaNN, a comprehensive framework designed for generating training datasets for reactive MLIPs. ArcaNN employs a concurrent learning approach combined with advanced sampling techniques to ensure an accurate representation of high-energy geometries. The framework integrates automated processes for iterative training, exploration, new configuration selection, and energy and force labeling, all while ensuring reproducibility and documentation. We demonstrate ArcaNN's capabilities through two paradigm reactions: a nucleophilic substitution and a Diels-Alder reaction. These examples showcase its effectiveness, the uniformly low error of the resulting MLIP everywhere along the chemical reaction coordinate, and its potential for broad applications in reactive molecular dynamics. Finally, we provide guidelines for assessing the quality of MLIPs in reactive systems.
Collapse
Affiliation(s)
- Rolf David
- PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS 75005 Paris France
| | - Miguel de la Puente
- PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS 75005 Paris France
| | - Axel Gomez
- PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS 75005 Paris France
| | - Olaia Anton
- PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS 75005 Paris France
| | - Guillaume Stirnemann
- PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS 75005 Paris France
| | - Damien Laage
- PASTEUR, Département de Chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS 75005 Paris France
| |
Collapse
|
21
|
Jia L, Brémond É, Zaida L, Gaüzère B, Tognetti V, Joubert L. Predicting redox potentials by graph-based machine learning methods. J Comput Chem 2024; 45:2383-2396. [PMID: 38923574 DOI: 10.1002/jcc.27380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 03/25/2024] [Accepted: 04/19/2024] [Indexed: 06/28/2024]
Abstract
The evaluation of oxidation and reduction potentials is a pivotal task in various chemical fields. However, their accurate prediction by theoretical computations, which is a complementary task and sometimes the only alternative to experimental measurement, may be often resource-intensive and time-consuming. This paper addresses this challenge through the application of machine learning techniques, with a particular focus on graph-based methods (such as graph edit distances, graph kernels, and graph neural networks) that are reviewed to enlighten their deep links with theoretical chemistry. To this aim, we establish the ORedOx159 database, a comprehensive, homogeneous (with reference values stemming from density functional theory calculations), and reliable resource containing 318 one-electron reduction and oxidation reactions and featuring 159 large organic compounds. Subsequently, we provide an instructive overview of the good practice in machine learning and of commonly utilized machine learning models. We then assess their predictive performances on the ORedOx159 dataset through extensive analyses. Our simulations using descriptors that are computed in an almost instantaneous way result in a notable improvement in prediction accuracy, with mean absolute error (MAE) values equal to 5.6 kcal mol- 1 for reduction and 7.2 kcal mol- 1 for oxidation potentials, which paves a way toward efficient in silico design of new electrochemical systems.
Collapse
Affiliation(s)
- Linlin Jia
- The PRG Group, Institute of Computer Science, University of Bern, Bern, Switzerland
| | - Éric Brémond
- Université Paris Cité, ITODYS, CNRS, Paris, France
| | | | - Benoit Gaüzère
- LITIS, Univ Rouen Normandie, INSA Rouen Normandie, Université Le Havre Normandie, Normandie Univ, Rouen, France
| | - Vincent Tognetti
- Normandy Univ., COBRA UMR 6014 & FR 3038, Université de Rouen, INSA Rouen, CNRS, Mont St Aignan Cedex, France
| | - Laurent Joubert
- Normandy Univ., COBRA UMR 6014 & FR 3038, Université de Rouen, INSA Rouen, CNRS, Mont St Aignan Cedex, France
| |
Collapse
|
22
|
Bylaska EJ, Panyala A, Bauman NP, Peng B, Pathak H, Mejia-Rodriguez D, Govind N, Williams-Young DB, Aprà E, Bagusetty A, Mutlu E, Jackson KA, Baruah T, Yamamoto Y, Pederson MR, Withanage KPK, Pedroza-Montero JN, Bilbrey JA, Choudhury S, Firoz J, Herman KM, Xantheas SS, Rigor P, Vila FD, Rehr JJ, Fung M, Grofe A, Johnston C, Baker N, Kaneko K, Liu H, Kowalski K. Electronic structure simulations in the cloud computing environment. J Chem Phys 2024; 161:150902. [PMID: 39431777 DOI: 10.1063/5.0226437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 09/15/2024] [Indexed: 10/22/2024] Open
Abstract
The transformative impact of modern computational paradigms and technologies, such as high-performance computing (HPC), quantum computing, and cloud computing, has opened up profound new opportunities for scientific simulations. Scalable computational chemistry is one beneficiary of this technological progress. The main focus of this paper is on the performance of various quantum chemical formulations, ranging from low-order methods to high-accuracy approaches, implemented in different computational chemistry packages and libraries, such as NWChem, NWChemEx, Scalable Predictive Methods for Excitations and Correlated Phenomena, ExaChem, and Fermi-Löwdin orbital self-interaction correction on Azure Quantum Elements, Microsoft's cloud services platform for scientific discovery. We pay particular attention to the intricate workflows for performing complex chemistry simulations, associated data curation, and mechanisms for accuracy assessment, which is demonstrated with the Arrows automated workflow for high throughput simulations. Finally, we provide a perspective on the role of cloud computing in supporting the mission of leadership computational facilities.
Collapse
Affiliation(s)
- Eric J Bylaska
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Ajay Panyala
- Advanced Computing, Mathematics, and Data Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Nicholas P Bauman
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Bo Peng
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Himadri Pathak
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Daniel Mejia-Rodriguez
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Niranjan Govind
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - David B Williams-Young
- Applied Mathematics and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Edoardo Aprà
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Abhishek Bagusetty
- Argonne Leadership Computing Facility, Argonne National Laboratory, 9700 South Cass Avenue, Building 240, Argonne, Illinois 60439, USA
| | - Erdal Mutlu
- Advanced Computing, Mathematics, and Data Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Koblar A Jackson
- Physics Department, Central Michigan University, Mt. Pleasant, Michigan 48859, USA
| | - Tunna Baruah
- Department of Physics, University of Texas at El Paso, El Paso, Texas 79968, USA
| | - Yoh Yamamoto
- Department of Physics, University of Texas at El Paso, El Paso, Texas 79968, USA
| | - Mark R Pederson
- Department of Physics, University of Texas at El Paso, El Paso, Texas 79968, USA
| | | | | | - Jenna A Bilbrey
- Artificial Intelligence and Data Analytics Division, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Sutanay Choudhury
- Advanced Computing, Mathematics, and Data Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Jesun Firoz
- Advanced Computing, Mathematics, and Data Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Kristina M Herman
- Department of Chemistry, University of Washington, Seattle, Washington 98195, USA
| | - Sotiris S Xantheas
- Advanced Computing, Mathematics, and Data Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
- Department of Chemistry, University of Washington, Seattle, Washington 98195, USA
| | - Paul Rigor
- Center for Cloud Computing, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Fernando D Vila
- Department of Physics, University of Washington, Seattle, Washington 98195, USA
| | - John J Rehr
- Department of Physics, University of Washington, Seattle, Washington 98195, USA
| | - Mimi Fung
- Microsoft Azure Quantum, Redmond, Washington 98052, USA
| | - Adam Grofe
- Microsoft Azure Quantum, Redmond, Washington 98052, USA
| | | | - Nathan Baker
- Microsoft Azure Quantum, Redmond, Washington 98052, USA
| | - Ken Kaneko
- Microsoft Azure Quantum, Redmond, Washington 98052, USA
| | - Hongbin Liu
- Microsoft Azure Quantum, Redmond, Washington 98052, USA
| | - Karol Kowalski
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| |
Collapse
|
23
|
Ashbrook SE. Concluding remarks: Faraday Discussion on NMR crystallography. Faraday Discuss 2024. [PMID: 39420802 DOI: 10.1039/d4fd00155a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
This Faraday Discussion explored the field of NMR crystallography, and considered recent developments in experimental and theoretical approaches, new advances in machine learning and in the generation and handling of large amounts of data. Applications to a wide range of disordered, amorphous and dynamic systems demonstrated the range and quality of information available from this approach and the challenges that are faced in exploiting automation and developing best practice. In these closing remarks I will reflect on the discussions on the current state of the art, questions about what we want from these studies, how accurate we need results to be, how we best generate models for complex materials and what machine learning approaches can offer. These remarks close with thoughts about the future direction of the field, who will be carrying out this type of research, how they might be doing it and what their focus will be, along with likely possible challenges and opportunities.
Collapse
Affiliation(s)
- Sharon E Ashbrook
- School of Chemistry, EaStCHEM and Centre of Magnetic Resonance, University of St Andrews, North Haugh, St Andrews KY16 9ST, UK.
| |
Collapse
|
24
|
Schreder L, Luber S. Implementation of frozen density embedding in CP2K and OpenMolcas: CASSCF wavefunctions embedded in a Gaussian and plane wave DFT environment. J Chem Phys 2024; 161:144110. [PMID: 39387407 DOI: 10.1063/5.0222409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Accepted: 09/18/2024] [Indexed: 10/15/2024] Open
Abstract
Most chemical processes happen at a local scale where only a subset of molecular orbitals is directly involved and only a subset of covalent bonds may be rearranged. To model such reactions, Density Functional Theory (DFT) is often inadequate, and the use of computationally more expensive correlated wavefunction (WF) methods is required for accurate results. Mixed-resolution approaches backed by quantum embedding theory have been used extensively to approach this imbalance. Based on the frozen density embedding freeze-and-thaw algorithm, we describe an approach to embed complete active space self-consistent field simulations run in the OpenMolcas code in a DFT environment calculated in CP2K without requiring any external tools. This makes it possible to study a local, active part of a chemical system in a larger and relatively static environment with a computational cost balanced between the accuracy of a WF method and the efficiency of DFT, which we test on environment-subsystem pairs. Finally, we apply the implementation to an oxygen molecule leaving an aluminum (111) surface and a ruthenium(IV) oxide (110) surface.
Collapse
Affiliation(s)
- Lukas Schreder
- University of Zürich, Winterthurerstrasse 190, 8057 Zürich, Switzerland
| | - Sandra Luber
- University of Zürich, Winterthurerstrasse 190, 8057 Zürich, Switzerland
| |
Collapse
|
25
|
Cheng AH, Ser CT, Skreta M, Guzmán-Cordero A, Thiede L, Burger A, Aldossary A, Leong SX, Pablo-García S, Strieth-Kalthoff F, Aspuru-Guzik A. Spiers Memorial Lecture: How to do impactful research in artificial intelligence for chemistry and materials science. Faraday Discuss 2024. [PMID: 39400305 DOI: 10.1039/d4fd00153b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2024]
Abstract
Machine learning has been pervasively touching many fields of science. Chemistry and materials science are no exception. While machine learning has been making a great impact, it is still not reaching its full potential or maturity. In this perspective, we first outline current applications across a diversity of problems in chemistry. Then, we discuss how machine learning researchers view and approach problems in the field. Finally, we provide our considerations for maximizing impact when researching machine learning for chemistry.
Collapse
Affiliation(s)
- Austin H Cheng
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Cher Tian Ser
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Marta Skreta
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Andrés Guzmán-Cordero
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
- Tinbergen Institute, University of Amsterdam, Amsterdam, Netherlands
| | - Luca Thiede
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Andreas Burger
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | | | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, Singapore 63737, Singapore
| | | | | | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
- Acceleration Consortium, Toronto, Ontario M5G 1X6, Canada
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Canada
- Department of Materials Science and Engineering, University of Toronto, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), Canada
| |
Collapse
|
26
|
Kubečka J, Ayoubi D, Tang Z, Knattrup Y, Engsvang M, Wu H, Elm J. Accurate modeling of the potential energy surface of atmospheric molecular clusters boosted by neural networks. ENVIRONMENTAL SCIENCE. ADVANCES 2024; 3:1438-1451. [PMID: 39176037 PMCID: PMC11334116 DOI: 10.1039/d4va00255e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 08/09/2024] [Indexed: 08/24/2024]
Abstract
The computational cost of accurate quantum chemistry (QC) calculations of large molecular systems can often be unbearably high. Machine learning offers a lower computational cost compared to QC methods while maintaining their accuracy. In this study, we employ the polarizable atom interaction neural network (PaiNN) architecture to train and model the potential energy surface of molecular clusters relevant to atmospheric new particle formation, such as sulfuric acid-ammonia clusters. We compare the differences between PaiNN and previous kernel ridge regression modeling for the Clusteromics I-V data sets. We showcase three models capable of predicting electronic binding energies and interatomic forces with mean absolute errors of <0.3 kcal mol-1 and <0.2 kcal mol-1 Å-1, respectively. Furthermore, we demonstrate that the error of the modeled properties remains below the chemical accuracy of 1 kcal mol-1 even for clusters vastly larger than those in the training database (up to (H2SO4)15(NH3)15 clusters, containing 30 molecules). Consequently, we emphasize the potential applications of these models for faster and more thorough configurational sampling and for boosting molecular dynamics studies of large atmospheric molecular clusters.
Collapse
Affiliation(s)
- Jakub Kubečka
- Department of Chemistry, Aarhus University Langelandsgade 140 8000 Aarhus C Denmark +420 724946622
| | - Daniel Ayoubi
- Department of Chemistry, Aarhus University Langelandsgade 140 8000 Aarhus C Denmark +420 724946622
| | - Zeyuan Tang
- Center for Interstellar Catalysis, Department of Physics and Astronomy, Aarhus University Ny Munkegade 120 8000 Aarhus C Denmark
| | - Yosef Knattrup
- Department of Chemistry, Aarhus University Langelandsgade 140 8000 Aarhus C Denmark +420 724946622
| | - Morten Engsvang
- Department of Chemistry, Aarhus University Langelandsgade 140 8000 Aarhus C Denmark +420 724946622
| | - Haide Wu
- Department of Chemistry, Aarhus University Langelandsgade 140 8000 Aarhus C Denmark +420 724946622
| | - Jonas Elm
- Department of Chemistry, Aarhus University Langelandsgade 140 8000 Aarhus C Denmark +420 724946622
| |
Collapse
|
27
|
Shahabuddin, Uzma, Azam M, Parveen M, Kadir NHA, Min K, Alam M. Exploring 7β-amino-6-nitrocholestens as COVID-19 antivirals: in silico, synthesis, evaluation, and integration of artificial intelligence (AI) in drug design: assessing the cytotoxicity and antioxidant activity of 3β-acetoxynitrocholestane. RSC Med Chem 2024:d4md00257a. [PMID: 39430952 PMCID: PMC11485945 DOI: 10.1039/d4md00257a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2024] [Accepted: 09/22/2024] [Indexed: 10/22/2024] Open
Abstract
In light of the ongoing pandemic caused by SARS-CoV-2, effective and clinically translatable treatments are desperately needed for COVID-19 and its emerging variants. In this study, some derivatives, including 7β-aminocholestene compounds, and 3β-acetoxy-6-nitrocholesta-4,6-diene were synthesized, in quantitative yields from 7β-bromo-6-nitrocholest-5-enes (1-3) with a small library of amines. The synthesized steroidal products were then thoroughly characterized using a range of physicochemical techniques, including IR, NMR, UV, MS, and elemental analysis. Next, a virtual screening based on structures using docking studies was conducted to investigate the potential of these synthesized compounds as therapeutic candidates against SARS-CoV-2. Specifically, we evaluated the compounds' binding energy of the reactants and their products with three SARS-CoV-2 functional proteins: the papain-like protease, 3C-like protease or main protease, and RNA-dependent RNA polymerase. Our results indicate that the 7β-aminocholestene derivatives (4-8) display intermediate to excellent binding energy, suggesting that they interact strongly with the receptor's active amino acids and may be promising drug candidates for inhibiting SARS-CoV-2. Although the starting steroid derivatives; 7β-bromo-6-nitrocholest-5-enes (1-3) and one steroid product; 3β-acetoxy-6-nitrocholesta-4,6-diene (9) exhibited strong binding energies with various SARS-CoV-2 receptors, they did not meet the Lipinski Rule and ADMET properties required for drug development. These compounds showed either mutagenic or reproductive/developmental toxicity when assessed using toxicity prediction software. The findings based on structure-based virtual screening, suggest that 7β-aminocholestaines (4-8) may be useful for reducing the susceptibility to SARS-CoV-2 infection. The docking pose of compound 4, which has a high score of -7.4 kcal mol-1, was subjected to AI-assisted deep learning to generate 60 AI-designed molecules for drug design. Molecular docking of these AI molecules was performed to select optimal candidates for further analysis and visualization. The cytotoxicity and antioxidant effects of 3β-acetoxy-6-nitrocholesta-4,6-diene were tested in vitro, showing marked cytotoxicity and antioxidant activity. To elucidate the molecular basis for these effects, steroidal compound 9 was subjected to molecular docking analysis to identify potential binding interactions. The stability of the top-ranked docking pose was subsequently assessed using molecular dynamics simulations.
Collapse
Affiliation(s)
- Shahabuddin
- Department of Applied Chemistry, Z. H. College of Engineering & Technology, Aligarh Muslim University Aligarh 202002 India
| | - Uzma
- Division of Organic Synthesis, Department of Chemistry, Aligarh Muslim University Aligarh 202002 India
| | - Mohammad Azam
- Department of Chemistry, College of Science, King Saud University PO 2455 Riyadh 11451 Saudi Arabia
| | - Mehtab Parveen
- Division of Organic Synthesis, Department of Chemistry, Aligarh Muslim University Aligarh 202002 India
| | - Nurul Huda Abd Kadir
- Faculty of Science and Environmental Marine, Universiti Malaysia Terengganu 21030 Terengganu Malaysia
| | - Kim Min
- Department of Safety Engineering, Dongguk University 123 Dongdae-ro Gyeongju-si Gyeongbuk 780714 South Korea
| | - Mahboob Alam
- Department of Safety Engineering, Dongguk University 123 Dongdae-ro Gyeongju-si Gyeongbuk 780714 South Korea
| |
Collapse
|
28
|
Middleton C, Curchod BFE, Penfold TJ. Partial density of states representation for accurate deep neural network predictions of X-ray spectra. Phys Chem Chem Phys 2024; 26:24477-24487. [PMID: 39264269 DOI: 10.1039/d4cp01368a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/13/2024]
Abstract
The performance of a machine learning (ML) algorithm for chemistry is highly contingent upon the architect's choice of input representation. This work introduces the partial density of states (p-DOS) descriptor: a novel, quantum-inspired structural representation which encodes relevant electronic information for machine learning models seeking to simulate X-ray spectroscopy. p-DOS uses a minimal basis set in conjunction with a guess (non-optimised) electronic configuration to extract and then discretise the density of states (DOS) of the absorbing atom to form the input vector. We demonstrate that while the electronically-focused p-DOS performs well in isolation, optimal performance is achieved when supplemented with nuclear structural information imparted via a geometric representation. p-DOS provides a description of the key electronic properties of a system which is not only concise and computationally efficient, but also independent of molecular size or choice of basis set. It can be rapidly generated, facilitating its application with large training sets. Its performance is demonstrated using a wide variety of examples at the sulphur K-edge, including the prediction of ultrafast X-ray spectroscopic signal associated with photoexcited 2(5H)-thiophenone. These results highlight the potential for ML models developed using p-DOS to contribute to the interpretation and prediction of experimental results e.g. in operando measurements of batteries and/or catalysts and femtosecond time-resolved studies, especially those made possible by emergent cutting-edge technologies, especially X-ray free electron lasers.
Collapse
Affiliation(s)
- Clelia Middleton
- Chemistry, School of Natural and Environmental Sciences, Newcastle University, Great North Road, Newcastle upon Tyne, NE1 7RU, UK.
| | - Basile F E Curchod
- Centre for Computational Chemistry, School of Chemistry, Cantock's Close, University of Bristol, Bristol, BS8 1TS, UK
| | - Thomas J Penfold
- Chemistry, School of Natural and Environmental Sciences, Newcastle University, Great North Road, Newcastle upon Tyne, NE1 7RU, UK.
| |
Collapse
|
29
|
Crusius D, Cipcigan F, Biggin PC. Are we fitting data or noise? Analysing the predictive power of commonly used datasets in drug-, materials-, and molecular-discovery. Faraday Discuss 2024. [PMID: 39308206 DOI: 10.1039/d4fd00091a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2024]
Abstract
Data-driven techniques for establishing quantitative structure property relations are a pillar of modern materials and molecular discovery. Fuelled by the recent progress in deep learning methodology and the abundance of new algorithms, it is tempting to chase benchmarks and incrementally build ever more capable machine learning (ML) models. While model evaluation has made significant progress, the intrinsic limitations arising from the underlying experimental data are often overlooked. In the chemical sciences data collection is costly, thus datasets are small and experimental errors can be significant. These limitations of such datasets affect their predictive power, a fact that is rarely considered in a quantitative way. In this study, we analyse commonly used ML datasets for regression and classification from drug discovery, molecular discovery, and materials discovery. We derived maximum and realistic performance bounds for nine such datasets by introducing noise based on estimated or actual experimental errors. We then compared the estimated performance bounds to the reported performance of leading ML models in the literature. Out of the nine datasets and corresponding ML models considered, four were identified to have reached or surpassed dataset performance limitations and thus, they may potentially be fitting noise. More generally, we systematically examine how data range, the magnitude of experimental error, and the number of data points influence dataset performance bounds. Alongside this paper, we release the Python package NoiseEstimator and provide a web-based application for computing realistic performance bounds. This study and the resulting tools will help practitioners in the field understand the limitations of datasets and set realistic expectations for ML model performance. This work stands as a reference point, offering analysis and tools to guide development of future ML models in the chemical sciences.
Collapse
Affiliation(s)
- Daniel Crusius
- Department of Biochemistry, University of Oxford, South Parks Road, Oxford OX1 3QU, UK.
| | - Flaviu Cipcigan
- IBM Research Europe, The Hartree Centre STFC Laboratory, Sci-Tech Daresbury, Warrington WA4 4AD, UK
| | - Philip C Biggin
- Department of Biochemistry, University of Oxford, South Parks Road, Oxford OX1 3QU, UK.
| |
Collapse
|
30
|
Askerova U, Abdullayev Y, Shikhaliyev N, Maharramov A, Nenajdenko VG, Autschbach J. Computational exploration of the copper(I)-catalyzed conversion of hydrazones to dihalogenated vinyldiazene derivatives. J Comput Chem 2024; 45:2098-2103. [PMID: 38760058 DOI: 10.1002/jcc.27433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 04/29/2024] [Accepted: 05/03/2024] [Indexed: 05/19/2024]
Abstract
This computational study explores the copper (I) chloride catalyzed synthesis of (E)-1-(2,2-dichloro-1-phenylvinyl)-2-phenyldiazene (2Cl-VD) from readily available hydrazone derivative and carbon tetrachloride (CCl4). 2Cl-VD has been extensively utilized to synthesize variety of heterocyclic organic compounds in mild conditions. The present computational investigations primarily focus on understanding the role of copper (I) and N1,N1,N2,N2-tetramethylethane-1,2-diamine (TMEDA) in this reaction, TMEDA often being considered a proton scavenger by experimentalists. Considering TMEDA as a ligand significantly alters the energy barrier. In fact, it is only 8.3 kcal/mol higher compared to the ligand-free (LF) route for the removal of a chlorine atom to form the radical ·CCl3 but the following steps are almost barrierless. This intermediate then participates in attacking the electrophilic carbon in the hydrazone. Crucially, the study reveals that the overall potential energy surface is thermodynamically favorable, and the theoretical turnover frequency (TOF) value is higher in the case of Cu(I)-TMEDA complex catalyzed pathway.
Collapse
Affiliation(s)
- Ulviyya Askerova
- Organic Chemistry Department, Baku State University, Baku, Azerbaijan
| | - Yusif Abdullayev
- Department of Chemical Engineering, Baku Engineering University, Baku, Azerbaijan
- Institute of Petrochemical Processes, Azerbaijan National Academy of Sciences, Baku, Azerbaijan
- Department of Chemistry, Sumgait State University, Sumgait, Azerbaijan
| | - Namiq Shikhaliyev
- Organic Chemistry Department, Baku State University, Baku, Azerbaijan
| | - Abel Maharramov
- Organic Chemistry Department, Baku State University, Baku, Azerbaijan
| | | | - Jochen Autschbach
- Department of Chemistry, University at Buffalo, State University of New York, Buffalo, New York, USA
| |
Collapse
|
31
|
Masmali I, Nadeem MF, Mufti ZS, Ahmad A, Koam ANA, Ghazwani H. Data-driven approaches to study the spectral properties of chemical structures. Heliyon 2024; 10:e37459. [PMID: 39290266 PMCID: PMC11407057 DOI: 10.1016/j.heliyon.2024.e37459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 09/03/2024] [Accepted: 09/04/2024] [Indexed: 09/19/2024] Open
Abstract
The molecular energy, which is the sum of all eigenvalues, is crucial in determining the total π-electron energy of conjugated hydrocarbon molecules. We used machine learning techniques to calculate the energy, inertia, nullity, signature, and Estrada index of molecular graphs for bismuth tri-iodide and benzene rings embedded in P-type surfaces within 2D networks. We applied MATLAB to extract the actual eigenvalues from the data and developed general equations for these molecular properties. We then used these equations to estimate the values and compared them to the actual values through graphical analysis. Our results demonstrate the potential of data-driven techniques in predicting molecular properties and enhancing our understanding of spectral theory.
Collapse
Affiliation(s)
- Ibtisam Masmali
- Department of Mathematics, College of Science, Jazan University, Jazan, 45142, Saudi Arabia
| | - Muhammad Faisal Nadeem
- Department of Mathematics, COMSATS University Islamabad Lahore Campus, Lahore, 54000, Pakistan
| | - Zeeshan Saleem Mufti
- Department of Mathematics and Statistics, The University of Lahore, Lahore, 54000, Pakistan
| | - Ali Ahmad
- Department of Computer Science, College of Engineering and Computer Science, Jazan University, Jazan 45142, Kingdom of Saudi Arabia
| | - Ali N A Koam
- Department of Mathematics, College of Science, Jazan University, Jazan, 45142, Saudi Arabia
| | - Haleemah Ghazwani
- Department of Mathematics, College of Science, Jazan University, Jazan, 45142, Saudi Arabia
| |
Collapse
|
32
|
Schmid SP, Schlosser L, Glorius F, Jorner K. Catalysing (organo-)catalysis: Trends in the application of machine learning to enantioselective organocatalysis. Beilstein J Org Chem 2024; 20:2280-2304. [PMID: 39290209 PMCID: PMC11406055 DOI: 10.3762/bjoc.20.196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Accepted: 08/09/2024] [Indexed: 09/19/2024] Open
Abstract
Organocatalysis has established itself as a third pillar of homogeneous catalysis, besides transition metal catalysis and biocatalysis, as its use for enantioselective reactions has gathered significant interest over the last decades. Concurrent to this development, machine learning (ML) has been increasingly applied in the chemical domain to efficiently uncover hidden patterns in data and accelerate scientific discovery. While the uptake of ML in organocatalysis has been comparably slow, the last two decades have showed an increased interest from the community. This review gives an overview of the work in the field of ML in organocatalysis. The review starts by giving a short primer on ML for experimental chemists, before discussing its application for predicting the selectivity of organocatalytic transformations. Subsequently, we review ML employed for privileged catalysts, before focusing on its application for catalyst and reaction design. Concluding, we give our view on current challenges and future directions for this field, drawing inspiration from the application of ML to other scientific domains.
Collapse
Affiliation(s)
- Stefan P Schmid
- Institute of Chemical and Bioengineering, Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich CH-8093, Switzerland
| | - Leon Schlosser
- Organisch-Chemisches Institut, Universität Münster, 48149 Münster, Germany
| | - Frank Glorius
- Organisch-Chemisches Institut, Universität Münster, 48149 Münster, Germany
| | - Kjell Jorner
- Institute of Chemical and Bioengineering, Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich CH-8093, Switzerland
- National Centre of Competence in Research (NCCR) Catalysis, ETH Zurich, Zurich CH-8093, Switzerland
| |
Collapse
|
33
|
Zare M, Sahsah D, Saleheen M, Behler J, Heyden A. Hybrid Quantum Mechanical, Molecular Mechanical, and Machine Learning Potential for Computing Aqueous-Phase Adsorption Free Energies on Metal Surfaces. J Chem Theory Comput 2024. [PMID: 39254514 DOI: 10.1021/acs.jctc.4c00869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/11/2024]
Abstract
Performing reliable computer simulations of elementary processes occurring at metal-water interfaces is pivotal for novel catalyst design in sustainable energy applications. Computational catalyst design hinges on the ability to reliably and efficiently compute the potential energy surface (PES) of the system. Due to the large system sizes needed for studying processes at liquid water-metal interfaces, these systems can currently not be described using density functional theory (DFT). In this work, we used a hybrid quantum mechanical, molecular mechanical, and machine learning potential for studying the adsorption behavior of phenol, atomic hydrogen, 2-butanol, and 2-butanone on the (0001) facet of Ru under reducing conditions when Ru is not oxidized. Specifically, we describe the adsorbate and the surrounding metal atoms at the DFT level of theory. Here, we also considered the electrostatic field effect of the water molecules on adsorbate-metal interactions. Next, for the water-water and water-adsorbate interactions, we used established classical force fields. Finally, for the water-Ru surface interaction, for which no reliable force fields have been published, we used Behler-Parrinello high-dimensional neural network potentials (HDNNPs). Employing this setup, we used our explicit solvation for metal surface (eSMS) approach to compute the aqueous-phase effect on the low-coverage adsorption of selected molecules and atoms on the (0001) facet of Ru. In agreement with previous experimental and computational studies of oxygenated molecules over transition metal facets, we found that liquid water destabilizes the tested adsorbates on Ru(0001). Interestingly, our findings indicate that adsorbates on Ru are less affected by the presence of an aqueous phase than on other transition metals (e.g., Pt), highlighting the necessity of experimental investigations of Ru-based catalytic systems in liquid water.
Collapse
Affiliation(s)
- Mehdi Zare
- Department of Chemical Engineering, University of South Carolina, Columbia, South Carolina 29208, United States
| | - Dia Sahsah
- Department of Chemical Engineering, University of South Carolina, Columbia, South Carolina 29208, United States
| | - Mohammad Saleheen
- Department of Chemical Engineering, University of South Carolina, Columbia, South Carolina 29208, United States
| | - Jörg Behler
- Lehrstuhl für Theoretische Chemie II, Ruhr-Universität Bochum, Bochum 44780, Germany
- Research Center Chemical Sciences and Sustainability, Research Alliance Ruhr, Bochum 44780, Germany
| | - Andreas Heyden
- Department of Chemical Engineering, University of South Carolina, Columbia, South Carolina 29208, United States
| |
Collapse
|
34
|
Hou P, Tian Y, Meng X. Improving Molecular-Dynamics Simulations for Solid-Liquid Interfaces with Machine-Learning Interatomic Potentials. Chemistry 2024; 30:e202401373. [PMID: 38877181 DOI: 10.1002/chem.202401373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2024] [Revised: 06/13/2024] [Accepted: 06/14/2024] [Indexed: 06/16/2024]
Abstract
Emerging developments in artificial intelligence have opened infinite possibilities for material simulation. Depending on the powerful fitting of machine learning algorithms to first-principles data, machine learning interatomic potentials (MLIPs) can effectively balance the accuracy and efficiency problems in molecular dynamics (MD) simulations, serving as powerful tools in various complex physicochemical systems. Consequently, this brings unprecedented enthusiasm for researchers to apply such novel technology in multiple fields to revisit the major scientific problems that have remained controversial owing to the limitations of previous computational methods. Herein, we introduce the evolution of MLIPs, provide valuable application examples for solid-liquid interfaces, and present current challenges. Driven by solving multitudinous difficulties in terms of the accuracy, efficiency, and versatility of MLIPs, this booming technique, combined with molecular simulation methods, will provide an underlying and valuable understanding of interdisciplinary scientific challenges, including materials, physics, and chemistry.
Collapse
Affiliation(s)
- Pengfei Hou
- Key Laboratory of Physics and Technology for Advanced Batteries (Ministry of Education), College of Physics, Jilin University, Changchun, 130012, China
- Key Laboratory of Material Simulation Methods and Software of Ministry of Education, College of Physics, Jilin University, Changchun, 130012, China
| | - Yumiao Tian
- Key Laboratory of Physics and Technology for Advanced Batteries (Ministry of Education), College of Physics, Jilin University, Changchun, 130012, China
- Key Laboratory of Material Simulation Methods and Software of Ministry of Education, College of Physics, Jilin University, Changchun, 130012, China
| | - Xing Meng
- Key Laboratory of Physics and Technology for Advanced Batteries (Ministry of Education), College of Physics, Jilin University, Changchun, 130012, China
- Key Laboratory of Material Simulation Methods and Software of Ministry of Education, College of Physics, Jilin University, Changchun, 130012, China
| |
Collapse
|
35
|
Tom G, Schmid SP, Baird SG, Cao Y, Darvish K, Hao H, Lo S, Pablo-García S, Rajaonson EM, Skreta M, Yoshikawa N, Corapi S, Akkoc GD, Strieth-Kalthoff F, Seifrid M, Aspuru-Guzik A. Self-Driving Laboratories for Chemistry and Materials Science. Chem Rev 2024; 124:9633-9732. [PMID: 39137296 PMCID: PMC11363023 DOI: 10.1021/acs.chemrev.4c00055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/15/2024]
Abstract
Self-driving laboratories (SDLs) promise an accelerated application of the scientific method. Through the automation of experimental workflows, along with autonomous experimental planning, SDLs hold the potential to greatly accelerate research in chemistry and materials discovery. This review provides an in-depth analysis of the state-of-the-art in SDL technology, its applications across various scientific disciplines, and the potential implications for research and industry. This review additionally provides an overview of the enabling technologies for SDLs, including their hardware, software, and integration with laboratory infrastructure. Most importantly, this review explores the diverse range of scientific domains where SDLs have made significant contributions, from drug discovery and materials science to genomics and chemistry. We provide a comprehensive review of existing real-world examples of SDLs, their different levels of automation, and the challenges and limitations associated with each domain.
Collapse
Affiliation(s)
- Gary Tom
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Stefan P. Schmid
- Department
of Chemistry and Applied Biosciences, ETH
Zurich, Vladimir-Prelog-Weg 1, CH-8093 Zurich, Switzerland
| | - Sterling G. Baird
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Yang Cao
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Kourosh Darvish
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Han Hao
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Stanley Lo
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
| | - Sergio Pablo-García
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
| | - Ella M. Rajaonson
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Marta Skreta
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Naruki Yoshikawa
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Samantha Corapi
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
| | - Gun Deniz Akkoc
- Forschungszentrum
Jülich GmbH, Helmholtz Institute
for Renewable Energy Erlangen-Nürnberg, Cauerstr. 1, 91058 Erlangen, Germany
- Department
of Chemical and Biological Engineering, Friedrich-Alexander Universität Erlangen-Nürnberg, Egerlandstr. 3, 91058 Erlangen, Germany
| | - Felix Strieth-Kalthoff
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- School of
Mathematics and Natural Sciences, University
of Wuppertal, Gaußstraße
20, 42119 Wuppertal, Germany
| | - Martin Seifrid
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Department
of Materials Science and Engineering, North
Carolina State University, Raleigh, North Carolina 27695, United States of America
| | - Alán Aspuru-Guzik
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
- Department
of Chemical Engineering & Applied Chemistry, University of Toronto, Toronto, Ontario M5S 3E5, Canada
- Department
of Materials Science & Engineering, University of Toronto, Toronto, Ontario M5S 3E4, Canada
- Lebovic
Fellow, Canadian Institute for Advanced
Research (CIFAR), 661
University Ave, Toronto, Ontario M5G 1M1, Canada
| |
Collapse
|
36
|
Williams CD, Kalayan J, Burton NA, Bryce RA. Stable and accurate atomistic simulations of flexible molecules using conformationally generalisable machine learned potentials. Chem Sci 2024; 15:12780-12795. [PMID: 39148799 PMCID: PMC11323334 DOI: 10.1039/d4sc01109k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Accepted: 07/07/2024] [Indexed: 08/17/2024] Open
Abstract
Computational simulation methods based on machine learned potentials (MLPs) promise to revolutionise shape prediction of flexible molecules in solution, but their widespread adoption has been limited by the way in which training data is generated. Here, we present an approach which allows the key conformational degrees of freedom to be properly represented in reference molecular datasets. MLPs trained on these datasets using a global descriptor scheme are generalisable in conformational space, providing quantum chemical accuracy for all conformers. These MLPs are capable of propagating long, stable molecular dynamics trajectories, an attribute that has remained a challenge. We deploy the MLPs in obtaining converged conformational free energy surfaces for flexible molecules via well-tempered metadynamics simulations; this approach provides a hitherto inaccessible route to accurately computing the structural, dynamical and thermodynamical properties of a wide variety of flexible molecular systems. It is further demonstrated that MLPs must be trained on reference datasets with complete coverage of conformational space, including in barrier regions, to achieve stable molecular dynamics trajectories.
Collapse
Affiliation(s)
- Christopher D Williams
- Division of Pharmacy and Optometry, School of Health Sciences, Faculty of Biology, Medicine and Health, The University of Manchester Oxford Road Manchester M13 9PL UK
| | - Jas Kalayan
- Science and Technologies Facilities Council (STFC), Daresbury Laboratory Keckwick Lane, Daresbury Warrington WA4 4AD UK
| | - Neil A Burton
- Department of Chemistry, School of Natural Sciences, Faculty of Science and Engineering, The University of Manchester Oxford Road Manchester M13 9PL UK
| | - Richard A Bryce
- Division of Pharmacy and Optometry, School of Health Sciences, Faculty of Biology, Medicine and Health, The University of Manchester Oxford Road Manchester M13 9PL UK
| |
Collapse
|
37
|
Sarangi R, Maity S, Acharya A. Machine Learning Approach to Vertical Energy Gap in Redox Processes. J Chem Theory Comput 2024; 20:6747-6755. [PMID: 39044422 PMCID: PMC11325558 DOI: 10.1021/acs.jctc.4c00715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/25/2024]
Abstract
A straightforward approach to calculating the free energy change (ΔG) and reorganization energy of a redox process is linear response approximation (LRA). However, accurate prediction of redox properties is still challenging due to difficulties in conformational sampling and vertical energy-gap sampling. Expensive hybrid quantum mechanical/molecular mechanical (QM/MM) calculations are typically employed in sampling energy gaps using conformations from simulations. To alleviate the computational cost associated with the expensive QM method in the QM/MM calculation, we propose machine learning (ML) methods to predict the vertical energy gaps (VEGs). We tested several ML models to predict the VEGs and observed that simple models like linear regression show excellent performance (mean absolute error ∼0.1 eV) in predicting VEGs in all test systems, even when using features extracted from cheaper semiempirical methods. Our best ML model (extra trees regressor) shows a mean absolute error of around 0.1 eV while using features from the cheapest QM method. We anticipate our approach can be generalized to larger macromolecular systems with more complex redox centers.
Collapse
Affiliation(s)
- Ronit Sarangi
- Department of Chemistry, Syracuse University, Syracuse, New York 13244, United States
| | - Suman Maity
- Department of Chemistry, Syracuse University, Syracuse, New York 13244, United States
| | - Atanu Acharya
- Department of Chemistry, Syracuse University, Syracuse, New York 13244, United States
- BioInspired Syracuse, Syracuse University, Syracuse, New York 13244, United States
| |
Collapse
|
38
|
Cao Y, Balduf T, Beachy MD, Bennett MC, Bochevarov AD, Chien A, Dub PA, Dyall KG, Furness JW, Halls MD, Hughes TF, Jacobson LD, Kwak HS, Levine DS, Mainz DT, Moore KB, Svensson M, Videla PE, Watson MA, Friesner RA. Quantum chemical package Jaguar: A survey of recent developments and unique features. J Chem Phys 2024; 161:052502. [PMID: 39092934 DOI: 10.1063/5.0213317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Accepted: 07/12/2024] [Indexed: 08/04/2024] Open
Abstract
This paper is dedicated to the quantum chemical package Jaguar, which is commercial software developed and distributed by Schrödinger, Inc. We discuss Jaguar's scientific features that are relevant to chemical research as well as describe those aspects of the program that are pertinent to the user interface, the organization of the computer code, and its maintenance and testing. Among the scientific topics that feature prominently in this paper are the quantum chemical methods grounded in the pseudospectral approach. A number of multistep workflows dependent on Jaguar are covered: prediction of protonation equilibria in aqueous solutions (particularly calculations of tautomeric stability and pKa), reactivity predictions based on automated transition state search, assembly of Boltzmann-averaged spectra such as vibrational and electronic circular dichroism, as well as nuclear magnetic resonance. Discussed also are quantum chemical calculations that are oriented toward materials science applications, in particular, prediction of properties of optoelectronic materials and organic semiconductors, and molecular catalyst design. The topic of treatment of conformations inevitably comes up in real world research projects and is considered as part of all the workflows mentioned above. In addition, we examine the role of machine learning methods in quantum chemical calculations performed by Jaguar, from auxiliary functions that return the approximate calculation runtime in a user interface, to prediction of actual molecular properties. The current work is second in a series of reviews of Jaguar, the first having been published more than ten years ago. Thus, this paper serves as a rare milestone on the path that is being traversed by Jaguar's development in more than thirty years of its existence.
Collapse
Affiliation(s)
- Yixiang Cao
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Ty Balduf
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Michael D Beachy
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - M Chandler Bennett
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Art D Bochevarov
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Alan Chien
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Pavel A Dub
- Schrödinger, Inc., 9868 Scranton Road, Suite 3200, San Diego, California 92121, USA
| | - Kenneth G Dyall
- Schrödinger, Inc., 101 SW Main St., Suite 1300, Portland, Oregon 97204, USA
| | - James W Furness
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Mathew D Halls
- Schrödinger, Inc., 9868 Scranton Road, Suite 3200, San Diego, California 92121, USA
| | - Thomas F Hughes
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Leif D Jacobson
- Schrödinger, Inc., 101 SW Main St., Suite 1300, Portland, Oregon 97204, USA
| | - H Shaun Kwak
- Schrödinger, Inc., 101 SW Main St., Suite 1300, Portland, Oregon 97204, USA
| | - Daniel S Levine
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Daniel T Mainz
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Kevin B Moore
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Mats Svensson
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Pablo E Videla
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Mark A Watson
- Schrödinger, Inc., 1540 Broadway, Floor 24, New York, New York 10036, USA
| | - Richard A Friesner
- Department of Chemistry, Columbia University, 3000 Broadway, New York, New York 10027, USA
| |
Collapse
|
39
|
Cho Y, Laplaza R, Vela S, Corminboeuf C. Automated prediction of ground state spin for transition metal complexes. DIGITAL DISCOVERY 2024; 3:1638-1647. [PMID: 39118977 PMCID: PMC11305380 DOI: 10.1039/d4dd00093e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 07/10/2024] [Indexed: 08/10/2024]
Abstract
Exploiting crystallographic data repositories for large-scale quantum chemical computations requires the rapid and accurate extraction of the molecular structure, charge and spin from the crystallographic information file. Here, we develop a general approach to assign the ground state spin of transition metal complexes, in complement to our previous efforts on determining metal oxidation states and bond order within the cell2mol software. Starting from a database of 31k transition metal complexes extracted from the Cambridge Structural Database with cell2mol, we construct the TM-GSspin dataset, which contains 2063 mononuclear first row transition metal complexes and their computed ground state spins. TM-GSspin is highly diverse in terms of metals, metal oxidation states, coordination geometries, and coordination sphere compositions. Based on TM-GSspin, we identify correlations between structural and electronic features of the complexes and their ground state spins to develop a rule-based spin state assignment model. Leveraging this knowledge, we construct interpretable descriptors and build a statistical model achieving 98% cross-validated accuracy in predicting the ground state spin across the board. Our approach provides a practical way to determine the ground state spin of transition metal complexes directly from crystal structures without additional computations, thus enabling the automated use of crystallographic data for large-scale computations involving transition metal complexes.
Collapse
Affiliation(s)
- Yuri Cho
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne Lausanne Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne Lausanne Switzerland
| | - Ruben Laplaza
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne Lausanne Switzerland
- National Centre for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne Lausanne Switzerland
| | - Sergi Vela
- Departament de Ciència de Materials i Química Física and IQTCUB, Universitat de Barcelona Barcelona Spain
- Institut de Química Avançada de Catalunya (IQAC-CSIC) Barcelona Spain
| | - Clémence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne Lausanne Switzerland
- National Centre for Computational Design and Discovery of Novel Materials (MARVEL), École Polytechnique Fédérale de Lausanne Lausanne Switzerland
- National Centre for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne Lausanne Switzerland
| |
Collapse
|
40
|
Frank JT, Unke OT, Müller KR, Chmiela S. A Euclidean transformer for fast and stable machine learned force fields. Nat Commun 2024; 15:6539. [PMID: 39107296 PMCID: PMC11303804 DOI: 10.1038/s41467-024-50620-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 07/10/2024] [Indexed: 08/10/2024] Open
Abstract
Recent years have seen vast progress in the development of machine learned force fields (MLFFs) based on ab-initio reference calculations. Despite achieving low test errors, the reliability of MLFFs in molecular dynamics (MD) simulations is facing growing scrutiny due to concerns about instability over extended simulation timescales. Our findings suggest a potential connection between robustness to cumulative inaccuracies and the use of equivariant representations in MLFFs, but the computational cost associated with these representations can limit this advantage in practice. To address this, we propose a transformer architecture called SO3KRATES that combines sparse equivariant representations (Euclidean variables) with a self-attention mechanism that separates invariant and equivariant information, eliminating the need for expensive tensor products. SO3KRATES achieves a unique combination of accuracy, stability, and speed that enables insightful analysis of quantum properties of matter on extended time and system size scales. To showcase this capability, we generate stable MD trajectories for flexible peptides and supra-molecular structures with hundreds of atoms. Furthermore, we investigate the PES topology for medium-sized chainlike molecules (e.g., small peptides) by exploring thousands of minima. Remarkably, SO3KRATES demonstrates the ability to strike a balance between the conflicting demands of stability and the emergence of new minimum-energy conformations beyond the training data, which is crucial for realistic exploration tasks in the field of biochemistry.
Collapse
Affiliation(s)
- J Thorben Frank
- Machine Learning Group, TU Berlin, Berlin, Germany
- BIFOLD, Berlin Institute for the Foundations of Learning and Data, Berlin, Germany
| | | | - Klaus-Robert Müller
- Machine Learning Group, TU Berlin, Berlin, Germany.
- BIFOLD, Berlin Institute for the Foundations of Learning and Data, Berlin, Germany.
- Google DeepMind, Berlin, Germany.
- Department of Artificial Intelligence, Korea University, Seoul, Korea.
- Max Planck Institut für Informatik, Saarbrücken, Germany.
| | - Stefan Chmiela
- Machine Learning Group, TU Berlin, Berlin, Germany.
- BIFOLD, Berlin Institute for the Foundations of Learning and Data, Berlin, Germany.
| |
Collapse
|
41
|
Bianchi P, Monbaliu JCM. New Opportunities for Organic Synthesis with Superheated Flow Chemistry. Acc Chem Res 2024; 57:2207-2218. [PMID: 39043368 PMCID: PMC11308364 DOI: 10.1021/acs.accounts.4c00340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2024] [Revised: 07/08/2024] [Accepted: 07/09/2024] [Indexed: 07/25/2024]
Abstract
ConspectusFlow chemistry has brought a fresh breeze with great promises for chemical manufacturing, yet critical deterrents persist. To remain economically viable at production scales, flow processes demand quick reactions, which are actually not that common. Superheated flow technology stands out as a promising alternative poised to confront modern chemistry challenges. While continuous micro- and mesofluidic reactors offer uniform heating and rapid cooling across different scales, operating above solvent boiling points (i.e., operating under superheated conditions) significantly enhances reaction rates. Despite the energy costs associated with high temperatures, superheated flow chemistry aligns with sustainability goals by improving productivity (process intensification), offering solvent flexibility, and enhancing safety.However, navigating the unconventional chemical space of superheated flow chemistry can be cumbersome, particularly for neophytes. Expanding the temperature/pressure process window beyond the conventional boiling point under the atmospheric pressure limit vastly increases the optimization space. When associated with conventional trial-and-error approaches, this can become exceedingly wasteful, resource-intensive, and discouraging. Over the years, flow chemists have developed various tools to mitigate these challenges, with an increased reliance on statistical models, artificial intelligence, and experimental (kinetics, preliminary test reactions under microwave irradiation) or theoretical (quantum mechanics) a priori knowledge. Yet, the rationale for using superheated conditions has been slow to emerge, despite the growing emphasis on predictive methodologies.To fill this gap, this Account provides a concise yet comprehensive overview of superheated flow chemistry. Key concepts are illustrated with examples from our laboratory's research, as well as other relevant examples from the literature. These examples have been thoroughly studied to answer the main questions Why? At what cost? How? For what? The answers we provide will encourage educated and widespread adoption. The discussion begins with a demonstration of the various advantages arising from superheated flow chemistry. Different reactor alternatives suitable for high temperatures and pressures are then presented. Next, a clear workflow toward strategic adoption of superheated conditions is resorted either using Design of Experiments (DoE), microwave test chemistry, kinetics data, or Quantum Mechanics (QM). We provide rationalization for chemistries that are well suited for superheated conditions (e.g., additions to carbonyl functions, aromatic substitutions, as well as C-Y [Y = N, O, S, C, Br, Cl] heterolytic cleavages). Lastly, we bring the reader to a rational decision analysis toward superheated flow conditions. We believe this Account will become a reference guide for exploring extended chemical spaces, accelerating organic synthesis, and advancing molecular sciences.
Collapse
Affiliation(s)
- Pauline Bianchi
- Center
for Integrated Technology and Organic Synthesis, MolSys Research Unit, University of Liège, Allée du Six Août 13, 4000 Liège (Sart
Tilman), Belgium
| | - Jean-Christophe M. Monbaliu
- Center
for Integrated Technology and Organic Synthesis, MolSys Research Unit, University of Liège, Allée du Six Août 13, 4000 Liège (Sart
Tilman), Belgium
- WEL
Research Institute, Avenue
Pasteur 6, 1300 Wavre, Belgium
| |
Collapse
|
42
|
Voss J. Machine learning for accuracy in density functional approximations. J Comput Chem 2024; 45:1829-1845. [PMID: 38668453 DOI: 10.1002/jcc.27366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 02/16/2024] [Accepted: 03/25/2024] [Indexed: 07/21/2024]
Abstract
Machine learning techniques have found their way into computational chemistry as indispensable tools to accelerate atomistic simulations and materials design. In addition, machine learning approaches hold the potential to boost the predictive power of computationally efficient electronic structure methods, such as density functional theory, to chemical accuracy and to correct for fundamental errors in density functional approaches. Here, recent progress in applying machine learning to improve the accuracy of density functional and related approximations is reviewed. Promises and challenges in devising machine learning models transferable between different chemistries and materials classes are discussed with the help of examples applying promising models to systems far outside their training sets.
Collapse
Affiliation(s)
- Johannes Voss
- SUNCAT Center for Interface Science and Catalysis, SLAC National Accelerator Laboratory, Menlo Park, California, USA
| |
Collapse
|
43
|
Chaudhry I, Hu G, Ye H, Jensen L. Toward Modeling the Complexity of the Chemical Mechanism in SERS. ACS NANO 2024. [PMID: 39087679 DOI: 10.1021/acsnano.4c07198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/02/2024]
Abstract
Surface-enhanced Raman scattering (SERS) provides detailed information about the binding of molecules at interfaces and their interactions with the local environment due to the large enhancement of Raman scattering. This enhancement arises from a combination of the electromagnetic mechanism (EM) and chemical mechanism (CM). While it is commonly accepted that EM gives rise to most of the enhancement, large spectral changes originate from CM. To elucidate the rich information contained in SERS spectra about molecules at interfaces, a comprehensive understanding of the enhancement mechanisms is necessary. In this Perspective, we discuss the current understanding of the enhancement mechanisms and highlight their interplay in complex local environments. We will also discuss emerging areas where the development of computational and theoretical models is needed with specific attention given to how the CM contributes to the spectral changes. Future efforts in modeling should focus on overcoming the challenges presented in this review in order to capture the complexity of CM in SERS.
Collapse
Affiliation(s)
- Imran Chaudhry
- Department of Chemistry, The Pennsylvania State University, 104 Benkovic Building, University Park, Pennsylvania 16802, United States
| | - Gaohe Hu
- Department of Chemistry, The Pennsylvania State University, 104 Benkovic Building, University Park, Pennsylvania 16802, United States
| | - Hepeng Ye
- Department of Chemistry, The Pennsylvania State University, 104 Benkovic Building, University Park, Pennsylvania 16802, United States
| | - Lasse Jensen
- Department of Chemistry, The Pennsylvania State University, 104 Benkovic Building, University Park, Pennsylvania 16802, United States
| |
Collapse
|
44
|
Kwapiszewska K. Physicochemical Perspective of Biological Heterogeneity. ACS PHYSICAL CHEMISTRY AU 2024; 4:314-321. [PMID: 39069985 PMCID: PMC11274282 DOI: 10.1021/acsphyschemau.3c00079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 03/25/2024] [Accepted: 03/25/2024] [Indexed: 07/30/2024]
Abstract
The vast majority of chemical processes that govern our lives occur within living cells. At the core of every life process, such as gene expression or metabolism, are chemical reactions that follow the fundamental laws of chemical kinetics and thermodynamics. Understanding these reactions and the factors that govern them is particularly important for the life sciences. The physicochemical environment inside cells, which can vary between cells and organisms, significantly impacts various biochemical reactions and increases the extent of population heterogeneity. This paper discusses using physical chemistry approaches for biological studies, including methods for studying reactions inside cells and monitoring their conditions. The potential for development in this field and possible new research areas are highlighted. By applying physical chemistry methodology to biochemistry in vivo, we may gain new insights into biology, potentially leading to new ways of controlling biochemical reactions.
Collapse
Affiliation(s)
- Karina Kwapiszewska
- Institute of Physical Chemistry, Polish
Academy of Sciences, Kasprzaka 44/52, Warsaw 01-224, Poland
| |
Collapse
|
45
|
He X, Lu T, Rong C, Liu W, Ayers PW, Liu S. Energetic Information from Information-Theoretic Approach in Density Functional Theory as Quantitative Measures of Physicochemical Properties. J Chem Theory Comput 2024; 20:6049-6061. [PMID: 38995176 DOI: 10.1021/acs.jctc.4c00697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2024]
Abstract
The Hohenberg-Kohn theorem of density functional theory (DFT) stipulates that energy is a universal functional of electron density in the ground state, so energy can be thought of having encoded essential information for the density. Based on this, we recently proposed to quantify energetic information within the framework of information-theoretic approach (ITA) of DFT (J. Chem. Phys. 2022, 157, 101103). In this study, we systematically apply energetic information to a variety of chemical phenomena to validate the use of energetic information as quantitative measures of physicochemical properties. To that end, we employed six ITA quantities such as Shannon entropy and Fisher information for five energetic densities, yielding twenty-six viable energetic information quantities. Then, they are applied to correlate with physicochemical properties of molecular systems, including chemical bonding, conformational stability, intermolecular interactions, acidity, aromaticity, cooperativity, electrophilicity, nucleophilicity, and reactivity. Our results show that different quantities of energetic information often behave differently for different properties but a few of them, such as Shannon entropy of the total kinetic energy density and information gain of the Pauli energy density, stand out and strongly correlate with several properties across different categories of molecular systems. These results suggest that they can be employed as quantitative measures of physicochemical properties. This work not only enriches the body of our knowledge about the relationship between energy and information, but also provides scores of newly introduced explicit density functionals to quantify physicochemical properties, which can serve as robust features for building machine learning models in future studies.
Collapse
Affiliation(s)
- Xin He
- Institute of Frontier Chemistry, School of Chemistry and Chemical Engineering, Shandong University, Qingdao, Shandong 266237, China
| | - Tian Lu
- Beijing Kein Research Center for Natural Sciences, Beijing 100022, China
| | - Chunying Rong
- College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha, Hunan 410081, China
| | - Wenjian Liu
- Institute of Frontier Chemistry, School of Chemistry and Chemical Engineering, Shandong University, Qingdao, Shandong 266237, China
| | - Paul W Ayers
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, Ontario ONL8S, Canada
| | - Shubin Liu
- Research Computing Center, University of North Carolina, Chapel Hill, North Carolina 27599-3420, United States
- Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599-3290, United States
| |
Collapse
|
46
|
Wasilewski T, Kamysz W, Gębicki J. AI-Assisted Detection of Biomarkers by Sensors and Biosensors for Early Diagnosis and Monitoring. BIOSENSORS 2024; 14:356. [PMID: 39056632 PMCID: PMC11274923 DOI: 10.3390/bios14070356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 06/25/2024] [Accepted: 06/28/2024] [Indexed: 07/28/2024]
Abstract
The steady progress in consumer electronics, together with improvement in microflow techniques, nanotechnology, and data processing, has led to implementation of cost-effective, user-friendly portable devices, which play the role of not only gadgets but also diagnostic tools. Moreover, numerous smart devices monitor patients' health, and some of them are applied in point-of-care (PoC) tests as a reliable source of evaluation of a patient's condition. Current diagnostic practices are still based on laboratory tests, preceded by the collection of biological samples, which are then tested in clinical conditions by trained personnel with specialistic equipment. In practice, collecting passive/active physiological and behavioral data from patients in real time and feeding them to artificial intelligence (AI) models can significantly improve the decision process regarding diagnosis and treatment procedures via the omission of conventional sampling and diagnostic procedures while also excluding the role of pathologists. A combination of conventional and novel methods of digital and traditional biomarker detection with portable, autonomous, and miniaturized devices can revolutionize medical diagnostics in the coming years. This article focuses on a comparison of traditional clinical practices with modern diagnostic techniques based on AI and machine learning (ML). The presented technologies will bypass laboratories and start being commercialized, which should lead to improvement or substitution of current diagnostic tools. Their application in PoC settings or as a consumer technology accessible to every patient appears to be a real possibility. Research in this field is expected to intensify in the coming years. Technological advancements in sensors and biosensors are anticipated to enable the continuous real-time analysis of various omics fields, fostering early disease detection and intervention strategies. The integration of AI with digital health platforms would enable predictive analysis and personalized healthcare, emphasizing the importance of interdisciplinary collaboration in related scientific fields.
Collapse
Affiliation(s)
- Tomasz Wasilewski
- Department of Inorganic Chemistry, Faculty of Pharmacy, Medical University of Gdansk, Hallera 107, 80-416 Gdansk, Poland
| | - Wojciech Kamysz
- Department of Inorganic Chemistry, Faculty of Pharmacy, Medical University of Gdansk, Hallera 107, 80-416 Gdansk, Poland
| | - Jacek Gębicki
- Department of Process Engineering and Chemical Technology, Faculty of Chemistry, Gdansk University of Technology, Narutowicza 11/12, 80-233 Gdansk, Poland;
| |
Collapse
|
47
|
Stishenko P, McSloy A, Onat B, Hourahine B, Maurer RJ, Kermode JR, Logsdail A. Integrated workflows and interfaces for data-driven semi-empirical electronic structure calculations. J Chem Phys 2024; 161:012502. [PMID: 38958157 DOI: 10.1063/5.0209742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 06/07/2024] [Indexed: 07/04/2024] Open
Abstract
Modern software engineering of electronic structure codes has seen a paradigm shift from monolithic workflows toward object-based modularity. Software objectivity allows for greater flexibility in the application of electronic structure calculations, with particular benefits when integrated with approaches for data-driven analysis. Here, we discuss different approaches to create deep modular interfaces that connect big-data workflows and electronic structure codes and explore the diversity of use cases that they can enable. We present two such interface approaches for the semi-empirical electronic structure package, DFTB+. In one case, DFTB+ is applied as a library and provides data to an external workflow; in another, DFTB+receives data via external bindings and processes the information subsequently within an internal workflow. We provide a general framework to enable data exchange workflows for embedding new machine-learning-based Hamiltonians within DFTB+ or enabling deep integration of DFTB+ in multiscale embedding workflows. These modular interfaces demonstrate opportunities in emergent software and workflows to accelerate scientific discovery by harnessing existing software capabilities.
Collapse
Affiliation(s)
- Pavel Stishenko
- Cardiff Catalysis Institute, School of Chemistry, Cardiff University, Park Place, Cardiff CF10 3AT, United Kingdom
| | - Adam McSloy
- Warwick Centre for Predictive Modelling, School of Engineering, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Berk Onat
- Warwick Centre for Predictive Modelling, School of Engineering, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Ben Hourahine
- SUPA, Department of Physics, John Anderson Building, University of Strathclyde, 107 Rottenrow, Glasgow G4 0NG, United Kingdom
| | - Reinhard J Maurer
- Department of Chemistry, University of Warwick, Coventry CV4 7AL, United Kingdom and Department of Physics, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - James R Kermode
- Warwick Centre for Predictive Modelling, School of Engineering, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Andrew Logsdail
- Cardiff Catalysis Institute, School of Chemistry, Cardiff University, Park Place, Cardiff CF10 3AT, United Kingdom
| |
Collapse
|
48
|
Yang L, Guo Q, Zhang L. AI-assisted chemistry research: a comprehensive analysis of evolutionary paths and hotspots through knowledge graphs. Chem Commun (Camb) 2024; 60:6977-6987. [PMID: 38910536 DOI: 10.1039/d4cc01892c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/25/2024]
Abstract
Artificial intelligence (AI) offers transformative potential for chemical research through its ability to optimize reactions and processes, enhance energy efficiency, and reduce waste. AI-assisted chemical research (AI + chem) has become a global hotspot. To better understand the current research status of "AI + chem", this study conducted a scientific bibliometric investigation using CiteSpace. The web of science core collection was utilized to retrieve original articles related to "AI + chem" published from 2000 to 2024. The obtained data allowed for the visualization of the knowledge background, current research status, and latest knowledge structure of "AI + chem". The "AI + chem" has entered a stage of explosive growth, and the number of papers will maintain long-term high-speed growth. This article systematically analyzes the latest progress in "AI + chem" and objectively predicts future trends, including molecular design, reaction prediction, materials design, drug design, and quantum chemistry. The outcomes of this study will provide readers with a comprehensive understanding of the overall landscape of "AI + chem".
Collapse
Affiliation(s)
- Lin Yang
- School of Intellectual Property, Dalian University of Technology, Dalian 116024, Liaoning, P. R. China
| | - Qingle Guo
- School of Intellectual Property, Dalian University of Technology, Dalian 116024, Liaoning, P. R. China
| | - Lijing Zhang
- School of Chemistry, Dalian University of Technology, Dalian 116024, Liaoning, P. R. China.
| |
Collapse
|
49
|
Lalith N, Singh AR, Gauthier JA. The Importance of Reaction Energy in Predicting Chemical Reaction Barriers with Machine Learning Models. Chemphyschem 2024; 25:e202300933. [PMID: 38517585 DOI: 10.1002/cphc.202300933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 03/21/2024] [Accepted: 03/22/2024] [Indexed: 03/24/2024]
Abstract
Improving our fundamental understanding of complex heterocatalytic processes increasingly relies on electronic structure simulations and microkinetic models based on calculated energy differences. In particular, calculation of activation barriers, usually achieved through compute-intensive saddle point search routines, remains a serious bottleneck in understanding trends in catalytic activity for highly branched reaction networks. Although the well-known Brønsted-Evans-Polyani (BEP) scaling - a one-feature linear regression model - has been widely applied in such microkinetic models, they still rely on calculated reaction energies and may not generalize beyond a single facet on a single class of materials, e. g., a terrace sites on transition metals. For highly branched and energetically shallow reaction networks, such as electrochemical CO2 reduction or wastewater remediation, calculating even reaction energies on many surfaces can become computationally intractable due to the combinatorial explosion of states that must be considered. Here, we investigate the feasibility of activation barrier prediction without knowledge of the reaction energy using linear and nonlinear machine learning (ML) models trained on a new database of over 500 dehydrogenation activation barriers. We also find that inclusion of the reaction energy significantly improves both classes of ML models, but complex nonlinear models can achieve performance similar to the simplest BEP scaling when predicting activation barriers on new systems. Additionally, inclusion of the reaction energy significantly improves generalizability to new systems beyond the training set. Our results suggest that the reaction energy is a critical feature to consider when building models to predict activation barriers, indicating that efforts to reliably predict reaction energies through, e. g., the Open Catalyst Project and others, will be an important route to effective model development for more complex systems.
Collapse
Affiliation(s)
- Nithin Lalith
- Department of Chemical Engineering, Texas Tech University, Lubbock, TX 79409, USA
| | | | - Joseph A Gauthier
- Department of Chemical Engineering, Texas Tech University, Lubbock, TX 79409, USA
| |
Collapse
|
50
|
Aldossary A, Campos-Gonzalez-Angulo JA, Pablo-García S, Leong SX, Rajaonson EM, Thiede L, Tom G, Wang A, Avagliano D, Aspuru-Guzik A. In Silico Chemical Experiments in the Age of AI: From Quantum Chemistry to Machine Learning and Back. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2402369. [PMID: 38794859 DOI: 10.1002/adma.202402369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 04/28/2024] [Indexed: 05/26/2024]
Abstract
Computational chemistry is an indispensable tool for understanding molecules and predicting chemical properties. However, traditional computational methods face significant challenges due to the difficulty of solving the Schrödinger equations and the increasing computational cost with the size of the molecular system. In response, there has been a surge of interest in leveraging artificial intelligence (AI) and machine learning (ML) techniques to in silico experiments. Integrating AI and ML into computational chemistry increases the scalability and speed of the exploration of chemical space. However, challenges remain, particularly regarding the reproducibility and transferability of ML models. This review highlights the evolution of ML in learning from, complementing, or replacing traditional computational chemistry for energy and property predictions. Starting from models trained entirely on numerical data, a journey set forth toward the ideal model incorporating or learning the physical laws of quantum mechanics. This paper also reviews existing computational methods and ML models and their intertwining, outlines a roadmap for future research, and identifies areas for improvement and innovation. Ultimately, the goal is to develop AI architectures capable of predicting accurate and transferable solutions to the Schrödinger equation, thereby revolutionizing in silico experiments within chemistry and materials science.
Collapse
Affiliation(s)
- Abdulrahman Aldossary
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | | | - Sergio Pablo-García
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
| | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Ella Miray Rajaonson
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Luca Thiede
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Gary Tom
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Andrew Wang
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Davide Avagliano
- Chimie ParisTech, PSL University, CNRS, Institute of Chemistry for Life and Health Sciences (iCLeHS UMR 8060), Paris, F-75005, France
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
- Department of Materials Science & Engineering, University of Toronto, 184 College St., Toronto, ON, M5S 3E4, Canada
- Department of Chemical Engineering & Applied Chemistry, University of Toronto, 200 College St., Toronto, ON, M5S 3E5, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), 66118 University Ave., Toronto, M5G 1M1, Canada
- Acceleration Consortium, 80 St George St, Toronto, M5S 3H6, Canada
| |
Collapse
|