1
|
Chakraborty R, de Moraes MMF, Boguslawski K, Nowak A, Świerczyński J, Tecmer P. Toward Reliable Dipole Moments without Single Excitations: The Role of Orbital Rotations and Dynamical Correlation. J Chem Theory Comput 2024; 20:4689-4702. [PMID: 38809012 PMCID: PMC11171297 DOI: 10.1021/acs.jctc.4c00471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 05/15/2024] [Accepted: 05/17/2024] [Indexed: 05/30/2024]
Abstract
The dipole moment is a crucial molecular property linked to a molecular system's bond polarity and overall electronic structure. To that end, the electronic dipole moment, which results from the electron density of a system, is often used to assess the accuracy and reliability of new electronic structure methods. This work analyses electronic dipole moments computed with the pair coupled cluster doubles (pCCD) ansätze and its linearized coupled cluster (pCCD-LCC) corrections using the canonical Hartree-Fock and pCCD-optimized (localized) orbital bases. The accuracy of pCCD-based dipole moments is assessed against experimental and CCSD(T) reference values using relaxed and unrelaxed density matrices and different basis set sizes. Our test set comprises molecules of various bonding patterns and electronic structures, exposing pCCD-based methods to a wide range of electron correlation effects. Additionally, we investigate the performance of pCCD-in-DFT dipole moments of some model complexes. Finally, our work indicates the importance of orbital relaxation in the pCCD model and shows the limitations of the linearized couple cluster corrections in predicting electronic dipole moments of multiple-bonded systems. Most importantly, pCCD with a linearized CCD correction can reproduce the dipole moment surfaces in singly bonded molecules, which are comparable to the multireference ones.
Collapse
Affiliation(s)
- Rahul Chakraborty
- Institute
of Physics, Faculty of Physics, Astronomy, and Informatics, Nicolaus Copernicus University in Toruń, Grudziadzka 5, 87-100 Toruń, Poland
| | - Matheus Morato F. de Moraes
- Institute
of Physics, Faculty of Physics, Astronomy, and Informatics, Nicolaus Copernicus University in Toruń, Grudziadzka 5, 87-100 Toruń, Poland
| | - Katharina Boguslawski
- Institute
of Physics, Faculty of Physics, Astronomy, and Informatics, Nicolaus Copernicus University in Toruń, Grudziadzka 5, 87-100 Toruń, Poland
| | - Artur Nowak
- Institute
of Physics, Faculty of Physics, Astronomy, and Informatics, Nicolaus Copernicus University in Toruń, Grudziadzka 5, 87-100 Toruń, Poland
| | - Julian Świerczyński
- Institute
of Engineering and Technology, Faculty of Physics, Astronomy, and
Informatics, Nicolaus Copernicus University
in Toruń, Grudzia̧dzka
5, 87-100 Toruń, Poland
| | - Paweł Tecmer
- Institute
of Physics, Faculty of Physics, Astronomy, and Informatics, Nicolaus Copernicus University in Toruń, Grudziadzka 5, 87-100 Toruń, Poland
| |
Collapse
|
2
|
García-Andrade X, García Tahoces P, Pérez-Ríos J, Martínez Núñez E. Barrier Height Prediction by Machine Learning Correction of Semiempirical Calculations. J Phys Chem A 2023; 127:2274-2283. [PMID: 36877614 PMCID: PMC10845151 DOI: 10.1021/acs.jpca.2c08340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 02/19/2023] [Indexed: 03/07/2023]
Abstract
Different machine learning (ML) models are proposed in the present work to predict density functional theory-quality barrier heights (BHs) from semiempirical quantum mechanical (SQM) calculations. The ML models include a multitask deep neural network, gradient-boosted trees by means of the XGBoost interface, and Gaussian process regression. The obtained mean absolute errors are similar to those of previous models considering the same number of data points. The ML corrections proposed in this paper could be useful for rapid screening of the large reaction networks that appear in combustion chemistry or in astrochemistry. Finally, our results show that 70% of the features with the highest impact on model output are bespoke predictors. This custom-made set of predictors could be employed by future Δ-ML models to improve the quantitative prediction of other reaction properties.
Collapse
Affiliation(s)
| | - Pablo García Tahoces
- Department
of Electronics and Computer Science, University
of Santiago de Compostela, Santiago de Compostela 15782, Spain
| | - Jesús Pérez-Ríos
- Department
of Physics, Stony Brook University, Stony Brook, New York 11794, United States
- Institute
for Advanced Computational Science, Stony
Brook University, Stony
Brook, New York 11794-3800, United States
| | - Emilio Martínez Núñez
- Department
of Physical Chemistry, University of Santiago
de Compostela, Santiago
de Compostela 15782, Spain
| |
Collapse
|
3
|
Liu X, McKemmish L, Pérez-Ríos J. The performance of CCSD(T) for the calculation of dipole moments in diatomics. Phys Chem Chem Phys 2023; 25:4093-4104. [PMID: 36651174 DOI: 10.1039/d2cp05060a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
This work analyzes the accuracy of the coupled cluster with single, double, and perturbative triple excitation [CCSD(T)] method for predicting dipole moments. In particular, we benchmark CCSD(T) predictions for the equilibrium bond length, vibrational frequency, and dipole moment versus accurate experimental data. As a result, we find that CCSD(T) leads to accurate dipole moments. However, in some cases, it disagrees with the experimental values, and the disagreement can not be satisfactorily explained via relativistic or multi-reference effects. Therefore, our results indicate that benchmark studies for energy and geometry properties do not accurately describe other electron density magnitudes.
Collapse
Affiliation(s)
- Xiangyue Liu
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, 14195 Berlin, Germany
| | - Laura McKemmish
- School of Chemistry, UNSW Sydney, Sydney, NSW 2052, Australia
| | - Jesús Pérez-Ríos
- Department of Physics and Astronomy, Stony Brook University, Stony Brook 11794, New York, USA. .,Institute for Advanced Computational Science, Stony Brook University, Stony Brook, NY 11794-3800, USA
| |
Collapse
|
4
|
Synthesis of Synthetic Musks: A Theoretical Study Based on the Relationships between Structure and Properties at Molecular Scale. Int J Mol Sci 2023; 24:ijms24032768. [PMID: 36769089 PMCID: PMC9917709 DOI: 10.3390/ijms24032768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Revised: 01/28/2023] [Accepted: 01/30/2023] [Indexed: 02/04/2023] Open
Abstract
Synthetic musks (SMs), as an indispensable odor additive, are widely used in various personal care products. However, due to their physico-chemical properties, SMs were detected in various environmental media, even in samples from arctic regions, leading to severe threats to human health (e.g., abortion risk). Environmentally friendly and functionally improved SMs have been theoretically designed in previous studies. However, the synthesizability of these derivatives has barely been proven. Thus, this study developed a method to verify the synthesizability of previously designed SM derivatives using machine learning, 2D-QSAR, 3D-QSAR, and high-throughput density functional theory in order to screen for synthesizable, high-performance (odor sensitivity), and environmentally friendly SM derivatives. In this study, three SM derivatives (i.e., D52, D37, and D25) were screened and recommended due to their good performances (i.e., high synthesizability and odor sensitivity; low abortion risk; and bioaccumulation ability in skin keratin). In addition, the synthesizability mechanism of SM derivatives was also analyzed. Results revealed that high intramolecular hydrogen bond strength, electrostatic interaction, qH+ value, energy gap, and low EHOMO would lead to a higher synthesizability of SMs and their derivatives. This study broke the synthesizability bottleneck of theoretically designed environment-friendly SM derivatives and advanced the mechanism of screening functional derivatives.
Collapse
|
5
|
Li S, Liu Y, Chen D, Jiang Y, Nie Z, Pan F. Encoding the atomic structure for machine learning in materials science. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1558] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Shunning Li
- School of Advanced Materials Peking University, Shenzhen Graduate School Shenzhen China
| | - Yuanji Liu
- School of Advanced Materials Peking University, Shenzhen Graduate School Shenzhen China
| | - Dong Chen
- School of Advanced Materials Peking University, Shenzhen Graduate School Shenzhen China
| | - Yi Jiang
- School of Advanced Materials Peking University, Shenzhen Graduate School Shenzhen China
| | - Zhiwei Nie
- School of Advanced Materials Peking University, Shenzhen Graduate School Shenzhen China
| | - Feng Pan
- School of Advanced Materials Peking University, Shenzhen Graduate School Shenzhen China
| |
Collapse
|
6
|
Lykhin AO, Truhlar DG, Gagliardi L. Dipole Moment Calculations Using Multiconfiguration Pair-Density Functional Theory and Hybrid Multiconfiguration Pair-Density Functional Theory. J Chem Theory Comput 2021; 17:7586-7601. [PMID: 34793166 DOI: 10.1021/acs.jctc.1c00915] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The dipole moment is the molecular property that most directly indicates molecular polarity. The accuracy of computed dipole moments depends strongly on the quality of the calculated electron density, and the breakdown of single-reference methods for strongly correlated systems can lead to poor predictions of the dipole moments in those cases. Here, we derive the analytical expression for obtaining the electric dipole moment by multiconfiguration pair-density functional theory (MC-PDFT), and we assess the accuracy of MC-PDFT for predicting dipole moments at equilibrium and nonequilibrium geometries. We show that MC-PDFT dipole moment curves have reasonable behavior even for stretched geometries, and they significantly improve upon the CASSCF results by capturing more electron correlation. The analysis of a dataset consisting of 18 first-row transition-metal diatomics and 6 main-group polyatomic molecules with a multireference character suggests that MC-PDFT and its hybrid extension (HMC-PDFT) perform comparably to CASPT2 and MRCISD+Q methods and have a mean unsigned deviation of 0.2-0.3 D with respect to the best available dipole moment reference values. We explored the dependence of the predicted dipole moments upon the choice of the on-top density functional and active space, and we recommend the tPBE and hybrid tPBE0 on-top choices for the functionals combined with the moderate correlated-participating-orbitals scheme for selecting the active space. With these choices, the mean unsigned deviations (in debyes) of the calculated equilibrium dipole moments from the best estimates are 0.77 for CASSCF, 0.29 for MC-PDFT, 0.24 for HMC-PDFT, 0.28 for CASPT2, and 0.25 for MRCISD+Q. These results are encouraging because the computational cost of MC-PDFT or HMC-PDFT is largely reduced compared to the CASPT2 and MRCISD+Q methods.
Collapse
Affiliation(s)
- Aleksandr O Lykhin
- Department of Chemistry, Pritzker School of Molecular Engineering, The James Franck Institute and Chicago Center for Theoretical Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| | - Donald G Truhlar
- Department of Chemistry, Chemical Theory Center, and Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Laura Gagliardi
- Department of Chemistry, Pritzker School of Molecular Engineering, The James Franck Institute and Chicago Center for Theoretical Chemistry, The University of Chicago, Chicago, Illinois 60637, United States.,Argonne National Laboratory, Lemont, Illinois 60439, United States
| |
Collapse
|
7
|
Liu X, Meijer G, Pérez-Ríos J. On the relationship between spectroscopic constants of diatomic molecules: a machine learning approach. RSC Adv 2021; 11:14552-14561. [PMID: 35423993 PMCID: PMC8697859 DOI: 10.1039/d1ra02061g] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 04/01/2021] [Indexed: 11/21/2022] Open
Abstract
Through a machine learning approach, we show that the equilibrium distance, harmonic vibrational frequency and binding energy of diatomic molecules are related, independently of the nature of the bond of a molecule; they depend solely on the group and period of the constituent atoms. As a result, we show that by employing the group and period of the atoms that form a molecule, the spectroscopic constants are predicted with an accuracy of <5%, whereas for the A-excited electronic state it is needed to include other atomic properties leading to an accuracy of <11%. Through a machine learning approach, we show that the equilibrium distance, harmonic vibrational frequency and the binding energy of diatomic molecules are universally related, independently of the nature of the bond of a molecule.![]()
Collapse
Affiliation(s)
- Xiangyue Liu
- Fritz-Haber-Institut der Max-Planck-Gesellschaft Faradayweg 4-6 14195 Berlin Germany
| | - Gerard Meijer
- Fritz-Haber-Institut der Max-Planck-Gesellschaft Faradayweg 4-6 14195 Berlin Germany
| | - Jesús Pérez-Ríos
- Fritz-Haber-Institut der Max-Planck-Gesellschaft Faradayweg 4-6 14195 Berlin Germany
| |
Collapse
|
8
|
Cretu MT, Pérez-Ríos J. Predicting second virial coefficients of organic and inorganic compounds using Gaussian process regression. Phys Chem Chem Phys 2021; 23:2891-2898. [PMID: 33475124 DOI: 10.1039/d0cp05509c] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
We show that by using intuitive and accessible molecular features it is possible to predict the temperature-dependent second virial coefficient of organic and inorganic compounds with Gaussian process regression. In particular, we built a low dimensional representation of features based on intrinsic molecular properties, topology and physical properties relevant for the characterization of molecule-molecule interactions. The featurization was used to predict second virial coefficients in the interpolative regime with a relative error ⪅1% and to extrapolate the prediction to temperatures outside of the training range for each compound in the dataset with a relative error of 2.1%. Additionally, the model's predictive abilities were extended to organic molecules unseen in the training process, yielding a prediction with a relative error of 2.7%. Test molecules must be well-represented in the training set by instances of their families, which are high in variety. The method shows a generally better performance when compared to several semi-empirical procedures employed in the prediction of the quantity. Therefore, apart from being robust, the present Gaussian process regression model is extensible to a variety of organic and inorganic compounds.
Collapse
Affiliation(s)
- Miruna T Cretu
- Department of Chemistry, Imperial College London, London SW7 2AZ, UK and Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, 14195 Berlin, Germany.
| | - Jesús Pérez-Ríos
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Faradayweg 4-6, 14195 Berlin, Germany.
| |
Collapse
|