1
|
Shaban Tameh M, Coropceanu V, Purcell TAR, Brédas JL. Prediction of the Infrared Absorbance Intensities and Frequencies of Hydrocarbons: A Message Passing Neural Network Approach. J Phys Chem A 2024; 128:9695-9706. [PMID: 39466724 DOI: 10.1021/acs.jpca.4c06745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/30/2024]
Abstract
Accurately and efficiently predicting the infrared (IR) spectra of a molecule can provide insights into the structure-properties relationships of molecular species, which has led to a proliferation of machine learning tools designed for this purpose. However, earlier studies have focused primarily on obtaining normalized IR spectra, which limits their potential for a comprehensive analysis of molecular behavior in the IR range. For instance, to fully understand and predict the optical properties, such as the transparency characteristics, it is necessary to predict the molar absorptivity IR spectra instead. Here, we propose a graph-based communicative message passing neural network algorithm that can predict both the peak positions and absolute intensities corresponding to density functional theory calculated molar absorptivities in the IR domain. By modifying existing spectral loss functions, we show that our method is able to predict with DFT-accuracy level the IR molar absorptivities of a series of hydrocarbons containing up to ten carbon atoms and apply the model to a set of larger molecules. We also compare the predicted spectra with those generated by the direct message passing neural network. The results suggest that both algorithms demonstrate similar predictive capabilities for hydrocarbons, indicating that either model could be effectively used in future research on spectral prediction for such systems.
Collapse
Affiliation(s)
- Maliheh Shaban Tameh
- Department of Chemistry and Biochemistry, The University of Arizona, Tucson, Arizona 85721-0041, United States
| | - Veaceslav Coropceanu
- Department of Chemistry and Biochemistry, The University of Arizona, Tucson, Arizona 85721-0041, United States
| | - Thomas A R Purcell
- Department of Chemistry and Biochemistry, The University of Arizona, Tucson, Arizona 85721-0041, United States
| | - Jean-Luc Brédas
- Department of Chemistry and Biochemistry, The University of Arizona, Tucson, Arizona 85721-0041, United States
| |
Collapse
|
2
|
Khanifaev J, Schrader T, Perlt E. Machine-learning to predict anharmonic frequencies: a study of models and transferability. Phys Chem Chem Phys 2024; 26:23495-23502. [PMID: 39222042 DOI: 10.1039/d4cp01789g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/04/2024]
Abstract
With more and more accurate electronic structure methods at hand, the inclusion of anharmonic effects in the post-processing of such data towards thermochemical properties is the next step. In this context, the description of anharmonicity has been an important topic of physical chemistry and chemical physics for a long time. In this study, anharmonic frequencies of various hydrogen-halides and halogenated hydrocarbon molecular clusters are calculated using harmonic as well as explicitly anharmonic methods, i.e., normal mode analysis and vibrational self-consistent field. Simple harmonic model based descriptors were used to predict anharmonic frequencies via multilinear regression and gradient boosting regression. Gradient boosting regression is capable of predicting reliable anharmonic data and even the simple multilinear regression model yields reasonable predictions that can account for mode-to-mode couplings. Moreover, the transferability to unseen chemical systems is assessed and it is confirmed that the machine-learned models can be applied to larger, unseen molecules.
Collapse
Affiliation(s)
| | - Tim Schrader
- Friedrich Schiller University Jena, Löbdergraben 32, 07743 Jena, Germany.
| | - Eva Perlt
- Friedrich Schiller University Jena, Löbdergraben 32, 07743 Jena, Germany.
| |
Collapse
|
3
|
Stienstra CMK, Hebert L, Thomas P, Haack A, Guo J, Hopkins WS. Graphormer-IR: Graph Transformers Predict Experimental IR Spectra Using Highly Specialized Attention. J Chem Inf Model 2024; 64:4613-4629. [PMID: 38845400 DOI: 10.1021/acs.jcim.4c00378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/25/2024]
Abstract
Infrared (IR) spectroscopy is an important analytical tool in various chemical and forensic domains and a great deal of effort has gone into developing in silico methods for predicting experimental spectra. A key challenge in this regard is generating highly accurate spectra quickly to enable real-time feedback between computation and experiment. Here, we employ Graphormer, a graph neural network (GNN) transformer, to predict IR spectra using only simplified molecular-input line-entry system (SMILES) strings. Our data set includes 53,528 high-quality spectra, measured in five different experimental media (i.e., phases), for molecules containing the elements H, C, N, O, F, Si, S, P, Cl, Br, and I. When using only atomic numbers for node encodings, Graphormer-IR achieved a mean test spectral information similarity (SISμ) value of 0.8449 ± 0.0012 (n = 5), which surpasses that the current state-of-the-art model Chemprop-IR (SISμ = 0.8409 ± 0.0014, n = 5) with only 36% of the encoded information. Augmenting node embeddings with additional node-level descriptors in learned embeddings generated through a multilayer perceptron improves scores to SISμ = 0.8523 ± 0.0006, a total improvement of 19.7σ (t = 19). These improved scores show how Graphormer-IR excels in capturing long-range interactions like hydrogen bonding, anharmonic peak positions in experimental spectra, and stretching frequencies of uncommon functional groups. Scaling our architecture to 210 attention heads demonstrates specialist-like behavior for distinct IR frequencies that improves model performance. Our model utilizes novel architectures, including a global node for phase encoding, learned node feature embeddings, and a one-dimensional (1D) smoothing convolutional neural network (CNN). Graphormer-IR's innovations underscore its value over traditional message-passing neural networks (MPNNs) due to its expressive embeddings and ability to capture long-range intramolecular relationships.
Collapse
Affiliation(s)
- Cailum M K Stienstra
- Department of Chemistry, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Liam Hebert
- Cheriton School of Computer Science, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Patrick Thomas
- Department of Chemistry, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Alexander Haack
- Department of Chemistry, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Jason Guo
- Department of Chemistry, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - W Scott Hopkins
- Department of Chemistry, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
- Watermine Innovation, Waterloo, Ontario N0B 2T0, Canada
- Centre for Eye and Vision Research, Hong Kong Science Park, New Territories 999077, Hong Kong
| |
Collapse
|
4
|
Khanifaev J, Schrader T, Perlt E. The effect of machine learning predicted anharmonic frequencies on thermodynamic properties of fluid hydrogen fluoride. J Chem Phys 2024; 160:124302. [PMID: 38516969 DOI: 10.1063/5.0195386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Accepted: 03/02/2024] [Indexed: 03/23/2024] Open
Abstract
Anharmonic effects play a crucial role in determining thermochemical properties of liquids and gases. For such extended phases, the inclusion of anharmonicity in reliable electronic structure methods is computationally extremely demanding, and hence, anharmonic effects are often lacking in thermochemical calculations. In this study, we apply the quantum cluster equilibrium method to transfer density functional theory calculations at the cluster level to the macroscopic, liquid, and gaseous phase of hydrogen fluoride. This allows us to include anharmonicity, either via vibrational self-consistent field calculations for smaller clusters or using a regression model for larger clusters. We obtain the structural composition of the fluid phases in terms of the population of different clusters as well as isobaric heat capacities as an example for thermodynamic properties. We study the role of anharmonicities for these analyses and observe that, in particular, the dominating structural motifs are rather sensitive to the anharmonicity in vibrational frequencies. The regression model proves to be a promising way to get access to anharmonic features, and the extension to more sophisticated machine-learning models is promising.
Collapse
Affiliation(s)
- Jamoliddin Khanifaev
- Otto Schott Institute of Materials Research, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Tim Schrader
- Otto Schott Institute of Materials Research, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Eva Perlt
- Otto Schott Institute of Materials Research, Friedrich Schiller University Jena, 07743 Jena, Germany
| |
Collapse
|
5
|
Gromoff Q, Benzo P, Saidi WA, Andolina CM, Casanove MJ, Hungria T, Barre S, Benoit M, Lam J. Exploring the formation of gold/silver nanoalloys with gas-phase synthesis and machine-learning assisted simulations. NANOSCALE 2023; 16:384-393. [PMID: 38063839 DOI: 10.1039/d3nr04471h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2023]
Abstract
While nanoalloys are of paramount scientific and practical interest, the main processes leading to their formation are still poorly understood. Key structural features in the alloy systems, including the crystal phase, chemical ordering, and morphology, are challenging to control at the nanoscale, making it difficult to extend their use to industrial applications. In this contribution, we focus on the gold/silver system that has two of the most prevalent noble metals and combine experiments with simulations to uncover the formation mechanisms at the atomic level. Nanoparticles were produced using a state-of-the-art inert-gas aggregation source and analyzed using transmission electron microscopy and energy-dispersive X-ray spectroscopy. Machine-learning-assisted molecular dynamics simulations were employed to model the crystallization process from liquid droplets to nanocrystals. Our study finds a preponderance of nanoparticles with five-fold symmetric morphology, including icosahedra and decahedra which is consistent with previous results on mono-metallic nanoparticles. However, we observed that gold atoms, rather than silver atoms, segregate at the surface of the obtained nanoparticles for all the considered alloy compositions. These segregation tendencies are in contrast to previous studies and have consequences on the crystallization dynamics and the subsequent crystal ordering. We finally showed that the underpinning of this surprising segregation dynamics is due to charge transfer and electrostatic interactions rather than surface energy considerations.
Collapse
Affiliation(s)
- Quentin Gromoff
- CEMES, CNRS and Université de Toulouse, 29 rue Jeanne Marvig, 31055 Toulouse Cedex, France
| | - Patrizio Benzo
- CEMES, CNRS and Université de Toulouse, 29 rue Jeanne Marvig, 31055 Toulouse Cedex, France
| | - Wissam A Saidi
- National Energy Technology Laboratory, United States Department of Energy, Pittsburgh, PA 15236, USA
- Department of Mechanical Engineering and Materials Science, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Christopher M Andolina
- Department of Mechanical Engineering and Materials Science, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Marie-José Casanove
- CEMES, CNRS and Université de Toulouse, 29 rue Jeanne Marvig, 31055 Toulouse Cedex, France
| | - Teresa Hungria
- Centre de MicroCaractérisation Raimond Castaing, Université de Toulouse, 3 rue Caroline Aigle, F-31400 Toulouse, France
| | - Sophie Barre
- CEMES, CNRS and Université de Toulouse, 29 rue Jeanne Marvig, 31055 Toulouse Cedex, France
| | - Magali Benoit
- CEMES, CNRS and Université de Toulouse, 29 rue Jeanne Marvig, 31055 Toulouse Cedex, France
| | - Julien Lam
- CEMES, CNRS and Université de Toulouse, 29 rue Jeanne Marvig, 31055 Toulouse Cedex, France
- Univ. Lille, CNRS, INRA, ENSCL, UMR 8207, UMET, Unité Matériaux et Transformations, F 59000 Lille, France.
| |
Collapse
|
6
|
Tang Z, Bromley ST, Hammer B. A machine learning potential for simulating infrared spectra of nanosilicate clusters. J Chem Phys 2023; 158:2895243. [PMID: 37290080 DOI: 10.1063/5.0150379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 05/23/2023] [Indexed: 06/10/2023] Open
Abstract
The use of machine learning (ML) in chemical physics has enabled the construction of interatomic potentials having the accuracy of ab initio methods and a computational cost comparable to that of classical force fields. Training an ML model requires an efficient method for the generation of training data. Here, we apply an accurate and efficient protocol to collect training data for constructing a neural network-based ML interatomic potential for nanosilicate clusters. Initial training data are taken from normal modes and farthest point sampling. Later on, the set of training data is extended via an active learning strategy in which new data are identified by the disagreement between an ensemble of ML models. The whole process is further accelerated by parallel sampling over structures. We use the ML model to run molecular dynamics simulations of nanosilicate clusters with various sizes, from which infrared spectra with anharmonicity included can be extracted. Such spectroscopic data are needed for understanding the properties of silicate dust grains in the interstellar medium and in circumstellar environments.
Collapse
Affiliation(s)
- Zeyuan Tang
- Center for Interstellar Catalysis, Department of Physics and Astronomy, Aarhus University, Ny Munkegade 120, Aarhus C 8000, Denmark
| | - Stefan T Bromley
- Departament de Ciència de Materials i Química Física and Institut de Química Teòrica i Computatcional (IQTCUB), Universitat de Barcelona, c/Martí i Franquès 1-11, 08028 Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluis Companys 23, 08010 Barcelona, Spain
| | - Bjørk Hammer
- Center for Interstellar Catalysis, Department of Physics and Astronomy, Aarhus University, Ny Munkegade 120, Aarhus C 8000, Denmark
| |
Collapse
|
7
|
Bac S, Patra A, Kron KJ, Mallikarjun Sharada S. Recent Advances toward Efficient Calculation of Higher Nuclear Derivatives in Quantum Chemistry. J Phys Chem A 2022; 126:7795-7805. [DOI: 10.1021/acs.jpca.2c05459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Selin Bac
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California90089, United States
| | - Abhilash Patra
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California90089, United States
| | - Kareesa J. Kron
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California90089, United States
| | - Shaama Mallikarjun Sharada
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California90089, United States
- Department of Chemistry, University of Southern California, Los Angeles, California90089, United States
| |
Collapse
|
8
|
Kuntz D, Wilson AK. Machine learning, artificial intelligence, and chemistry: how smart algorithms are reshaping simulation and the laboratory. PURE APPL CHEM 2022. [DOI: 10.1515/pac-2022-0202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
Machine learning and artificial intelligence are increasingly gaining in prominence through image analysis, language processing, and automation, to name a few applications. Machine learning is also making profound changes in chemistry. From revisiting decades-old analytical techniques for the purpose of creating better calibration curves, to assisting and accelerating traditional in silico simulations, to automating entire scientific workflows, to being used as an approach to deduce underlying physics of unexplained chemical phenomena, machine learning and artificial intelligence are reshaping chemistry, accelerating scientific discovery, and yielding new insights. This review provides an overview of machine learning and artificial intelligence from a chemist’s perspective and focuses on a number of examples of the use of these approaches in computational chemistry and in the laboratory.
Collapse
Affiliation(s)
- David Kuntz
- Department of Chemistry , University of North Texas , Denton , TX 76201 , USA
| | - Angela K. Wilson
- Department of Chemistry , Michigan State University , East Lansing , MI 48824 , USA
| |
Collapse
|
9
|
Calvo F, Simon A, Parneix P, Falvo C, Dubosq C. Infrared Spectroscopy of Chemically Diverse Carbon Clusters: A Data-Driven Approach. J Phys Chem A 2021; 125:5509-5518. [PMID: 34138562 DOI: 10.1021/acs.jpca.1c03368] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Carbon clusters exhibit a broad diversity of topologies and shapes, encompassing fullerene-like cages, graphene-like flakes, and more disordered pretzel-like and branched structures. Here, we examine computationally their infrared spectra in relation with these structures from a statistical perspective. Individual spectra for broad samples of isomers were determined by means of the self-consistent charge density functional-based tight-binding method, and an interpolation scheme is designed to reproduce the spectral features by regression on a much smaller subset of the sample. This interpolation proceeds by encoding the structures using appropriate descriptors and selecting them through principal component analysis, Gaussian regression or inverse distance weighting providing the nonlinear weighting functions. Metric learning is employed to reduce the global error on a preselected testing set. The interpolated spectra satisfactorily reproduce the specific spectral features and their dependence on the size and shape, enabling quantitative prediction away from the testing set. Finally, the classification of structures within the four proposed families is critically discussed through a statistical analysis of the sample based on iterative label spreading.
Collapse
Affiliation(s)
- Florent Calvo
- Univ. Grenoble Alpes, CNRS, LiPhy, 38000 Grenoble, France
| | - Aude Simon
- Laboratoire de Chimie et Physique Quantiques LCPQ/FeRMI, UMR5626, Université de Toulouse (UPS) and CNRS, 31062 Toulouse, France
| | - Pascal Parneix
- Université Paris-Saclay, CNRS, Institut des Sciences Moléculaires d'Orsay, 91405 Orsay, France
| | - Cyril Falvo
- Univ. Grenoble Alpes, CNRS, LiPhy, 38000 Grenoble, France.,Université Paris-Saclay, CNRS, Institut des Sciences Moléculaires d'Orsay, 91405 Orsay, France
| | - Clément Dubosq
- Laboratoire de Chimie et Physique Quantiques LCPQ/FeRMI, UMR5626, Université de Toulouse (UPS) and CNRS, 31062 Toulouse, France
| |
Collapse
|
10
|
McGill C, Forsuelo M, Guan Y, Green WH. Predicting Infrared Spectra with Message Passing Neural Networks. J Chem Inf Model 2021; 61:2594-2609. [PMID: 34048221 DOI: 10.1021/acs.jcim.1c00055] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Infrared (IR) spectroscopy remains an important tool for chemical characterization and identification. Chemprop-IR has been developed as a software package for the prediction of IR spectra through the use of machine learning. This work serves the dual purpose of providing a trained general-purpose model for the prediction of IR spectra with ease and providing the Chemprop-IR software framework for the training of new models. In Chemprop-IR, molecules are encoded using a directed message passing neural network, allowing for molecule latent representations to be learned and optimized for the task of spectral predictions. Model training incorporates spectra metrics and normalization techniques that offer better performance with spectral predictions than standard practice in regression models. The model makes use of pretraining using quantum chemistry calculations and ensembling of multiple submodels to improve generalizability and performance. The spectral predictions that result are of high quality, showing capability to capture the extreme diversity of spectral forms over chemical space and represent complex peak structures.
Collapse
Affiliation(s)
- Charles McGill
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Michael Forsuelo
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Yanfei Guan
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - William H Green
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
11
|
Laurens G, Rabary M, Lam J, Peláez D, Allouche AR. Infrared spectra of neutral polycyclic aromatic hydrocarbons based on machine learning potential energy surface and dipole mapping. Theor Chem Acc 2021. [DOI: 10.1007/s00214-021-02773-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
12
|
Geindre H, Allouche AR, Peláez D. Non long-range corrected density functionals incorrectly describe the intensity of the CH stretching band in polycyclic aromatic hydrocarbons. J Comput Chem 2021; 42:1018-1027. [PMID: 33760242 DOI: 10.1002/jcc.26520] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Revised: 02/24/2021] [Accepted: 02/26/2021] [Indexed: 11/12/2022]
Abstract
We present a comprehensive study of the most relevant numerical aspects influencing frequencies and intensities in the infrared spectrum of isolated polycyclic aromatic hydrocarbons (PAHs) regarding the overestimate of the IR CH-stretching bands. We use naphthalene as benchmark and show the validity of our results to different members of the PAH family. Our analysis relies on widely employed density functional theory methods and second-order vibrational perturbational theory for the computation of vibrational eigenstates. We have focused on the elucidation of the origin of the systematic overestimate of the intensities in the CH-stretching region. To rule out nonfundamental numerical errors, we have initially considered the influence of the electronic basis set and various other parameters on the different stages of the vibrational analysis. In a second stage, we have benchmarked the results of different density functional theory functionals with respect to the aforementioned overestimate taken as the ratio between the most prominent features of the spectrum, the CH-bending and the CH-stretching bands. Our results unambiguously indicate that the long-range correction plays a major role in this spurious numerical issue. More specifically, this phenomenon is due to an incorrect description of the charge distribution (and hence dipole) within the symmetrically relevant CH bonds. Long-range correction specifically remedies this issue. It improves the description of the intensities in the stretching region while at the same time it does not perturb significantly the rest of the spectrum. With respect to the frequencies, we have observed an overall improvement when compared to noncorrected functionals.
Collapse
Affiliation(s)
- Hugo Geindre
- Université Lille, UMR 8523 - Physique des Lasers, Atomes et Molécules, Lille, France
| | - Abdul-Rahman Allouche
- Institut Lumière Matière, UMR5306 Université Lyon 1-CNRS, Université de Lyon, Villeurbanne Cedex, France
| | - Daniel Peláez
- Institut des Sciences Moléculaires d'Orsay (ISMO) - UMR 8214. Bât. 520, Université Paris-Saclay, Orsay Cedex, France
| |
Collapse
|
13
|
Pracht P, Grant DF, Grimme S. Comprehensive Assessment of GFN Tight-Binding and Composite Density Functional Theory Methods for Calculating Gas-Phase Infrared Spectra. J Chem Theory Comput 2020; 16:7044-7060. [PMID: 33054183 DOI: 10.1021/acs.jctc.0c00877] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Vibrational spectroscopy is a valuable and widely used analytical tool for the characterization of chemical substances. We investigate the performance of semiempirical quantum mechanical GFN tight-binding and force-field methods for the calculation of gas-phase infrared spectra in comparison to experiment and low-cost (B3LYP-3c) density functional theory. A data set of 7247 experimental references was used to evaluate method performance based on automatic spectra comparison. Various quantitative spectral similarity measures were employed for the comparison between theory and experiment and for determining empirical scaling factors. It is shown that the scaling of atomic masses provides an accurate yet simple alternative to standard global frequency scaling in density functional theory (DFT) and semiempirical calculations. Furthermore, the method performance for 58 exemplary transition metal complexes was investigated. The efficient DFT composite method B3LYP-3c, that was introduced in the course of this work, was found to be excellently suited for general IR spectra calculations. The GFN1- and GFN2-xTB tight-binding methods clearly outperformed the PMx competitors. Conformational changes were investigated for a subset of the data and are found to have a mediocre strong influence on the simulated spectra suggesting that the corresponding elaborate sampling steps may be neglected in automated compound identification workflows.
Collapse
Affiliation(s)
- Philipp Pracht
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstrasse 4, 53115 Bonn, Germany
| | - David F Grant
- Department of Pharmaceutical Sciences, University of Connecticut, Storrs, Connecticut 06268, United States
| | - Stefan Grimme
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstrasse 4, 53115 Bonn, Germany
| |
Collapse
|