1
|
Knattrup Y, Kubečka J, Wu H, Jensen F, Elm J. Reparameterization of GFN1-xTB for atmospheric molecular clusters: applications to multi-acid-multi-base systems. RSC Adv 2024; 14:20048-20055. [PMID: 38911834 PMCID: PMC11191700 DOI: 10.1039/d4ra03021d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 06/16/2024] [Indexed: 06/25/2024] Open
Abstract
Atmospheric molecular clusters, the onset of secondary aerosol formation, are a major part of the current uncertainty in modern climate models. Quantum chemical (QC) methods are usually employed in a funneling approach to identify the lowest free energy cluster structures. However, the funneling approach highly depends on the accuracy of low-cost methods to ensure that important low-lying minima are not missed. Here we present a reparameterized GFN1-xTB model based on the clusteromics I-V datasets for studying atmospheric molecular clusters (AMC), denoted AMC-xTB. The AMC-xTB model reduces the mean of electronic binding energy errors from 7-11.8 kcal mol-1 to roughly 0 kcal mol-1 and the root mean square deviation from 7.6-12.3 kcal mol-1 to 0.81-1.45 kcal mol-1. In addition, the minimum structures obtained with AMC-xTB are closer to the ωB97X-D/6-31++G(d,p) level of theory compared to GFN1-xTB. We employ the new parameterization in two new configurational sampling workflows that include an additional meta-dynamics sampling step using CREST with the AMC-xTB model. The first workflow, denoted the "independent workflow", is a commonly used funneling approach with an additional CREST step, and the second, the "improvement workflow", is where the best configuration currently known in the literature is improved with a CREST + AMC-xTB step. Testing the new workflow we find configurations lower in free energy for all the literature clusters with the largest improvement being up to 21 kcal mol-1. Lastly, by employing the improvement workflow we massively screened 288 new multi-acid-multi-base clusters containing up to 8 different species. For these new multi-acid-multi-base cluster systems we observe that the improvement workflow finds configurations lower in free energy for 245 out of 288 (85.1%) cluster structures. Most of the improvements are within 2 kcal mol-1, but we see improvements up to 8.3 kcal mol-1. Hence, we can recommend this new workflow based on the AMC-xTB model for future studies on atmospheric molecular clusters.
Collapse
Affiliation(s)
- Yosef Knattrup
- Department of Chemistry, Aarhus University Langelandsgade 140, Aarhus C 8000 Denmark +45 28938085
| | - Jakub Kubečka
- Department of Chemistry, Aarhus University Langelandsgade 140, Aarhus C 8000 Denmark +45 28938085
| | - Haide Wu
- Department of Chemistry, Aarhus University Langelandsgade 140, Aarhus C 8000 Denmark +45 28938085
| | - Frank Jensen
- Department of Chemistry, Aarhus University Langelandsgade 140, Aarhus C 8000 Denmark +45 28938085
| | - Jonas Elm
- Department of Chemistry, Aarhus University Langelandsgade 140, Aarhus C 8000 Denmark +45 28938085
| |
Collapse
|
2
|
Kubečka J, Besel V, Neefjes I, Knattrup Y, Kurtén T, Vehkamäki H, Elm J. Computational Tools for Handling Molecular Clusters: Configurational Sampling, Storage, Analysis, and Machine Learning. ACS OMEGA 2023; 8:45115-45128. [PMID: 38046354 PMCID: PMC10688175 DOI: 10.1021/acsomega.3c07412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Revised: 10/25/2023] [Accepted: 10/26/2023] [Indexed: 12/05/2023]
Abstract
Computational modeling of atmospheric molecular clusters requires a comprehensive understanding of their complex configurational spaces, interaction patterns, stabilities against fragmentation, and even dynamic behaviors. To address these needs, we introduce the Jammy Key framework, a collection of automated scripts that facilitate and streamline molecular cluster modeling workflows. Jammy Key handles file manipulations between varieties of integrated third-party programs. The framework is divided into three main functionalities: (1) Jammy Key for configurational sampling (JKCS) to perform systematic configurational sampling of molecular clusters, (2) Jammy Key for quantum chemistry (JKQC) to analyze commonly used quantum chemistry output files and facilitate database construction, handling, and analysis, and (3) Jammy Key for machine learning (JKML) to manage machine learning methods in optimizing molecular cluster modeling. This automation and machine learning utilization significantly reduces manual labor, greatly speeds up the search for molecular cluster configurations, and thus increases the number of systems that can be studied. Following the example of the Atmospheric Cluster Database (ACDB) of Elm (ACS Omega, 4, 10965-10984, 2019), the molecular clusters modeled in our group using the Jammy Key framework have been stored in an improved online GitHub repository named ACDB 2.0. In this work, we present the Jammy Key package alongside its assorted applications, which underline its versatility. Using several illustrative examples, we discuss how to choose appropriate combinations of methodologies for treating particular cluster types, including reactive, multicomponent, charged, or radical clusters, as well as clusters containing flexible or multiconformer monomers or heavy atoms. Finally, we present a detailed example of using the tools for atmospheric acid-base clusters.
Collapse
Affiliation(s)
- Jakub Kubečka
- Aarhus
University, Department of Chemistry, Langelandsgade 140, Aarhus 8000, Denmark
| | - Vitus Besel
- University
of Helsinki, Institute for Atmospheric and
Earth System Research/Physics, Faculty of Science, P.O. Box 64, Helsinki 00140, Finland
| | - Ivo Neefjes
- University
of Helsinki, Institute for Atmospheric and
Earth System Research/Physics, Faculty of Science, P.O. Box 64, Helsinki 00140, Finland
| | - Yosef Knattrup
- Aarhus
University, Department of Chemistry, Langelandsgade 140, Aarhus 8000, Denmark
| | - Theo Kurtén
- University
of Helsinki, Institute for Atmospheric and
Earth System Research/Chemistry, Faculty of Science, P.O. Box 64, Helsinki 00140, Finland
| | - Hanna Vehkamäki
- University
of Helsinki, Institute for Atmospheric and
Earth System Research/Physics, Faculty of Science, P.O. Box 64, Helsinki 00140, Finland
| | - Jonas Elm
- Aarhus
University, Department of Chemistry, Langelandsgade 140, Aarhus 8000, Denmark
| |
Collapse
|
3
|
Engsvang M, Kubečka J, Elm J. Toward Modeling the Growth of Large Atmospheric Sulfuric Acid-Ammonia Clusters. ACS OMEGA 2023; 8:34597-34609. [PMID: 37779982 PMCID: PMC10536041 DOI: 10.1021/acsomega.3c03521] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/20/2023] [Accepted: 08/30/2023] [Indexed: 10/03/2023]
Abstract
Studying large atmospheric molecular clusters is needed to understand the transition between clusters and aerosol particles. In this work, we studied the (SA)n(AM)n clusters with n up to 30 and the (SA)m(AM)m±2 clusters, with m = 6-20. The cluster configurations are sampled using the ABCluster program, and the cluster geometries and thermochemical parameters are calculated using GFN1-xTB. The cluster binding energies are calculated using B97-3c. We find that the addition of sulfuric acid is preferred to the addition of ammonia. The addition free energies were found to have large uncertainties, which could potentially be attributed to errors in the applied level of theory. Based on DLPNO-CCSD(T0)/aug-cc-pVTZ benchmarks of the binding energies of the large (SA)8-9(AM)10 and (SA)10(AM)10-11 clusters, we find that ωB97X-D3BJ with a large basis set is required to yield accurate binding and addition energies. However, based on recalculations of the single-point energy at r2SCAN-3c and ωB97X-D3BJ/6-311++G(3df,3pd), we show that the single-point energy contribution is not the primary source of error. We hypothesize that a larger source of error might be present in the form of insufficient configurational sampling. Finally, we train Δ machine learning model on (SA)n(AM)n clusters with n up to 5 and show that we can predict the binding energies of clusters up to sizes of (SA)30(AM)30 with a binding energy error below 0.6 %. This is an encouraging approach for accurately modeling the binding energies of large acid-base clusters in the future.
Collapse
Affiliation(s)
- Morten Engsvang
- Department
of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - Jakub Kubečka
- Department
of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - Jonas Elm
- Department
of Chemistry, iClimate, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| |
Collapse
|
4
|
Knattrup Y, Kubečka J, Elm J. Nitric Acid and Organic Acids Suppress the Role of Methanesulfonic Acid in Atmospheric New Particle Formation. J Phys Chem A 2023; 127:7568-7578. [PMID: 37651638 DOI: 10.1021/acs.jpca.3c04393] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Abstract
Multicomponent atmospheric molecular clusters, typically comprising a combination of acids and bases, play a pivotal role in our climate system and contribute to the perplexing uncertainties embedded in modern climate models. Our understanding of cluster formation is limited by the lack of studies on complex mixed-acid-mixed-base systems. Here, we investigate multicomponent clusters consisting of mixtures of several acid and base molecules: sulfuric acid (SA), methanesulfonic acid (MSA), nitric acid (NA), formic acid (FA), along with methylamine (MA), dimethylamine (DMA), and trimethylamine (TMA). We calculated the binding free energies of a comprehensive set of 252 mixed-acid-mixed-base clusters at the DLPNO-CCSD(T0)/aug-cc-pVTZ//ωB97X-D/6-31++G(d,p) level of theory. Combined with the existing datasets, we simulated the new particle formation (NPF) rates using the Atmospheric Cluster Dynamics Code (ACDC). We find that the presence of NA and FA had a substantial impact, increasing the NPF rate by 60% at realistic conditions. Intriguingly, we find that NA and FA suppress the role of MSA in NPF. These findings suggest that even high concentration of MSA has a limited impact on NPF in polluted regions with high FA and NA. We outline a method for generating a lookup table that could potentially be used in climate models that sufficiently incorporates all the required chemistry. By unraveling the molecular mechanisms of mixed-acid-mixed-base clusters, we get one step closer to comprehending their implications for our global climate system.
Collapse
Affiliation(s)
- Yosef Knattrup
- Department of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - Jakub Kubečka
- Department of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - Jonas Elm
- Department of Chemistry, iClimate, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| |
Collapse
|
5
|
Knattrup Y, Kubečka J, Ayoubi D, Elm J. Clusterome: A Comprehensive Data Set of Atmospheric Molecular Clusters for Machine Learning Applications. ACS OMEGA 2023; 8:25155-25164. [PMID: 37483242 PMCID: PMC10357536 DOI: 10.1021/acsomega.3c02203] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Accepted: 06/16/2023] [Indexed: 07/25/2023]
Abstract
Formation and growth of atmospheric molecular clusters into aerosol particles impact the global climate and contribute to the high uncertainty in modern climate models. Cluster formation is usually studied using quantum chemical methods, which quickly becomes computationally expensive when system sizes grow. In this work, we present a large database of ∼250k atmospheric relevant cluster structures, which can be applied for developing machine learning (ML) models. The database is used to train the ML model kernel ridge regression (KRR) with the FCHL19 representation. We test the ability of the model to extrapolate from smaller clusters to larger clusters, between different molecules, between equilibrium structures and out-of-equilibrium structures, and the transferability onto systems with new interactions. We show that KRR models can extrapolate to larger sizes and transfer acid and base interactions with mean absolute errors below 1 kcal/mol. We suggest introducing an iterative ML step in configurational sampling processes, which can reduce the computational expense. Such an approach would allow us to study significantly more cluster systems at higher accuracy than previously possible and thereby allow us to cover a much larger part of relevant atmospheric compounds.
Collapse
Affiliation(s)
- Yosef Knattrup
- Department
of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - Jakub Kubečka
- Department
of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - Daniel Ayoubi
- Department
of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - Jonas Elm
- Department
of Chemistry, iClimate, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| |
Collapse
|
6
|
Fomete S, Kubečka J, Elm J, Jen CN. Limited Role of Malonic Acid in Sulfuric Acid-Dimethylamine New Particle Formation. ACS OMEGA 2023; 8:19807-19815. [PMID: 37305259 PMCID: PMC10249388 DOI: 10.1021/acsomega.3c01643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 05/08/2023] [Indexed: 06/13/2023]
Abstract
Aerosols play an important role in climate and air quality; however, the mechanisms behind aerosol particle formation in the atmosphere are poorly understood. Studies have identified sulfuric acid, water, oxidized organics, and ammonia/amines as key precursors for forming aerosol particles in the atmosphere. Theoretical and experimental investigations have indicated that other species, such as organic acids, may be involved in atmospheric nucleation and growth of freshly formed aerosol particles. Organic acids, such as dicarboxylic acids, which are abundant in the atmosphere, have been measured in ultrafine aerosol particles. These observations suggest that organic acids may contribute to new particle formation in the atmosphere but their role remains ambiguous. This study examines how malonic acid interacts with sulfuric acid and dimethylamine to form new particles at warm boundary layer conditions using experimental observations from a laminar flow reactor and quantum chemical calculations coupled with cluster dynamics simulations. Observations reveal that malonic acid does not contribute to the initial steps (formation of <1 nm diameter particle) of nucleation with sulfuric acid-dimethylamine. In addition, malonic acid was found to not participate in the subsequent growth of the freshly nucleated 1 nm particles from sulfuric acid-dimethylamine reactions to diameters of 2 nm.
Collapse
Affiliation(s)
- Sandra
K.W. Fomete
- Department
of Chemical Engineering, Carnegie Mellon
University, Pittsburgh, Pennsylvania 15213, United States
- Center
for Atmospheric Particle Studies, Carnegie
Mellon University, Pittsburgh, Pennsylvania 15213, United States
| | - Jakub Kubečka
- Department
of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - Jonas Elm
- Department
of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - Coty N. Jen
- Department
of Chemical Engineering, Carnegie Mellon
University, Pittsburgh, Pennsylvania 15213, United States
- Center
for Atmospheric Particle Studies, Carnegie
Mellon University, Pittsburgh, Pennsylvania 15213, United States
| |
Collapse
|