1
|
Duan C, Chen S, Taylor MG, Liu F, Kulik HJ. Machine learning to tame divergent density functional approximations: a new path to consensus materials design principles. Chem Sci 2021; 12:13021-13036. [PMID: 34745533 PMCID: PMC8513898 DOI: 10.1039/d1sc03701c] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Accepted: 09/01/2021] [Indexed: 01/17/2023] Open
Abstract
Virtual high-throughput screening (VHTS) with density functional theory (DFT) and machine-learning (ML)-acceleration is essential in rapid materials discovery. By necessity, efficient DFT-based workflows are carried out with a single density functional approximation (DFA). Nevertheless, properties evaluated with different DFAs can be expected to disagree for cases with challenging electronic structure (e.g., open-shell transition-metal complexes, TMCs) for which rapid screening is most needed and accurate benchmarks are often unavailable. To quantify the effect of DFA bias, we introduce an approach to rapidly obtain property predictions from 23 representative DFAs spanning multiple families, “rungs” (e.g., semi-local to double hybrid) and basis sets on over 2000 TMCs. Although computed property values (e.g., spin state splitting and frontier orbital gap) differ by DFA, high linear correlations persist across all DFAs. We train independent ML models for each DFA and observe convergent trends in feature importance, providing DFA-invariant, universal design rules. We devise a strategy to train artificial neural network (ANN) models informed by all 23 DFAs and use them to predict properties (e.g., spin-splitting energy) of over 187k TMCs. By requiring consensus of the ANN-predicted DFA properties, we improve correspondence of computational lead compounds with literature-mined, experimental compounds over the typically employed single-DFA approach. Machine learning (ML)-based feature analysis reveals universal design rules regardless of density functional choices. Using the consensus among multiple functionals, we identify robust lead complexes in ML-accelerated chemical discovery.![]()
Collapse
Affiliation(s)
- Chenru Duan
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge MA 02139 USA +1-617-253-4584.,Department of Chemistry, Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Shuxin Chen
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge MA 02139 USA +1-617-253-4584.,Department of Chemistry, Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Michael G Taylor
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge MA 02139 USA +1-617-253-4584
| | - Fang Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge MA 02139 USA +1-617-253-4584
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge MA 02139 USA +1-617-253-4584
| |
Collapse
|
2
|
Maurer LR, Bursch M, Grimme S, Hansen A. Assessing Density Functional Theory for Chemically Relevant Open-Shell Transition Metal Reactions. J Chem Theory Comput 2021; 17:6134-6151. [PMID: 34546754 DOI: 10.1021/acs.jctc.1c00659] [Citation(s) in RCA: 55] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Due to the principle lack of systematic improvement possibilities of density functional theory, careful assessment of the performance of density functional approximations (DFAs) on well-designed benchmark sets, for example, for reaction energies and barrier heights, is crucial. While main-group chemistry is well covered by several available sets, benchmark data for transition metal chemistry is sparse. This is especially the case for larger, chemically relevant molecules. Addressing this issue, we recently introduced the MOR41 benchmark which covers chemically relevant reactions of closed-shell complexes. In this work, we extend these efforts to single-reference open-shell systems and introduce the "reactions of open-shell single-reference transition metal complexes" (ROST61) benchmark set. ROST61 includes accurate coupled-cluster reference values for 61 reaction energies with a mean reaction energy of -42.8 kcal mol-1. Complexes with 13-93 atoms covering 20 d-block elements are included, but due to the restriction to single-reference open-shell systems, important elements such as iron or platinum could not be taken into account, or only to a small extent. We assess the performance of 31 DFAs in combination with three London dispersion (LD) correction schemes. Further, DFT-based composite methods, MP2, and a few semiempirical quantum chemical methods are evaluated. Consistent with the results for the MOR41 closed-shell benchmark, we find that the ordering of DFAs according to Jacob's ladder is preserved and that adding an LD correction is crucial, clearly improving almost all tested methods. The recently introduced r2SCAN-3c composite method stands out with a remarkable mean absolute deviation (MAD) of only 2.9 kcal mol-1, which is surpassed only by hybrid DFAs with low amounts of Fock exchange (e.g., 2.3 kcal mol-1 for TPSS0-D4/def2-QZVPP) and double-hybrid (DH) DFAs but at a significantly higher computational cost. The lowest MAD of only 1.6 kcal mol-1 is obtained with the DH DFA PWPB95-D4 in the def2-QZVPP basis set approaching the estimated accuracy of the reference method. Overall, the ROST61 set adds important reference data to a sparsely sampled but practically relevant area of chemistry. At this point, it provides valuable orientation for the application and development of new DFAs and electronic structure methods in general.
Collapse
Affiliation(s)
- Leonard R Maurer
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Markus Bursch
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Stefan Grimme
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Andreas Hansen
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| |
Collapse
|
3
|
Santra G, Cho M, Martin JML. Exploring Avenues beyond Revised DSD Functionals: I. Range Separation, with xDSD as a Special Case. J Phys Chem A 2021; 125:4614-4627. [PMID: 34009986 PMCID: PMC8279641 DOI: 10.1021/acs.jpca.1c01294] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Revised: 05/06/2021] [Indexed: 01/16/2023]
Abstract
We have explored the use of range separation as a possible avenue for further improvement on our revDSD minimally empirical double hybrid functionals. Such ωDSD functionals encompass the XYG3 type of double hybrid (i.e., xDSD) as a special case for ω → 0. As in our previous studies, the large and chemically diverse GMTKN55 benchmark suite was used for evaluation. Especially when using the D4 rather than D3BJ dispersion model, xDSD has a slight performance advantage in WTMAD2. As in previous studies, PBEP86 is the winning combination for the semilocal parts. xDSDn-PBEP86-D4 marginally outperforms the previous "best in class" ωB97M(2) Berkeley double hybrid but without range separation and using fewer than half the number of empirical parameters. Range separation turns out to offer only marginal further improvements on GMTKN55 itself. While ωB97M(2) still yields better performance for small-molecule thermochemistry, this is compensated in WTMAD2 by the superior performance of the new functionals for conformer equilibria. Results for two external test sets with pronounced static correlation effects may indicate that range-separated double hybrids are more resilient to such effects.
Collapse
Affiliation(s)
- Golokesh Santra
- Department
of Organic Chemistry, Weizmann Institute
of Science, 7610001 Reḥovot, Israel
| | - Minsik Cho
- Department
of Organic Chemistry, Weizmann Institute
of Science, 7610001 Reḥovot, Israel
- Department
of Chemistry, Brown University, Providence, Rhode Island 02912, United States
| | - Jan M. L. Martin
- Department
of Organic Chemistry, Weizmann Institute
of Science, 7610001 Reḥovot, Israel
| |
Collapse
|
4
|
Daas T, Fabiano E, Della Sala F, Gori-Giorgi P, Vuckovic S. Noncovalent Interactions from Models for the Møller-Plesset Adiabatic Connection. J Phys Chem Lett 2021; 12:4867-4875. [PMID: 34003655 PMCID: PMC8280728 DOI: 10.1021/acs.jpclett.1c01157] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2021] [Accepted: 05/13/2021] [Indexed: 05/08/2023]
Abstract
Given the omnipresence of noncovalent interactions (NCIs), their accurate simulations are of crucial importance across various scientific disciplines. Here we construct accurate models for the description of NCIs by an interpolation along the Møller-Plesset adiabatic connection (MP AC). Our interpolation approximates the correlation energy, by recovering MP2 at small coupling strengths and the correct large-coupling strength expansion of the MP AC, recently shown to be a functional of the Hartree-Fock density. Our models are size consistent for fragments with nondegenerate ground states, have the same cost as double hybrids, and require no dispersion corrections to capture NCIs accurately. These interpolations greatly reduce large MP2 errors for typical π-stacking complexes (e.g., benzene-pyridine dimers) and for the L7 data set. They are also competitive with state-of-the-art dispersion enhanced functionals and can even significantly outperform them for a variety of data sets, such as CT7 and L7.
Collapse
Affiliation(s)
- Timothy
J. Daas
- Department
of Chemistry & Pharmaceutical Sciences and Amsterdam Institute
of Molecular and Life Sciences (AIMMS), Faculty of Science, Vrije Universiteit, De Boelelaan 1083, 1081HV Amsterdam, The Netherlands
| | - Eduardo Fabiano
- Institute
for Microelectronics and Microsystems (CNR-IMM), Via Monteroni, Campus Unisalento, 73100 Lecce, Italy
- Center
for Biomolecular Nanotechnologies, Istituto
Italiano di Tecnologia, Via Barsanti 14, 73010 Arnesano (LE), Italy
| | - Fabio Della Sala
- Institute
for Microelectronics and Microsystems (CNR-IMM), Via Monteroni, Campus Unisalento, 73100 Lecce, Italy
- Center
for Biomolecular Nanotechnologies, Istituto
Italiano di Tecnologia, Via Barsanti 14, 73010 Arnesano (LE), Italy
| | - Paola Gori-Giorgi
- Department
of Chemistry & Pharmaceutical Sciences and Amsterdam Institute
of Molecular and Life Sciences (AIMMS), Faculty of Science, Vrije Universiteit, De Boelelaan 1083, 1081HV Amsterdam, The Netherlands
| | - Stefan Vuckovic
- Department
of Chemistry & Pharmaceutical Sciences and Amsterdam Institute
of Molecular and Life Sciences (AIMMS), Faculty of Science, Vrije Universiteit, De Boelelaan 1083, 1081HV Amsterdam, The Netherlands
- Physical
and Theoretical Chemistry, University of
Saarland, 66123 Saarbrücken, Germany
- Department
of Chemistry, University of California, Irvine, California 92697, United States
| |
Collapse
|
5
|
Bodo E, Bonomo M, Mariani A. Assessing the Structure of Protic Ionic Liquids Based on Triethylammonium and Organic Acid Anions. J Phys Chem B 2021; 125:2781-2792. [PMID: 33719447 PMCID: PMC8041315 DOI: 10.1021/acs.jpcb.1c00249] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
![]()
We present a computational
analysis of the short-range structure
of three protic ionic liquids based on strong organic acids: trifluoracetate,
methanesulfonate, and triflate of triethylammonium. Accurate ab initio computations carried out on the gas-phase dimers
show that the protonation of triethylamine is spontaneous. We have
identified the anion-cation binding motif that is due to the presence
of a strong hydrogen bond and to electrostatic interactions. The strength
of the hydrogen bond and the magnitude of the binding energy decrease
in the order trifluoroacetate ≳ methanesulfonate > triflate.
The corresponding simulations of the bulk phases, obtained using a
semiempirical evaluation of the interatomic forces, reveal that on
short timescales, the state of the three liquids remains highly ionized
and that the gas-phase cation-/anion-binding motif is preserved while
no other peculiar structural features seem to emerge.
Collapse
Affiliation(s)
- Enrico Bodo
- Chemistry Department, University of Rome "La Sapienza", Piazzale A. Moro 5, 00185 Rome, Italy
| | - Matteo Bonomo
- Chemistry Department, University of Rome "La Sapienza", Piazzale A. Moro 5, 00185 Rome, Italy.,Department of Chemistry, NIS Interdepartmental Centre, INSTM Reference Centre, University of Turin, Via Gioacchino Quarello 15/A, 10125 Turin, Italy
| | - Alessandro Mariani
- Chemistry Department, University of Rome "La Sapienza", Piazzale A. Moro 5, 00185 Rome, Italy.,Helmholtz Institute Ulm (HIU), Helmholtzstrasse 11, Ulm 89081, Germany.,Karlsruhe Institute of Technology (KIT), P.O. Box 3640, Karlsruhe 76021, Germany
| |
Collapse
|