1
|
Snowdon C, Barca GMJ. An Efficient RI-MP2 Algorithm for Distributed Many-GPU Architectures. J Chem Theory Comput 2024. [PMID: 39422609 DOI: 10.1021/acs.jctc.4c00814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
Second-order Møller-Plesset perturbation theory (MP2) using the Resolution of the Identity approximation (RI-MP2) is a widely used method for computing molecular energies beyond the Hartree-Fock mean-field approximation. However, its high computational cost and lack of efficient algorithms for modern supercomputing architectures limit its applicability to large molecules. In this paper, we present the first distributed-memory many-GPU RI-MP2 algorithm explicitly designed to utilize hundreds of GPU accelerators for every step of the computation. Our novel algorithm achieves near-peak performance on GPU-based supercomputers through the development of a distributed memory algorithm for forming RI-MP2 intermediate tensors with zero internode communication, except for a single O ( N 2 ) asynchronous broadcast, and a distributed memory algorithm for the O ( N 5 ) energy reduction step, capable of sustaining near-peak performance on clusters with several hundred GPUs. Comparative analysis shows our implementation outperforms state-of-the-art quantum chemistry software by over 3.5 times in speed while achieving an 8-fold reduction in computational power consumption. Benchmarking on the Perlmutter supercomputer, our algorithm achieves 11.8 PFLOP/s (83% of peak performance) performing and the RI-MP2 energy calculation on a 314-water cluster with 7850 primary and 30,144 auxiliary basis functions in 4 min on 180 nodes and 720 A100 GPUs. This performance represents a substantial improvement over traditional CPU-based methods, demonstrating significant time-to-solution and power consumption benefits of leveraging modern GPU-accelerated computing environments for quantum chemistry calculations.
Collapse
Affiliation(s)
- Calum Snowdon
- School of Computing, Australian National University, Canberra 2600, Australia
| | - Giuseppe M J Barca
- School of Computing and Information Systems, University of Melbourne, Melbourne 3010, Australia
| |
Collapse
|
2
|
Schmid SP, Schlosser L, Glorius F, Jorner K. Catalysing (organo-)catalysis: Trends in the application of machine learning to enantioselective organocatalysis. Beilstein J Org Chem 2024; 20:2280-2304. [PMID: 39290209 PMCID: PMC11406055 DOI: 10.3762/bjoc.20.196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Accepted: 08/09/2024] [Indexed: 09/19/2024] Open
Abstract
Organocatalysis has established itself as a third pillar of homogeneous catalysis, besides transition metal catalysis and biocatalysis, as its use for enantioselective reactions has gathered significant interest over the last decades. Concurrent to this development, machine learning (ML) has been increasingly applied in the chemical domain to efficiently uncover hidden patterns in data and accelerate scientific discovery. While the uptake of ML in organocatalysis has been comparably slow, the last two decades have showed an increased interest from the community. This review gives an overview of the work in the field of ML in organocatalysis. The review starts by giving a short primer on ML for experimental chemists, before discussing its application for predicting the selectivity of organocatalytic transformations. Subsequently, we review ML employed for privileged catalysts, before focusing on its application for catalyst and reaction design. Concluding, we give our view on current challenges and future directions for this field, drawing inspiration from the application of ML to other scientific domains.
Collapse
Affiliation(s)
- Stefan P Schmid
- Institute of Chemical and Bioengineering, Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich CH-8093, Switzerland
| | - Leon Schlosser
- Organisch-Chemisches Institut, Universität Münster, 48149 Münster, Germany
| | - Frank Glorius
- Organisch-Chemisches Institut, Universität Münster, 48149 Münster, Germany
| | - Kjell Jorner
- Institute of Chemical and Bioengineering, Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich CH-8093, Switzerland
- National Centre of Competence in Research (NCCR) Catalysis, ETH Zurich, Zurich CH-8093, Switzerland
| |
Collapse
|
3
|
Erdmann P, Sigmund LM, Schmitt M, Hähnel T, Dittmer LB, Greb L. A Benchmark Study of DFT-Computed p-Block Element Lewis Pair Formation Enthalpies Against Experimental Calorimetric Data. Chemphyschem 2024:e202400761. [PMID: 39219146 DOI: 10.1002/cphc.202400761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2024] [Revised: 08/30/2024] [Accepted: 08/30/2024] [Indexed: 09/04/2024]
Abstract
The quantification of Lewis acidity is of fundamental and applied importance in chemistry. While the computed fluoride ion affinity (FIA) is the most widely accepted thermodynamic metric, only sparse experimental values exist. Accordingly, a benchmark of methods for computing Lewis pair formation enthalpies, also with a broader set of Lewis bases against experimental data, is missing. Herein, we evaluate different density functionals against a set of 112 experimentally determined Lewis acid/base binding enthalpies and gauge influences such as solvation correction in structure optimization. From that, we can recommend r2SCAN-3c for robust quantification of this omnipresent interaction.
Collapse
Affiliation(s)
- Philipp Erdmann
- Anorganisch-Chemisches Institut, Ruprecht-Karls Universität Heidelberg, Im Neuenheimer Feld 270, 69120, Heidelberg, Germany
| | - Lukas M Sigmund
- Anorganisch-Chemisches Institut, Ruprecht-Karls Universität Heidelberg, Im Neuenheimer Feld 270, 69120, Heidelberg, Germany
| | - Manuel Schmitt
- Anorganisch-Chemisches Institut, Ruprecht-Karls Universität Heidelberg, Im Neuenheimer Feld 270, 69120, Heidelberg, Germany
| | - Theresa Hähnel
- Anorganisch-Chemisches Institut, Ruprecht-Karls Universität Heidelberg, Im Neuenheimer Feld 270, 69120, Heidelberg, Germany
| | - Linus B Dittmer
- Anorganisch-Chemisches Institut, Ruprecht-Karls Universität Heidelberg, Im Neuenheimer Feld 270, 69120, Heidelberg, Germany
| | - Lutz Greb
- Anorganisch-Chemisches Institut, Ruprecht-Karls Universität Heidelberg, Im Neuenheimer Feld 270, 69120, Heidelberg, Germany
| |
Collapse
|
4
|
Nagy PR. State-of-the-art local correlation methods enable affordable gold standard quantum chemistry for up to hundreds of atoms. Chem Sci 2024:d4sc04755a. [PMID: 39246365 PMCID: PMC11376132 DOI: 10.1039/d4sc04755a] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Accepted: 07/30/2024] [Indexed: 09/10/2024] Open
Abstract
In this feature, we review the current capabilities of local electron correlation methods up to the coupled cluster model with single, double, and perturbative triple excitations [CCSD(T)], which is a gold standard in quantum chemistry. The main computational aspects of the local method types are assessed from the perspective of applications, but the focus is kept on how to achieve chemical accuracy (i.e., <1 kcal mol-1 uncertainty), as well as on the broad scope of chemical problems made accessible. The performance of state-of-the-art methods is also compared, including the most employed DLPNO and, in particular, our local natural orbital (LNO) CCSD(T) approach. The high accuracy and efficiency of the LNO method makes chemically accurate CCSD(T) computations accessible for molecules of hundreds of atoms with resources affordable to a broad computational community (days on a single CPU and 10-100 GB of memory). Recent developments in LNO-CCSD(T) enable systematic convergence and robust error estimates even for systems of complicated electronic structure or larger size (up to 1000 atoms). The predictive power of current local CCSD(T) methods, usually at about 1-2 order of magnitude higher cost than hybrid density functional theory (DFT), has become outstanding on the palette of computational chemistry applicable for molecules of practical interest. We also review more than 50 LNO-based and other advanced local-CCSD(T) applications for realistic, large systems across molecular interactions as well as main group, transition metal, bio-, and surface chemistry. The examples show that properly executed local-CCSD(T) can contribute to binding, reaction equilibrium, rate constants, etc. which are able to match measurements within the error estimates. These applications demonstrate that modern, open-access, and broadly affordable local methods, such as LNO-CCSD(T), already enable predictive computations and atomistic insight for complicated, real-life molecular processes in realistic environments.
Collapse
Affiliation(s)
- Péter R Nagy
- Department of Physical Chemistry and Materials Science, Faculty of Chemical Technology and Biotechnology, Budapest University of Technology and Economics Műegyetem rkp. 3. H-1111 Budapest Hungary
- HUN-REN-BME Quantum Chemistry Research Group Műegyetem rkp. 3. H-1111 Budapest Hungary
- MTA-BME Lendület Quantum Chemistry Research Group Műegyetem rkp. 3. H-1111 Budapest Hungary
| |
Collapse
|
5
|
Zhao H, Gould T, Vuckovic S. Deep Mind 21 functional does not extrapolate to transition metal chemistry. Phys Chem Chem Phys 2024; 26:12289-12298. [PMID: 38597718 DOI: 10.1039/d4cp00878b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/11/2024]
Abstract
The development of density functional approximations stands at a crossroads: while machine-learned functionals show potential to surpass their human-designed counterparts, their extrapolation to unseen chemistry lags behind. Here we assess how well the recent Deep Mind 21 (DM21) machine-learned functional [Science, 2021, 374, 1385-1389], trained on main-group chemistry, extrapolates to transition metal chemistry (TMC). We show that DM21 demonstrates comparable or occasionally superior accuracy to B3LYP for TMC, but consistently struggles with achieving self-consistent field convergence for TMC molecules. We also compare main-group and TMC machine-learning DM21 features to shed light on DM21's challenges in TMC. We finally propose strategies to overcome limitations in the extrapolative capabilities of machine-learned functionals in TMC.
Collapse
Affiliation(s)
- Heng Zhao
- Department of Chemistry, University of Fribourg, Fribourg, Switzerland.
| | - Tim Gould
- Queensland Micro- and Nanotechnology Centre, Griffith University, Nathan, Qld 4111, Australia
| | - Stefan Vuckovic
- Department of Chemistry, University of Fribourg, Fribourg, Switzerland.
| |
Collapse
|
6
|
Stocks R, Palethorpe E, Barca GMJ. High-Performance Multi-GPU Analytic RI-MP2 Energy Gradients. J Chem Theory Comput 2024; 20:2505-2519. [PMID: 38456899 DOI: 10.1021/acs.jctc.3c01424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]
Abstract
This article presents a novel algorithm for the calculation of analytic energy gradients from second-order Møller-Plesset perturbation theory within the Resolution-of-the-Identity approximation (RI-MP2), which is designed to achieve high performance on clusters with multiple graphical processing units (GPUs). The algorithm uses GPUs for all major steps of the calculation, including integral generation, formation of all required intermediate tensors, solution of the Z-vector equation and gradient accumulation. The implementation in the EXtreme Scale Electronic Structure System (EXESS) software package includes a tailored, highly efficient, multistream scheduling system to hide CPU-GPU data transfer latencies and allows nodes with 8 A100 GPUs to operate at over 80% of theoretical peak floating-point performance. Comparative performance analysis shows a significant reduction in computational time relative to traditional multicore CPU-based methods, with our approach achieving up to a 95-fold speedup over the single-node performance of established software such as Q-Chem and ORCA. Additionally, we demonstrate that pairing our implementation with the molecular fragmentation framework in EXESS can drastically lower the computational scaling of RI-MP2 gradient calculations from quintic to subquadratic, enabling further substantial savings in runtime while retaining high numerical accuracy in the resulting gradients.
Collapse
Affiliation(s)
- Ryan Stocks
- School of Computing, Australian National University, Canberra, ACT 2601, Australia
| | - Elise Palethorpe
- School of Computing, Australian National University, Canberra, ACT 2601, Australia
| | - Giuseppe M J Barca
- School of Computing, Australian National University, Canberra, ACT 2601, Australia
| |
Collapse
|
7
|
Kaur R, Frederickson A, Wetmore SD. Elucidation of the catalytic mechanism of a single-metal dependent homing endonuclease using QM and QM/MM approaches: the case study of I- PpoI. Phys Chem Chem Phys 2024; 26:8919-8931. [PMID: 38426850 DOI: 10.1039/d3cp06201e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2024]
Abstract
Homing endonucleases (HEs) are highly specific DNA cleaving enzymes, with I-PpoI having been suggested to use a single metal to accelerate phosphodiester bond cleavage. Although an I-PpoI mechanism has been proposed based on experimental structural data, no consensus has been reached regarding the roles of the metal or key active site amino acids. This study uses QM cluster and QM/MM calculations to provide atomic-level details of the I-PpoI catalytic mechanism. Minimal QM cluster and large-scale QM/MM models demonstrate that the experimentally-proposed pathway involving direct Mg2+ coordination to the substrate coupled with leaving group protonation through a metal-activated water is not feasible due to an inconducive I-PpoI active site alignment. Despite QM cluster models of varying size uncovering a pathway involving leaving group protonation by a metal-activated water, indirect (water-mediated) metal coordination to the substrate is required to afford this pathway, which renders this mechanism energetically infeasible. Instead, QM cluster models reveal that the preferred pathway involves direct Mg2+-O3' coordination to stabilize the charged substrate and assist leaving group departure, while H98 activates the water nucleophile. These calculations also underscore that both catalytic residues that directly interact with the substrate and secondary amino acids that position or stabilize these residues are required for efficient catalysis. QM/MM calculations on the solvated enzyme-DNA complex verify the preferred mechanism, which is fully consistent with experimental kinetic, structural, and mutational data. The fundamental understanding of the I-PpoI mechanism of action, gained from the present work can be used to further explore potential uses of this enzyme in biotechnology and medicine, and direct future computational investigations of other members of the understudied HE family.
Collapse
Affiliation(s)
- Rajwinder Kaur
- Department of Chemistry and Biochemistry, University of Lethbridge, 4401 University Drive West, Lethbridge, Alberta T1K 3M4, Canada.
| | - Angela Frederickson
- Department of Chemistry and Biochemistry, University of Lethbridge, 4401 University Drive West, Lethbridge, Alberta T1K 3M4, Canada.
| | - Stacey D Wetmore
- Department of Chemistry and Biochemistry, University of Lethbridge, 4401 University Drive West, Lethbridge, Alberta T1K 3M4, Canada.
| |
Collapse
|
8
|
Wappett DA, Goerigk L. Exploring CPS-Extrapolated DLPNO-CCSD(T 1) Reference Values for Benchmarking DFT Methods on Enzymatically Catalyzed Reactions. J Phys Chem A 2024; 128:62-72. [PMID: 38124376 DOI: 10.1021/acs.jpca.3c05086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
Domain-based local pair natural orbital coupled-cluster singles doubles with perturbative triples [DLPNO-CCSD(T)] is regularly used to calculate reliable benchmark reference values at a computational cost significantly lower than that of canonical CCSD(T). Recent work has shown that even greater accuracy can be obtained at only a small additional cost through extrapolation to the complete PNO space (CPS) limit. Herein, we test two levels of CPS extrapolation, CPS(5,6), which approximates the accuracy of standard TightPNO, and CPS(6,7), which surpasses it, as benchmark values to test density functional approximations (DFAs) on a small set of organic and transition-metal-dependent enzyme active site models. Between the different reference levels of theory, there are changes in the magnitudes of the absolute deviations for all functionals, but these are small and there is minimal impact on the relative rankings of the tested DFAs. The differences are more significant for the metalloenzymes than the organic enzymes, so we repeat the tests on our entire ENZYMES22 set of organic enzyme active site models [Wappett, D. A.; Goerigk, L. J. Phys. Chem. A 2019, 123, 7057-7074] to confirm that using the CPS extrapolations for the reference values has negligible impact on the benchmarking outcomes. This means that we can particularly recommend CPS(5,6) as an alternative to standard TightPNO settings for calculating reference values, increasing the applicability of DLPNO-CCSD(T) in benchmarking reaction energies and barrier heights of larger models of organic enzymes. DLPNO-CCSD(T1)/CPS(6,7) energies for ENZYMES22 are finally presented as updated reference values for the set, reflecting the recent improvements in the method.
Collapse
Affiliation(s)
- Dominique A Wappett
- School of Chemistry, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Lars Goerigk
- School of Chemistry, The University of Melbourne, Parkville, Victoria 3010, Australia
| |
Collapse
|