1
|
Cohen RD, Wood JS, Lam YH, Buevich AV, Sherer EC, Reibarkh M, Williamson RT, Martin GE. DELTA50: A Highly Accurate Database of Experimental 1H and 13C NMR Chemical Shifts Applied to DFT Benchmarking. Molecules 2023; 28:molecules28062449. [PMID: 36985422 PMCID: PMC10051451 DOI: 10.3390/molecules28062449] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 02/23/2023] [Accepted: 02/28/2023] [Indexed: 03/30/2023] Open
Abstract
Density functional theory (DFT) benchmark studies of 1H and 13C NMR chemical shifts often yield differing conclusions, likely due to non-optimal test molecules and non-standardized data acquisition. To address this issue, we carefully selected and measured 1H and 13C NMR chemical shifts for 50 structurally diverse small organic molecules containing atoms from only the first two rows of the periodic table. Our NMR dataset, DELTA50, was used to calculate linear scaling factors and to evaluate the accuracy of 73 density functionals, 40 basis sets, 3 solvent models, and 3 gauge-referencing schemes. The best performing DFT methodologies for 1H and 13C NMR chemical shift predictions were WP04/6-311++G(2d,p) and ωB97X-D/def2-SVP, respectively, when combined with the polarizable continuum solvent model (PCM) and gauge-independent atomic orbital (GIAO) method. Geometries should be optimized at the B3LYP-D3/6-311G(d,p) level including the PCM solvent model for the best accuracy. Predictions of 20 organic compounds and natural products from a separate probe set had root-mean-square deviations (RMSD) of 0.07 to 0.19 for 1H and 0.5 to 2.9 for 13C. Maximum deviations were less than 0.5 and 6.5 ppm for 1H and 13C, respectively.
Collapse
Affiliation(s)
- Ryan D Cohen
- Analytical Research and Development, Merck & Co., Inc., Rahway, NJ 07065, USA
- Department of Chemistry and Biochemistry, Seton Hall University, South Orange, NJ 07079, USA
| | - Jared S Wood
- Analytical Research and Development, Merck & Co., Inc., Rahway, NJ 07065, USA
- Department of Chemistry and Biochemistry, University of North Carolina Wilmington, Wilmington, NC 28409, USA
| | - Yu-Hong Lam
- Department of Computational and Structural Chemistry, Merck & Co., Inc., Rahway, NJ 07065, USA
| | - Alexei V Buevich
- Analytical Research and Development, Merck & Co., Inc., Rahway, NJ 07065, USA
| | - Edward C Sherer
- Analytical Research and Development, Merck & Co., Inc., Rahway, NJ 07065, USA
| | - Mikhail Reibarkh
- Analytical Research and Development, Merck & Co., Inc., Rahway, NJ 07065, USA
| | - R Thomas Williamson
- Department of Chemistry and Biochemistry, University of North Carolina Wilmington, Wilmington, NC 28409, USA
| | - Gary E Martin
- Department of Chemistry and Biochemistry, Seton Hall University, South Orange, NJ 07079, USA
| |
Collapse
|
2
|
Jana S, Herbert JM. Slater transition methods for core-level electron binding energies. J Chem Phys 2023; 158:094111. [PMID: 36889976 DOI: 10.1063/5.0134459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023] Open
Abstract
Methods for computing core-level ionization energies using self-consistent field (SCF) calculations are evaluated and benchmarked. These include a "full core hole" (or "ΔSCF") approach that fully accounts for orbital relaxation upon ionization, but also methods based on Slater's transition concept in which the binding energy is estimated from an orbital energy level that is obtained from a fractional-occupancy SCF calculation. A generalization that uses two different fractional-occupancy SCF calculations is also considered. The best of the Slater-type methods afford mean errors of 0.3-0.4 eV with respect to experiment for a dataset of K-shell ionization energies, a level of accuracy that is competitive with more expensive many-body techniques. An empirical shifting procedure with one adjustable parameter reduces the average error below 0.2 eV. This shifted Slater transition method is a simple and practical way to compute core-level binding energies using only initial-state Kohn-Sham eigenvalues. It requires no more computational effort than ΔSCF and may be especially useful for simulating transient x-ray experiments where core-level spectroscopy is used to probe an excited electronic state, for which the ΔSCF approach requires a tedious state-by-state calculation of the spectrum. As an example, we use Slater-type methods to model x-ray emission spectroscopy.
Collapse
Affiliation(s)
- Subrata Jana
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, USA
| | - John M Herbert
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, USA
| |
Collapse
|
3
|
Hersh WH, Chan TY. Improving the accuracy of 31P NMR chemical shift calculations by use of scaling methods. Beilstein J Org Chem 2023; 19:36-56. [PMID: 36726479 PMCID: PMC9843238 DOI: 10.3762/bjoc.19.4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Accepted: 12/14/2022] [Indexed: 01/12/2023] Open
Abstract
Calculation of 31P NMR chemical shifts for a series of tri- and tetracoordinate phosphorus compounds using several basis sets and density functional theory (DFT) functionals gave a modest fit to experimental chemical shifts, but an excellent linear fit when plotted against the experimental values. The resultant scaling methods were then applied to a variety of "large" compounds previously selected by Latypov et al. and a set of stereoisomeric and unusual compounds selected here. No one method was best for all structural types. For compounds that contain P-P bonds and P-C multiple bonds, the Latypov et al. method using the PBE0 functional was best (mean absolute deviation/root mean square deviation (MAD/RMSD) = 6.9/8.5 ppm and 6.6/8.2 ppm, respectively), but for the full set of compounds gave higher deviations (MAD/RMSD = 8.2/12.3 ppm), and failed by over 60 ppm for a three-membered phosphorus heterocycle. Use of the M06-2X functional for both the structural optimization and NMR chemical shift calculation was best overall for the compounds without P-C multiple bonds (MAD/RMSD = 5.4/7.1 ppm), but failed by 30-49 ppm for compounds having any P-C multiple-bond character. Failures of these magnitudes have not been reported previously for these widely used functionals. These failures were then used to screen a variety of recommended functionals, leading to better overall methods for calculation of these chemical shifts: optimization with the M06-2X functional and NMR calculation with the PBE0 or ωB97x-D functionals gave values for MAD/RMSD = 6.9/8.5 ppm and 6.8/9.1 ppm, respectively, over an experimental chemical shift range of -181 to 356 ppm. Due to the unexplained failures observed, we recommend use of more than one method when looking at novel structures.
Collapse
Affiliation(s)
- William H Hersh
- Department of Chemistry and Biochemistry, Queens College, Queens, NY 11367-1597, USA,Ph.D. Program in Chemistry, The Graduate Center of the City University of New York, New York, NY 10016, USA
| | - Tsz-Yeung Chan
- Department of Chemistry and Biochemistry, Queens College, Queens, NY 11367-1597, USA,Ph.D. Program in Chemistry, The Graduate Center of the City University of New York, New York, NY 10016, USA
| |
Collapse
|
4
|
Gray M, Bowling PE, Herbert JM. Systematic Evaluation of Counterpoise Correction in Density Functional Theory. J Chem Theory Comput 2022; 18:6742-6756. [PMID: 36251499 DOI: 10.1021/acs.jctc.2c00883] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A widespread belief persists that the Boys-Bernardi function counterpoise (CP) procedure "overcorrects" supramolecular interaction energies for the effects of basis-set superposition error. To the extent that this is true for correlated wave function methods, it is usually an artifact of low-quality basis sets. The question has not been considered systematically in the context of density functional theory, however, where basis-set convergence is generally less problematic. We present a systematic assessment of the CP procedure for a representative set of functionals and basis sets, considering both benchmark data sets of small dimers and larger supramolecular complexes. The latter include layered composite polymers with ∼150 atoms and ligand-protein models with ∼300 atoms. Provided that CP correction is used, we find that intermolecular interaction energies of nearly complete-basis quality can be obtained using only double-ζ basis sets. This is less expensive as compared to triple-ζ basis sets without CP correction. CP-corrected interaction energies are less sensitive to the presence of diffuse basis functions as compared to uncorrected energies, which is important because diffuse functions are expensive and often numerically problematic for large systems. Our results upend the conventional wisdom that CP "overcorrects" for basis-set incompleteness. In small basis sets, CP correction is mandatory in order to demonstrate that the results do not rest on error cancellation.
Collapse
Affiliation(s)
- Montgomery Gray
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio43210, United States
| | - Paige E Bowling
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio43210, United States.,Biophysics Graduate Program, The Ohio State University, Columbus, Ohio43210, United States
| | - John M Herbert
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio43210, United States.,Biophysics Graduate Program, The Ohio State University, Columbus, Ohio43210, United States
| |
Collapse
|
5
|
New pecJ-n (n = 1, 2) Basis Sets for High-Quality Calculations of Indirect Nuclear Spin–Spin Coupling Constants Involving 31P and 29Si: The Advanced PEC Method. Molecules 2022; 27:molecules27196145. [PMID: 36234706 PMCID: PMC9573013 DOI: 10.3390/molecules27196145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 09/14/2022] [Accepted: 09/16/2022] [Indexed: 11/21/2022] Open
Abstract
In this paper, we presented new J-oriented basis sets, pecJ-n (n = 1, 2), for phosphorus and silicon, purposed for the high-quality correlated calculations of the NMR spin–spin coupling constants involving these nuclei. The pecJ-n basis sets were generated using the modified version of the property-energy consistent (PEC) method, which was introduced in our earlier paper. The modifications applied to the original PEC procedure increased the overall accuracy and robustness of the generated basis sets in relation to the diversity of electronic systems. Our new basis sets were successfully tested on a great number of spin–spin coupling constants, involving phosphorus or/and silicon, calculated within the SOPPA(CCSD) method. In general, it was found that our new pecJ-1 and pecJ-2 basis sets are very efficient, providing the overall accuracy that can be characterized by MAEs of about 3.80 and 1.98 Hz, respectively, against the benchmark data obtained with a large dyall.aae4z+ basis set of quadruple-ζ quality.
Collapse
|
6
|
Rusakov YY, Rusakova IL. New pecS- n ( n = 1, 2) basis sets for quantum chemical calculations of the NMR chemical shifts of H, C, N, and O nuclei. J Chem Phys 2022; 156:244112. [DOI: 10.1063/5.0096907] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
This paper demonstrates the performance of our previously suggested property-energy consistent method on the example of the generation of effective basis sets, pecS-1 and pecS-2, suited for the calculation of hydrogen, carbon, nitrogen, and oxygen chemical shifts. The new basis sets were successfully approbated in the GIAO-DFT calculations of the chemical shifts of 35 molecules using six different functionals. The pecS-1 basis set demonstrated very good accuracy, which makes this small basis set an effective means for the large-scale computations. At the same time, the pecS-2 basis set also gave very accurate results, thus putting it on a par with the other commensurate basis sets suited for the chemical shifts calculations.
Collapse
Affiliation(s)
- Yuriy Yu. Rusakov
- A. E. Favorsky Irkutsk Institute of Chemistry, Siberian Branch of the Russian Academy of Sciences, Favorsky St. 1, 664033 Irkutsk, Russian Federation
| | - Irina L. Rusakova
- A. E. Favorsky Irkutsk Institute of Chemistry, Siberian Branch of the Russian Academy of Sciences, Favorsky St. 1, 664033 Irkutsk, Russian Federation
| |
Collapse
|
7
|
Nasir H. Population model of cardiovascular diseases: Global dynamics, sensitivity analysis, and optimal control. JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES 2022. [DOI: 10.1080/02522667.2021.2016983] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Hanis Nasir
- Faculty of Ocean Engineering Technology and Informatics, Universiti Malaysia Terengganu, 21030, Kuala Nerus, Terengganu, Malaysia
| |
Collapse
|
8
|
Strazdaite S, Roeters SJ, Sakalauskas A, Sneideris T, Kirschner J, Pedersen KB, Schiøtt B, Jensen F, Weidner T, Smirnovas V, Niaura G. Interaction of Amyloid-β-(1-42) Peptide and Its Aggregates with Lipid/Water Interfaces Probed by Vibrational Sum-Frequency Generation Spectroscopy. J Phys Chem B 2021; 125:11208-11218. [PMID: 34597059 DOI: 10.1021/acs.jpcb.1c04882] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In this study, we use surface-sensitive vibrational sum-frequency generation (VSFG) spectroscopy to investigate the interaction between model lipid monolayers and Aβ(1-42) in its monomeric and aggregated states. Combining VSFG with atomic force microscopy (AFM) and thioflavin T (ThT) fluorescence measurements, we found that only small aggregates with probably a β-hairpin-like structure adsorbed to the zwitterionic lipid monolayer (DOPC). In contrast, larger aggregates with an extended β-sheet structure adsorbed to a negatively charged lipid monolayer (DOPG). The adsorption of small, initially formed aggregates strongly destabilized both monolayers, but only the DOPC monolayer was completely disrupted. We showed that the intensity of the amide-II' band in achiral (SSP) and chiral (SPP) polarization combinations increased in time when Aβ(1-42) aggregates accumulated at the DOPG monolayer. Nevertheless, almost no adsorption of preformed mature fibrils to DOPG monolayers was detected. By performing spectral VSFG calculations, we revealed a clear correlation between the amide-II' signal and the degree of amyloid aggregates (e.g., oligomers or (proto)fibrils) of various Aβ(1-42) structures. The calculations showed that only structures with a significant amyloid β-sheet content have a strong amide-II' intensity, in line with previous Raman studies. The combination of the presented results substantiates the amide-II(') band as a legitimate amyloid marker.
Collapse
Affiliation(s)
- S Strazdaite
- Department of Organic Chemistry, Center for Physical Sciences and Technology, Sauletekio Ave. 3, Vilnius LT-10257, Lithuania
| | - S J Roeters
- Department of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - A Sakalauskas
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Sauletekio 7, LT-10257 Vilnius, Lithuania
| | - T Sneideris
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Sauletekio 7, LT-10257 Vilnius, Lithuania
| | - J Kirschner
- Institute of Solid State Physics, TU Wien, Wiedner Hauptstrasse 8-10, 1040 Vienna, Austria
| | - K B Pedersen
- Department of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - B Schiøtt
- Department of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - F Jensen
- Department of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - T Weidner
- Department of Chemistry, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark
| | - V Smirnovas
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Sauletekio 7, LT-10257 Vilnius, Lithuania
| | - G Niaura
- Department of Organic Chemistry, Center for Physical Sciences and Technology, Sauletekio Ave. 3, Vilnius LT-10257, Lithuania
| |
Collapse
|
9
|
Duan C, Chen S, Taylor MG, Liu F, Kulik HJ. Machine learning to tame divergent density functional approximations: a new path to consensus materials design principles. Chem Sci 2021; 12:13021-13036. [PMID: 34745533 PMCID: PMC8513898 DOI: 10.1039/d1sc03701c] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Accepted: 09/01/2021] [Indexed: 01/17/2023] Open
Abstract
Virtual high-throughput screening (VHTS) with density functional theory (DFT) and machine-learning (ML)-acceleration is essential in rapid materials discovery. By necessity, efficient DFT-based workflows are carried out with a single density functional approximation (DFA). Nevertheless, properties evaluated with different DFAs can be expected to disagree for cases with challenging electronic structure (e.g., open-shell transition-metal complexes, TMCs) for which rapid screening is most needed and accurate benchmarks are often unavailable. To quantify the effect of DFA bias, we introduce an approach to rapidly obtain property predictions from 23 representative DFAs spanning multiple families, “rungs” (e.g., semi-local to double hybrid) and basis sets on over 2000 TMCs. Although computed property values (e.g., spin state splitting and frontier orbital gap) differ by DFA, high linear correlations persist across all DFAs. We train independent ML models for each DFA and observe convergent trends in feature importance, providing DFA-invariant, universal design rules. We devise a strategy to train artificial neural network (ANN) models informed by all 23 DFAs and use them to predict properties (e.g., spin-splitting energy) of over 187k TMCs. By requiring consensus of the ANN-predicted DFA properties, we improve correspondence of computational lead compounds with literature-mined, experimental compounds over the typically employed single-DFA approach. Machine learning (ML)-based feature analysis reveals universal design rules regardless of density functional choices. Using the consensus among multiple functionals, we identify robust lead complexes in ML-accelerated chemical discovery.![]()
Collapse
Affiliation(s)
- Chenru Duan
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge MA 02139 USA +1-617-253-4584.,Department of Chemistry, Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Shuxin Chen
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge MA 02139 USA +1-617-253-4584.,Department of Chemistry, Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Michael G Taylor
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge MA 02139 USA +1-617-253-4584
| | - Fang Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge MA 02139 USA +1-617-253-4584
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge MA 02139 USA +1-617-253-4584
| |
Collapse
|
10
|
Mazzuca JW, Hanna MC, Loftus CL, Seymour SR. Theoretical description of the preferential hydrolytic deamination of cytosine over adenine. COMPUT THEOR CHEM 2021. [DOI: 10.1016/j.comptc.2021.113354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
11
|
Nandy A, Duan C, Taylor MG, Liu F, Steeves AH, Kulik HJ. Computational Discovery of Transition-metal Complexes: From High-throughput Screening to Machine Learning. Chem Rev 2021; 121:9927-10000. [PMID: 34260198 DOI: 10.1021/acs.chemrev.1c00347] [Citation(s) in RCA: 99] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Transition-metal complexes are attractive targets for the design of catalysts and functional materials. The behavior of the metal-organic bond, while very tunable for achieving target properties, is challenging to predict and necessitates searching a wide and complex space to identify needles in haystacks for target applications. This review will focus on the techniques that make high-throughput search of transition-metal chemical space feasible for the discovery of complexes with desirable properties. The review will cover the development, promise, and limitations of "traditional" computational chemistry (i.e., force field, semiempirical, and density functional theory methods) as it pertains to data generation for inorganic molecular discovery. The review will also discuss the opportunities and limitations in leveraging experimental data sources. We will focus on how advances in statistical modeling, artificial intelligence, multiobjective optimization, and automation accelerate discovery of lead compounds and design rules. The overall objective of this review is to showcase how bringing together advances from diverse areas of computational chemistry and computer science have enabled the rapid uncovering of structure-property relationships in transition-metal chemistry. We aim to highlight how unique considerations in motifs of metal-organic bonding (e.g., variable spin and oxidation state, and bonding strength/nature) set them and their discovery apart from more commonly considered organic molecules. We will also highlight how uncertainty and relative data scarcity in transition-metal chemistry motivate specific developments in machine learning representations, model training, and in computational chemistry. Finally, we will conclude with an outlook of areas of opportunity for the accelerated discovery of transition-metal complexes.
Collapse
Affiliation(s)
- Aditya Nandy
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Chenru Duan
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Michael G Taylor
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Fang Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Adam H Steeves
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
12
|
Rusakov YY, Rusakova IL. An efficient method for generating property-energy consistent basis sets. New pecJ- n ( n = 1, 2) basis sets for high-quality calculations of indirect nuclear spin-spin coupling constants involving 1H, 13C, 15N, and 19F nuclei. Phys Chem Chem Phys 2021; 23:14925-14939. [PMID: 34223856 DOI: 10.1039/d1cp01984h] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
This paper presents a new method of generating property-energy consistent (PEC) basis sets that can be applied to any arbitrary molecular property. The PEC method generates a basis set that is optimized for the molecular property under interest, providing the least possible total molecular energy. The main algorithm of the PEC approach involves Monte Carlo simulations to generate random exponents in the predetermined range. In this work, the PEC method is introduced in the example of generation of new pecJ-n (n = 1, 2) basis sets suited for high-quality correlated calculations of indirect nuclear spin-spin coupling constants involving the most popular NMR-active nuclei: 1H, 13C, 15N, and 19F.
Collapse
Affiliation(s)
- Yuriy Yu Rusakov
- A. E. Favorsky Irkutsk Institute of Chemistry, Siberian Branch of the Russian Academy of Sciences, Favorsky St. 1, 664033 Irkutsk, Russian Federation.
| | - Irina L Rusakova
- A. E. Favorsky Irkutsk Institute of Chemistry, Siberian Branch of the Russian Academy of Sciences, Favorsky St. 1, 664033 Irkutsk, Russian Federation.
| |
Collapse
|
13
|
Affiliation(s)
- Frank Jensen
- Department of Chemistry Aarhus University DK-8000 Aarhus Denmark
| |
Collapse
|
14
|
Duan C, Liu F, Nandy A, Kulik HJ. Putting Density Functional Theory to the Test in Machine-Learning-Accelerated Materials Discovery. J Phys Chem Lett 2021; 12:4628-4637. [PMID: 33973793 DOI: 10.1021/acs.jpclett.1c00631] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Accelerated discovery with machine learning (ML) has begun to provide the advances in efficiency needed to overcome the combinatorial challenge of computational materials design. Nevertheless, ML-accelerated discovery both inherits the biases of training data derived from density functional theory (DFT) and leads to many attempted calculations that are doomed to fail. Many compelling functional materials and catalytic processes involve strained chemical bonds, open-shell radicals and diradicals, or metal-organic bonds to open-shell transition-metal centers. Although promising targets, these materials present unique challenges for electronic structure methods and combinatorial challenges for their discovery. In this Perspective, we describe the advances needed in accuracy, efficiency, and approach beyond what is typical in conventional DFT-based ML workflows. These challenges have begun to be addressed through ML models trained to predict the results of multiple methods or the differences between them, enabling quantitative sensitivity analysis. For DFT to be trusted for a given data point in a high-throughput screen, it must pass a series of tests. ML models that predict the likelihood of calculation success and detect the presence of strong correlation will enable rapid diagnoses and adaptation strategies. These "decision engines" represent the first steps toward autonomous workflows that avoid the need for expert determination of the robustness of DFT-based materials discoveries.
Collapse
Affiliation(s)
- Chenru Duan
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Fang Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Aditya Nandy
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
15
|
Affiliation(s)
- Heather J. Kulik
- Department of Chemical Engineering Massachusetts Institute of Technology 77 Massachusetts Ave Rm 66–464 Cambridge MA 02139 USA
| |
Collapse
|
16
|
Howe PWA. Recent developments in the use of fluorine NMR in synthesis and characterisation. PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY 2020; 118-119:1-9. [PMID: 32883447 DOI: 10.1016/j.pnmrs.2020.02.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 02/14/2020] [Accepted: 02/20/2020] [Indexed: 06/11/2023]
Abstract
A review of developments in fluorine NMR of relevance to synthesis, characterisation and industrial applications of small organic molecules. Developments considered include those in spectrometer technology, computational methods and pulse sequences. The review of 80 references outlines applications in areas of identification, quantitation, mixture analysis, reaction monitoring, environmental studies and fragment-based drug design.
Collapse
Affiliation(s)
- Peter W A Howe
- Syngenta, Jealott's Hill Research Centre, Bracknell, Berkshire RG42 6EY, UK.
| |
Collapse
|
17
|
Pernot P, Savin A. Probabilistic performance estimators for computational chemistry methods: Systematic improvement probability and ranking probability matrix. II. Applications. J Chem Phys 2020; 152:164109. [DOI: 10.1063/5.0006204] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Affiliation(s)
- Pascal Pernot
- Institut de Chimie Physique, UMR8000, CNRS, Université Paris-Saclay, 91405 Orsay, France
| | - Andreas Savin
- Laboratoire de Chimie Théorique, CNRS and UPMC Université Paris 06, Sorbonne Universités, 75252 Paris, France
| |
Collapse
|
18
|
Schnack-Petersen AK, Simmermacher M, Fasshauer E, Jensen HJA, Sauer SPA. The Second-Order-Polarization-Propagator-Approximation (SOPPA) in a four-component spinor basis. J Chem Phys 2020; 152:134113. [DOI: 10.1063/5.0002389] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
| | - Mats Simmermacher
- School of Chemistry, University of Edinburgh, Edinburgh, United Kingdom
| | - Elke Fasshauer
- Department of Physics and Astronomy, Aarhus University, Aarhus, Denmark
| | - Hans Jørgen Aa. Jensen
- Department of Physics, Chemistry and Pharmacy, University of Southern Denmark, Odense, Denmark
| | | |
Collapse
|
19
|
Liu F, Kulik HJ. Impact of Approximate DFT Density Delocalization Error on Potential Energy Surfaces in Transition Metal Chemistry. J Chem Theory Comput 2019; 16:264-277. [DOI: 10.1021/acs.jctc.9b00842] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Affiliation(s)
- Fang Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J. Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
20
|
Garcı́a JS, Brémond É, Campetella M, Ciofini I, Adamo C. Small Basis Set Allowing the Recovery of Dispersion Interactions with Double-Hybrid Functionals. J Chem Theory Comput 2019; 15:2944-2953. [DOI: 10.1021/acs.jctc.8b01203] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Juan Sanz Garcı́a
- Chimie ParisTech, PSL Research University, CNRS, Institute of Chemistry for Life and Health Sciences, 11, rue Pierre et Marie Curie, F-75005 Paris, France
| | - Éric Brémond
- Univ Paris Diderot, Sorbonne Paris Cité, ITODYS, UMR CNRS 7086, 15 rue J.-A. de Baïf, F-75013 Paris, France
| | - Marco Campetella
- Chimie ParisTech, PSL Research University, CNRS, Institute of Chemistry for Life and Health Sciences, 11, rue Pierre et Marie Curie, F-75005 Paris, France
| | - Ilaria Ciofini
- Chimie ParisTech, PSL Research University, CNRS, Institute of Chemistry for Life and Health Sciences, 11, rue Pierre et Marie Curie, F-75005 Paris, France
| | - Carlo Adamo
- Chimie ParisTech, PSL Research University, CNRS, Institute of Chemistry for Life and Health Sciences, 11, rue Pierre et Marie Curie, F-75005 Paris, France
- Institut Universitaire de France, 103 Boulevard Saint Michel, F-75005 Paris, France
| |
Collapse
|
21
|
Kuznetsov NY, Malishev VI, Medvedev MG, Bubnov YN. DFT and experimental study of triallylborane-mediated isomerization of α-allylated azaheterocycles. MENDELEEV COMMUNICATIONS 2019. [DOI: 10.1016/j.mencom.2019.03.025] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
22
|
Benchmark DFT studies on C–CN homolytic cleavage and screening the substitution effect on bond dissociation energy. J Mol Model 2019; 25:47. [DOI: 10.1007/s00894-019-3930-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2018] [Accepted: 01/09/2019] [Indexed: 12/28/2022]
|
23
|
Caballero-García G, Mondragón-Solórzano G, Torres-Cadena R, Díaz-García M, Sandoval-Lira J, Barroso-Flores J. Calculation of VS,max and Its Use as a Descriptor for the Theoretical Calculation of p Ka Values for Carboxylic Acids. Molecules 2018; 24:molecules24010079. [PMID: 30587832 PMCID: PMC6337188 DOI: 10.3390/molecules24010079] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2018] [Revised: 12/21/2018] [Accepted: 12/25/2018] [Indexed: 01/30/2023] Open
Abstract
The theoretical calculation of pKa values for Brønsted acids is a challenging task that involves sophisticated and time-consuming methods. Therefore, heuristic approaches are efficient and appealing methodologies to approximate these values. Herein, we used the maximum surface electrostatic potential (VS,max) on the acidic hydrogen atoms of carboxylic acids to describe the H-bond interaction with water (the same descriptor that is used to characterize σ-bonded complexes) and correlate the results with experimental pKa values to obtain a predictive model for other carboxylic acids. We benchmarked six different methods, all including an implicit solvation model (water): Five density functionals and the Møller–Plesset second order perturbation theory in combination with six different basis sets for a total of thirty-six levels of theory. The ωB97X-D/cc-pVDZ level of theory stood out as the best one for consistently reproducing the reported pKa values, with a predictive power of 98% correlation in a test set of ten other carboxylic acids.
Collapse
Affiliation(s)
- Guillermo Caballero-García
- Centro Conjunto de Investigación en Química Sustentable UAEM-UNAM, Unidad San Cayetano, Carretera Toluca⁻Atlacomulco km 14.5, Personal de la UNAM, Toluca 50200, Mexico.
| | - Gustavo Mondragón-Solórzano
- Centro Conjunto de Investigación en Química Sustentable UAEM-UNAM, Unidad San Cayetano, Carretera Toluca⁻Atlacomulco km 14.5, Personal de la UNAM, Toluca 50200, Mexico.
| | - Raúl Torres-Cadena
- Centro Conjunto de Investigación en Química Sustentable UAEM-UNAM, Unidad San Cayetano, Carretera Toluca⁻Atlacomulco km 14.5, Personal de la UNAM, Toluca 50200, Mexico.
| | - Marco Díaz-García
- Centro Conjunto de Investigación en Química Sustentable UAEM-UNAM, Unidad San Cayetano, Carretera Toluca⁻Atlacomulco km 14.5, Personal de la UNAM, Toluca 50200, Mexico.
| | - Jacinto Sandoval-Lira
- Centro Conjunto de Investigación en Química Sustentable UAEM-UNAM, Unidad San Cayetano, Carretera Toluca⁻Atlacomulco km 14.5, Personal de la UNAM, Toluca 50200, Mexico.
| | - Joaquín Barroso-Flores
- Centro Conjunto de Investigación en Química Sustentable UAEM-UNAM, Unidad San Cayetano, Carretera Toluca⁻Atlacomulco km 14.5, Personal de la UNAM, Toluca 50200, Mexico.
| |
Collapse
|