1
|
Achar SK, Bernasconi L, Johnson JK. Machine Learning Electron Density Prediction Using Weighted Smooth Overlap of Atomic Positions. NANOMATERIALS (BASEL, SWITZERLAND) 2023; 13:1853. [PMID: 37368284 DOI: 10.3390/nano13121853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 05/29/2023] [Accepted: 06/11/2023] [Indexed: 06/28/2023]
Abstract
Having access to accurate electron densities in chemical systems, especially for dynamical systems involving chemical reactions, ion transport, and other charge transfer processes, is crucial for numerous applications in materials chemistry. Traditional methods for computationally predicting electron density data for such systems include quantum mechanical (QM) techniques, such as density functional theory. However, poor scaling of these QM methods restricts their use to relatively small system sizes and short dynamic time scales. To overcome this limitation, we have developed a deep neural network machine learning formalism, which we call deep charge density prediction (DeepCDP), for predicting charge densities by only using atomic positions for molecules and condensed phase (periodic) systems. Our method uses the weighted smooth overlap of atomic positions to fingerprint environments on a grid-point basis and map it to electron density data generated from QM simulations. We trained models for bulk systems of copper, LiF, and silicon; for a molecular system, water; and for two-dimensional charged and uncharged systems, hydroxyl-functionalized graphane, with and without an added proton. We showed that DeepCDP achieves prediction R2 values greater than 0.99 and mean squared error values on the order of 10-5e2 Å-6 for most systems. DeepCDP scales linearly with system size, is highly parallelizable, and is capable of accurately predicting the excess charge in protonated hydroxyl-functionalized graphane. We demonstrate how DeepCDP can be used to accurately track the location of charges (protons) by computing electron densities at a few selected grid points in the materials, thus significantly reducing the computational cost. We also show that our models can be transferable, allowing prediction of electron densities for systems on which it has not been trained but that contain a subset of atomic species on which it has been trained. Our approach can be used to develop models that span different chemical systems and train them for the study of large-scale charge transport and chemical reactions.
Collapse
Affiliation(s)
- Siddarth K Achar
- Computational Modeling & Simulation Program, University of Pittsburgh, Pittsburgh, PA 15260, USA
- Department of Chemical & Petroleum Engineering, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Leonardo Bernasconi
- Center for Research Computing and Department of Chemistry, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - J Karl Johnson
- Department of Chemical & Petroleum Engineering, University of Pittsburgh, Pittsburgh, PA 15261, USA
| |
Collapse
|
2
|
Oliveira BGD. Why much of Chemistry may be indisputably non-bonded? SEMINA: CIÊNCIAS EXATAS E TECNOLÓGICAS 2023. [DOI: 10.5433/1679-0375.2022v43n2p211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
In this compendium, the wide scope of all intermolecular interactions ever known has been revisited, in particular giving emphasis the capability of much of the elements of the periodic table to form non-covalent contacts. Either hydrogen bonds, dihydrogen bonds, halogen bonds, pnictogen bonds, chalcogen bonds, triel bonds, tetrel bonds, regium bonds, spodium bonds or even the aerogen bond interactions may be cited. Obviously that experimental techniques have been used in some works, but it was through the theoretical methods that these interactions were validate, wherein the QTAIM integrations and SAPT energy partitions have been useful in this regard. Therefore, the great goal concerns to elucidate the interaction strength and if the intermolecular system shall be total, partial or non-covalently bonded, wherein this last one encompasses the most majority of the intermolecular interactions what leading to affirm that chemistry is debatably non-bonded.
Collapse
|
3
|
King DS, Truhlar DG, Gagliardi L. Machine-Learned Energy Functionals for Multiconfigurational Wave Functions. J Phys Chem Lett 2021; 12:7761-7767. [PMID: 34374555 DOI: 10.1021/acs.jpclett.1c02042] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
We introduce multiconfiguration data-driven functional methods (MC-DDFMs), a group of methods which aim to correct the total or classical energy of a qualitatively accurate multiconfigurational wave function using a machine-learned functional of some featurization of the wave function such as its density, on-top density, or both. On a data set of carbene singlet-triplet energy splittings, we show that MC-DDFMs are able to achieve near-benchmark performance on systems not used for training with a robust degree of active-space independence. Beyond demonstrating that the density and on-top density hold the information necessary to correct the singlet-triplet energy splittings of multiconfigurational wave functions, this approach shows great promise for the development of functionals for MC-PDFT because corrections to the classical energy appear to be more transferable to types of molecules not included in the training data than corrections to total energies such as those yielded by CASSCF or NEVPT2.
Collapse
Affiliation(s)
- Daniel S King
- Department of Chemistry, University of Chicago, Chicago, Illinois, United States
| | - Donald G Truhlar
- Department of Chemistry, University of Minnesota, Minneapolis, Minnesota, United States
| | - Laura Gagliardi
- Department of Chemistry, Pritzker School of Molecular Engineering, James Franck Institute, Chicago Center for Theoretical Chemistry, University of Chicago, Chicago, Illinois, United States
| |
Collapse
|
4
|
Wieduwilt EK, Boisson JC, Terraneo G, Hénon E, Genoni A. A Step toward the Quantification of Noncovalent Interactions in Large Biological Systems: The Independent Gradient Model-Extremely Localized Molecular Orbital Approach. J Chem Inf Model 2021; 61:795-809. [PMID: 33444021 DOI: 10.1021/acs.jcim.0c01188] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The independent gradient model (IGM) is a recent electron density-based computational method that enables to detect and quantify covalent and noncovalent interactions. When applied to large systems, the original version of the technique still relies on promolecular electron densities given by the sum of spherically averaged atomic electron distributions, which leads to approximate evaluations of the inter- and intramolecular interactions occurring in systems of biological interest. To overcome this drawback and perform IGM analyses based on quantum mechanically rigorous electron densities also for macromolecular systems, we coupled the IGM approach with the recently constructed libraries of extremely localized molecular orbitals (ELMOs) that allow fast and reliable reconstructions of polypeptide and protein electron densities. The validation tests performed on small polypeptides and peptide dimers have shown that the novel IGM-ELMO strategy provides results that are systematically closer to the fully quantum mechanical ones and outperforms the IGM method based on the crude promolecular approximation, but still keeping a quite low computational cost. The results of the test calculations carried out on proteins have also confirmed the trends observed for the IGM analyses conducted on small systems. This makes us envisage the future application of the novel IGM-ELMO approach to unravel complicated noncovalent interaction networks (e.g., in protein-protein contacts) or to rationally design new drugs through molecular docking calculations and virtual high-throughput screenings.
Collapse
Affiliation(s)
- Erna K Wieduwilt
- Université de Lorraine & CNRS, Laboratoire de Physique et Chimie Théoriques, UMR CNRS 7019, 1 Boulevard Arago, Metz F-57078, France
| | - Jean-Charles Boisson
- CReSTIC EA 3804, Université de Reims Champagne-Ardenne, Moulin de la Housse, Reims Cedex 02 BP39, F-51687, France
| | - Giancarlo Terraneo
- Laboratory of Supramolecular and Bio-Nanomaterials (SupraBioNanoLab), Department of Chemistry, Materials and Chemical Engineering "Giulio Natta", Politecnico di Milano, Via L. Mancinelli 7, Milan I-20131, Italy
| | - Eric Hénon
- Institut de Chimie Moléculaire de Reims UMR CNRS 7312, Université de Reims Champagne-Ardenne, Moulin de la Housse, Reims Cedex 02 BP39, F-51687, France
| | - Alessandro Genoni
- Université de Lorraine & CNRS, Laboratoire de Physique et Chimie Théoriques, UMR CNRS 7019, 1 Boulevard Arago, Metz F-57078, France
| |
Collapse
|
5
|
Aggarwal A, Vinayak V, Bag S, Bhattacharyya C, Waghmare UV, Maiti PK. Predicting the DNA Conductance Using a Deep Feedforward Neural Network Model. J Chem Inf Model 2020; 61:106-114. [PMID: 33320660 DOI: 10.1021/acs.jcim.0c01072] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Double-stranded DNA (dsDNA) has been established as an efficient medium for charge migration, bringing it to the forefront of the field of molecular electronics and biological research. The charge migration rate is controlled by the electronic couplings between the two nucleobases of DNA/RNA. These electronic couplings strongly depend on the intermolecular geometry and orientation. Estimating these electronic couplings for all the possible relative geometries of molecules using the computationally demanding first-principles calculations requires a lot of time and computational resources. In this article, we present a machine learning (ML)-based model to calculate the electronic coupling between any two bases of dsDNA/dsRNA and bypass the computationally expensive first-principles calculations. Using the Coulomb matrix representation which encodes the atomic identities and coordinates of the DNA base pairs to prepare the input dataset, we train a feedforward neural network model. Our neural network (NN) model can predict the electronic couplings between dsDNA base pairs with any structural orientation with a mean absolute error (MAE) of less than 0.014 eV. We further use the NN-predicted electronic coupling values to compute the dsDNA/dsRNA conductance.
Collapse
Affiliation(s)
- Abhishek Aggarwal
- Center for Condensed Matter Theory, Department of Physics, Indian Institute of Science, Bangalore 560012, India
| | - Vinayak Vinayak
- Undergraduate Program, Indian Institute of Science, Bangalore 560012, India
| | - Saientan Bag
- Center for Condensed Matter Theory, Department of Physics, Indian Institute of Science, Bangalore 560012, India
| | - Chiranjib Bhattacharyya
- Department of Computer Science and Automation, Indian Institute of Science, Bangalore 560012, India
| | - Umesh V Waghmare
- Theoretical Sciences Unit, Jawaharlal Nehru Center for Advanced Scientific Research, Jakkur P.O., Bangalore 560064, India
| | - Prabal K Maiti
- Center for Condensed Matter Theory, Department of Physics, Indian Institute of Science, Bangalore 560012, India
| |
Collapse
|
6
|
Del Rio BG, Kuenneth C, Tran HD, Ramprasad R. An Efficient Deep Learning Scheme To Predict the Electronic Structure of Materials and Molecules: The Example of Graphene-Derived Allotropes. J Phys Chem A 2020; 124:9496-9502. [PMID: 33138367 DOI: 10.1021/acs.jpca.0c07458] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Computations based on density functional theory (DFT) are transforming various aspects of materials research and discovery. However, the effort required to solve the central equation of DFT, namely the Kohn-Sham equation, which remains a major obstacle for studying large systems with hundreds of atoms in a practical amount of time with routine computational resources. Here, we propose a deep learning architecture that systematically learns the input-output behavior of the Kohn-Sham equation and predicts the electronic density of states, a primary output of DFT calculations, with unprecedented speed and chemical accuracy. The algorithm also adapts and progressively improves in predictive power and versatility as it is exposed to new diverse atomic configurations. We demonstrate this capability for a diverse set of carbon allotropes spanning a large configurational and phase space. The electronic density of states, along with the electronic charge density, may be used downstream to predict a variety of materials properties, bypassing the Kohn-Sham equation, leading to an ultrafast and high-fidelity DFT emulator.
Collapse
Affiliation(s)
- Beatriz G Del Rio
- School of Materials Science and Engineering, Georgia Institute of Technology, 771 Ferst Drive NW, Atlanta, Georgia 30332, United States
| | - Christopher Kuenneth
- School of Materials Science and Engineering, Georgia Institute of Technology, 771 Ferst Drive NW, Atlanta, Georgia 30332, United States
| | - Huan Doan Tran
- School of Materials Science and Engineering, Georgia Institute of Technology, 771 Ferst Drive NW, Atlanta, Georgia 30332, United States
| | - Rampi Ramprasad
- School of Materials Science and Engineering, Georgia Institute of Technology, 771 Ferst Drive NW, Atlanta, Georgia 30332, United States
| |
Collapse
|