1
|
Li T, Huls NJ, Lu S, Hou P. Unsupervised manifold embedding to encode molecular quantum information for supervised learning of chemical data. Commun Chem 2024; 7:133. [PMID: 38862828 PMCID: PMC11166954 DOI: 10.1038/s42004-024-01217-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Accepted: 06/03/2024] [Indexed: 06/13/2024] Open
Abstract
Molecular representation is critical in chemical machine learning. It governs the complexity of model development and the fulfillment of training data to avoid either over- or under-fitting. As electronic structures and associated attributes are the root cause for molecular interactions and their manifested properties, we have sought to examine the local electron information on a molecular manifold to understand and predict molecular interactions. Our efforts led to the development of a lower-dimensional representation of a molecular manifold, Manifold Embedding of Molecular Surface (MEMS), to embody surface electronic quantities. By treating a molecular surface as a manifold and computing its embeddings, the embedded electronic attributes retain the chemical intuition of molecular interactions. MEMS can be further featurized as input for chemical learning. Our solubility prediction with MEMS demonstrated the feasibility of both shallow and deep learning by neural networks, suggesting that MEMS is expressive and robust against dimensionality reduction.
Collapse
Affiliation(s)
- Tonglei Li
- Deparment of Industrial and Molecular Pharmaceutics, Purdue University, West Lafayette, 47907, IN, USA.
| | - Nicholas J Huls
- Deparment of Industrial and Molecular Pharmaceutics, Purdue University, West Lafayette, 47907, IN, USA
| | - Shan Lu
- Deparment of Industrial and Molecular Pharmaceutics, Purdue University, West Lafayette, 47907, IN, USA
| | - Peng Hou
- Deparment of Industrial and Molecular Pharmaceutics, Purdue University, West Lafayette, 47907, IN, USA
| |
Collapse
|
2
|
Miranda-Quintana RA, Heidar-Zadeh F, Fias S, Chapman AEA, Liu S, Morell C, Gómez T, Cárdenas C, Ayers PW. Molecular interactions from the density functional theory for chemical reactivity: Interaction chemical potential, hardness, and reactivity principles. Front Chem 2022; 10:929464. [PMID: 35936089 PMCID: PMC9352952 DOI: 10.3389/fchem.2022.929464] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 06/27/2022] [Indexed: 11/13/2022] Open
Abstract
In the first paper of this series, the authors derived an expression for the interaction energy between two reagents in terms of the chemical reactivity indicators that can be derived from density functional perturbation theory. While negative interaction energies can explain reactivity, reactivity is often more simply explained using the “|dμ| big is good” rule or the maximum hardness principle. Expressions for the change in chemical potential (μ) and hardness when two reagents interact are derived. A partial justification for the maximum hardness principle is that the terms that appear in the interaction energy expression often reappear in the expression for the interaction hardness, but with opposite sign.
Collapse
Affiliation(s)
- Ramón Alain Miranda-Quintana
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, United States
- *Correspondence: Ramón Alain Miranda-Quintana, ; Tatiana Gómez, Carlos Cárdenas, ; Paul W. Ayers,
| | | | - Stijn Fias
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, ON, Canada
| | - Allison E. A. Chapman
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, ON, Canada
| | - Shubin Liu
- Research Computing Center, University of North Carolina, Chapel Hill, NC, United states
| | - Christophe Morell
- Université de Lyon, Universit́e Claude Bernard Lyon 1, Institut des Sciences Analytiques, UMR CNRS 5280, Villeurbanne Cedex, France
| | - Tatiana Gómez
- Theoretical and Computational Chemistry Center, Institute of Applied Chemical Sciences, Faculty of Engineering, Universidad Autonoma de Chile, Santiago, Chile
- *Correspondence: Ramón Alain Miranda-Quintana, ; Tatiana Gómez, Carlos Cárdenas, ; Paul W. Ayers,
| | - Carlos Cárdenas
- Departamento de Fisica, Facultad de Ciencias, Universidad de Chile, Santiago, Chile
- Centro para el desarrollo de la Nanociencias y Nanotecnologia, CEDENNA, Santiago, Chile
- *Correspondence: Ramón Alain Miranda-Quintana, ; Tatiana Gómez, Carlos Cárdenas, ; Paul W. Ayers,
| | - Paul W. Ayers
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, ON, Canada
- *Correspondence: Ramón Alain Miranda-Quintana, ; Tatiana Gómez, Carlos Cárdenas, ; Paul W. Ayers,
| |
Collapse
|
3
|
Miranda-Quintana RA, Heidar-Zadeh F, Fias S, Chapman AEA, Liu S, Morell C, Gómez T, Cárdenas C, Ayers PW. Molecular Interactions From the Density Functional Theory for Chemical Reactivity: The Interaction Energy Between Two-Reagents. Front Chem 2022; 10:906674. [PMID: 35769444 PMCID: PMC9234655 DOI: 10.3389/fchem.2022.906674] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 04/19/2022] [Indexed: 12/13/2022] Open
Abstract
Reactivity descriptors indicate where a reagent is most reactive and how it is most likely to react. However, a reaction will only occur when the reagent encounters a suitable reaction partner. Determining whether a pair of reagents is well-matched requires developing reactivity rules that depend on both reagents. This can be achieved using the expression for the minimum-interaction-energy obtained from the density functional reactivity theory. Different terms in this expression will be dominant in different circumstances; depending on which terms control the reactivity, different reactivity indicators will be preferred.
Collapse
Affiliation(s)
- Ramón Alain Miranda-Quintana
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, United States
- *Correspondence: Ramón Alain Miranda-Quintana, ; Carlos Cárdenas, ; Paul W. Ayers, ; Tatiana Gómez,
| | | | - Stijn Fias
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, ON, Canada
| | - Allison E. A. Chapman
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, ON, Canada
| | - Shubin Liu
- Research Computing Center, University of North Carolina, Chapel Hill, NC, United States
| | - Christophe Morell
- Université de Lyon, Université Claude Bernard Lyon 1, Institut des Sciences Analytiques-UMR CNRS 5280, Villeurbanne, France
| | - Tatiana Gómez
- Theoretical and Computational Chemistry Center, Institute of Applied Chemical Sciences, Faculty of Engineering, Universidad Autonoma de Chile, Santiago, Chile
- *Correspondence: Ramón Alain Miranda-Quintana, ; Carlos Cárdenas, ; Paul W. Ayers, ; Tatiana Gómez,
| | - Carlos Cárdenas
- Departamento de Fisica, Facultad de Ciencias, Universidad de Chile, Santiago, Chile
- Centro para el desarrollo de la Nanociencias y Nanotecnologia, CEDENNA, Santiago, Chile
- *Correspondence: Ramón Alain Miranda-Quintana, ; Carlos Cárdenas, ; Paul W. Ayers, ; Tatiana Gómez,
| | - Paul W. Ayers
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, ON, Canada
- *Correspondence: Ramón Alain Miranda-Quintana, ; Carlos Cárdenas, ; Paul W. Ayers, ; Tatiana Gómez,
| |
Collapse
|
4
|
Welearegay MA, Balawender R, Holas A. Information and complexity measures in molecular reactivity studies. Phys Chem Chem Phys 2014; 16:14928-46. [DOI: 10.1039/c4cp01729c] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The usefulness of the information and complexity measure in molecular reactivity studies.
Collapse
Affiliation(s)
| | - Robert Balawender
- Institute of Physical Chemistry
- Polish Academy of Sciences
- PL-01-224 Warsaw, Poland
| | - Andrzej Holas
- Institute of Physical Chemistry
- Polish Academy of Sciences
- PL-01-224 Warsaw, Poland
| |
Collapse
|
5
|
Zhang M, Li T. Intermolecular interactions in organic crystals: gaining insight from electronic structure analysis by density functional theory. CrystEngComm 2014. [DOI: 10.1039/c4ce00411f] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Conceptual density functional theory is exploited to extend the HSAB (hard and soft acids and bases) principle for investigating the locality and regioselectivity of intermolecular interactions in organic crystals.
Collapse
Affiliation(s)
- Mingtao Zhang
- Industrial & Physical Pharmacy
- Purdue University
- West Lafayette, USA
| | - Tonglei Li
- Industrial & Physical Pharmacy
- Purdue University
- West Lafayette, USA
| |
Collapse
|
6
|
Balawender R, Welearegay MA, Lesiuk M, De Proft F, Geerlings P. Exploring Chemical Space with the Alchemical Derivatives. J Chem Theory Comput 2013; 9:5327-40. [PMID: 26592270 DOI: 10.1021/ct400706g] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
In this paper, we verify the usefulness of the alchemical derivatives in the prediction of chemical properties. We concentrate on the stability of the transmutation products, where the term "transmutation" means the change of the nuclear charge at an atomic site at constant number of electrons. As illustrative transmutations showing the potential of the method in exploring chemical space, we present some examples of increasing complexity starting with the deprotonation, continuing with the transmutation of the nitrogen molecule, and ending with the substitution of isoelectronic B-N units for C-C units and N units for C-H units in carbocyclic systems. The basis set influence on the qualitative and quantitative accuracies of the alchemical predictions was investigated. The alchemical deprotonation energy (from the second order Taylor expansion) correlates well with the vertical deprotonation energy and can be used as a preliminary indicator for the experimental deprotonation energy. The results of calculations for the BN derivatives of benzene and pyrene show that this method has great potential for efficient and accurate scanning of chemical space.
Collapse
Affiliation(s)
- Robert Balawender
- Institute of Physical Chemistry, Polish Academy of Sciences , Kasprzaka 44/52, PL-01-224 Warsaw, Poland
| | - Meressa A Welearegay
- Institute of Physical Chemistry, Polish Academy of Sciences , Kasprzaka 44/52, PL-01-224 Warsaw, Poland
| | - Michał Lesiuk
- Faculty of Chemistry, University of Warsaw , Pasteura 1, PL-02-093 Warsaw, Poland
| | - Frank De Proft
- Eenheid Algemene Chemie, Vrije Universiteit Brussel, Faculteit Wetenschappen , Pleinlaan 2, B-1050 Brussels, Belgium
| | - Paul Geerlings
- Eenheid Algemene Chemie, Vrije Universiteit Brussel, Faculteit Wetenschappen , Pleinlaan 2, B-1050 Brussels, Belgium
| |
Collapse
|
7
|
Méndez F, Alonso JA, Richaud A. Protophilicity index and protofelicity equalization principle: new measures of Brønsted-Lowry-Lewis acid-base interactions. J Mol Model 2013; 19:3961-7. [PMID: 23832651 DOI: 10.1007/s00894-013-1881-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2013] [Accepted: 05/05/2013] [Indexed: 11/26/2022]
Abstract
The simultaneous contributions of proton and electron transfer to the Brønsted-Lowry and Lewis acid-base properties of a set of p-substituted phenols are reported in this work. As a result of the analysis, a novel protophilicity index considered as the second-order energy change of a Brønsted-Lowry base as it is saturated with protons, a combined Brønsted-Lowry-Lewis acidity index (with a corresponding basicity index), and a protofelicity equalization principle (a parallel of the electronegativity equalization principle) are presented.
Collapse
Affiliation(s)
- Francisco Méndez
- Departamento de Química, División de Ciencias Básicas e Ingeniería, Universidad Autónoma Metropolitana-Iztapalapa, A.P. 55-534, México, DF, 09340, Mexico.
| | | | | |
Collapse
|
8
|
Affiliation(s)
- Pratim Kumar Chattaraj
- Department of Chemistry, Center for Theoretical Studies, Indian Institute of Technology, Kharagpur, India
| | | | | |
Collapse
|
9
|
Ayers PW, Liu S, Li T. Stability conditions for density functional reactivity theory: an interpretation of the total local hardness. Phys Chem Chem Phys 2011; 13:4427-33. [PMID: 21253639 DOI: 10.1039/c0cp01675f] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The second-order Taylor series expansions commonly used in the density functional chemical reactivity theory are used to define local stability conditions for electronic states. Systems which satisfy these conditions are stable to infinitesimal perturbations due to approaching chemical reagents. The basic formalism considered here supersedes previous variational approaches to chemical reactivity theory like the electrophilicity, potentialphilicity, and chargephilicity. The total local hardness emerges naturally in this analysis, and can be clearly interpreted. When the total local hardness is small, the system is relatively insensitive to perturbations. Furthermore, minus the total local hardness is an energetically favorable perturbation of the external potential.
Collapse
Affiliation(s)
- Paul W Ayers
- Department of Chemistry, McMaster University, Hamilton, Ontario L8S 4M1, Canada.
| | | | | |
Collapse
|
10
|
Li T, Zhou P, Mattei A. Electronic origin of pyridinyl N as a better hydrogen-bonding acceptor than carbonyl O. CrystEngComm 2011. [DOI: 10.1039/c1ce05967j] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
11
|
Liu S, Schauer CK, Pedersen LG. Molecular acidity: A quantitative conceptual density functional theory description. J Chem Phys 2010; 131:164107. [PMID: 19894927 DOI: 10.1063/1.3251124] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Accurate predictions of molecular acidity using ab initio and density functional approaches are still a daunting task. Using electronic and reactivity properties, one can quantitatively estimate pKa values of acids. In a recent paper [S. B. Liu and L. G. Pedersen, J. Phys. Chem. A 113, 3648 (2009)], we employed the molecular electrostatic potential (MEP) on the nucleus and the sum of valence natural atomic orbital (NAO) energies for the purpose. In this work, we reformulate these relationships on the basis of conceptual density functional theory and compare the results with those from the thermodynamic cycle method. We show that MEP and NAO properties of the dissociating proton of an acid should satisfy the same relationships with experimental pKa data. We employ 27 main groups and first to third row transition metal-water complexes as illustrative examples to numerically verify the validity of these strong linear correlations. Results also show that the accuracy of our approach and that of the conventional method through the thermodynamic cycle are statistically similar.
Collapse
Affiliation(s)
- Shubin Liu
- Research Computing Center, University of North Carolina, Chapel Hill, North Carolina 27599-3420, USA.
| | | | | |
Collapse
|
12
|
Roy RK, Saha S. Studies of regioselectivity of large molecular systems using DFT based reactivity descriptors. ACTA ACUST UNITED AC 2010. [DOI: 10.1039/b811052m] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|