1
|
Kwak B, Park J, Kang T, Jo J, Lee B, Yoon S. GeoT: A Geometry-Aware Transformer for Reliable Molecular Property Prediction and Chemically Interpretable Representation Learning. ACS OMEGA 2023; 8:39759-39769. [PMID: 37901490 PMCID: PMC10601421 DOI: 10.1021/acsomega.3c05753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Accepted: 09/21/2023] [Indexed: 10/31/2023]
Abstract
In recent years, molecular representation learning has emerged as a key area of focus in various chemical tasks. However, many existing models fail to fully consider the geometric information on molecular structures, resulting in less intuitive representations. Moreover, the widely used message passing mechanism is limited to providing the interpretation of experimental results from a chemical perspective. To address these challenges, we introduce a novel transformer-based framework for molecular representation learning, named the geometry-aware transformer (GeoT). The GeoT learns molecular graph structures through attention-based mechanisms specifically designed to offer reliable interpretability as well as molecular property prediction. Consequently, the GeoT can generate attention maps of the interatomic relationships associated with training objectives. In addition, the GeoT demonstrates performance comparable to that of MPNN-based models while achieving reduced computational complexity. Our comprehensive experiments, including an empirical simulation, reveal that the GeoT effectively learns chemical insights into molecular structures, bridging the gap between artificial intelligence and molecular sciences.
Collapse
Affiliation(s)
- Bumju Kwak
- Recommendation
Team, Kakao Corporation, Gyeonggi 13529, Republic of Korea
| | - Jiwon Park
- LG
Chem, Seoul 07795, Republic
of Korea
- Interdisciplinary
Program in Artificial Intelligence, Seoul
National University, Seoul 08826, Republic
of Korea
| | - Taewon Kang
- Department
of Materials Science and Engineering, Korea
Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea
| | - Jeonghee Jo
- Institute
of New Media and Communications, Seoul National
University, Seoul 08826, Republic
of Korea
| | - Byunghan Lee
- Department
of Electronic Engineering, Seoul National
University of Science and Technology, Seoul 01811, Republic of Korea
| | - Sungroh Yoon
- Interdisciplinary
Program in Artificial Intelligence, Seoul
National University, Seoul 08826, Republic
of Korea
- Department
of Electrical and Computer Engineering, Seoul National University, Seoul 08826, Republic
of Korea
- Artificial
Intelligence Institute, Seoul National University, Seoul 08826, Republic of Korea
| |
Collapse
|
2
|
Alabugin IV, Kuhn L, Krivoshchapov NV, Mehaffy P, Medvedev MG. Anomeric effect, hyperconjugation and electrostatics: lessons from complexity in a classic stereoelectronic phenomenon. Chem Soc Rev 2021; 50:10212-10252. [PMID: 34542133 DOI: 10.1039/d1cs00564b] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Understanding the interplay of multiple components (steric, electrostatic, stereoelectronic, dispersive, etc.) that define the overall energy, structure, and reactivity of organic molecules can be a daunting task. The task becomes even more difficult when multiple approaches based on different physical premises disagree in their analysis of a multicomponent molecular system. Herein, we will use a classic conformational "oddity", the anomeric effect, to discuss the value of identifying the key contributors to reactivity that can guide chemical predictions. After providing the background related to the relevant types of hyperconjugation and a brief historic outline of the origins of the anomeric effect, we outline variations of its patterns and provide illustrative examples for the role of the anomeric effect in structure, stability, and spectroscopic properties. We show that the complete hyperconjugative model remains superior in explaining the interplay between structure and reactivity. We will use recent controversies regarding the origin of the anomeric effect to start a deeper discussion relevant to any electronic effect. Why are such questions inherently controversial? How to describe a complex quantum system using a model that is "as simple as possible, but no simpler"? What is a fair test for such a model? Perhaps, instead of asking "who is right and who is wrong?" one should ask "why do we disagree?". Stereoelectronic thinking can reconcile quantum complexity with chemical intuition and build the conceptual bridge between structure and reactivity. Even when many factors contribute to the observed structural and conformational trends, electron delocalization is a dominating force when the electronic demand is high (i.e., bonds are breaking as molecules distort from their equilibrium geometries). In these situations, the role of orbital interactions increases to the extent where they can define reactivity. For example, negative hyperconjugation can unleash the "underutilized" stereoelectronic power of unshared electrons (i.e., the lone pairs) to stabilize a developing positive charge at an anomeric carbon. This analysis paves the way for the broader discussion of the omnipresent importance of negative hyperconjugation in oxygen-containing functional groups. From that point of view, the stereoelectronic component of the anomeric effect plays a unique role in guiding reaction design.
Collapse
Affiliation(s)
- Igor V Alabugin
- Department of Chemistry and Biochemistry, Florida State University, USA.
| | - Leah Kuhn
- Department of Chemistry and Biochemistry, Florida State University, USA.
| | - Nikolai V Krivoshchapov
- N. D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, 47 Leninsky prosp., 119991 Moscow, Russian Federation. .,Lomonosov Moscow State University, Leninskie Gory 1 (3), Moscow, 119991, Russian Federation
| | - Patricia Mehaffy
- Department of Chemistry and Biochemistry, Florida State University, USA.
| | - Michael G Medvedev
- N. D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, 47 Leninsky prosp., 119991 Moscow, Russian Federation. .,A. N. Nesmeyanov Institute of Organoelement Compounds, Russian Academy of Sciences, 28 Vavilova St., 119991 Moscow, Russian Federation
| |
Collapse
|
3
|
Alabugin IV, dos Passos Gomes G, Abdo MA. Hyperconjugation. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2018. [DOI: 10.1002/wcms.1389] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
|
4
|
Kramer N, Jöst C, Mackenroth A, Greb L. Synthesis of Electron-Rich, Planarized Silicon(IV) Species and a Theoretical Analysis of Dimerizing Aminosilanes. Chemistry 2017; 23:17764-17774. [DOI: 10.1002/chem.201703649] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2017] [Indexed: 12/31/2022]
Affiliation(s)
- Nina Kramer
- Anorganisch-Chemisches Institut; Ruprecht-Karls-Universität Heidelberg; Im Neuenheimer Feld 270 69120 Heidelberg Germany
| | - Christoph Jöst
- Anorganisch-Chemisches Institut; Ruprecht-Karls-Universität Heidelberg; Im Neuenheimer Feld 270 69120 Heidelberg Germany
| | - Alexandra Mackenroth
- Anorganisch-Chemisches Institut; Ruprecht-Karls-Universität Heidelberg; Im Neuenheimer Feld 270 69120 Heidelberg Germany
| | - Lutz Greb
- Anorganisch-Chemisches Institut; Ruprecht-Karls-Universität Heidelberg; Im Neuenheimer Feld 270 69120 Heidelberg Germany
| |
Collapse
|
5
|
Cieplak AS. Protein folding, misfolding and aggregation: The importance of two-electron stabilizing interactions. PLoS One 2017; 12:e0180905. [PMID: 28922400 PMCID: PMC5603215 DOI: 10.1371/journal.pone.0180905] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2017] [Accepted: 06/22/2017] [Indexed: 12/17/2022] Open
Abstract
Proteins associated with neurodegenerative diseases are highly pleiomorphic and may adopt an all-α-helical fold in one environment, assemble into all-β-sheet or collapse into a coil in another, and rapidly polymerize in yet another one via divergent aggregation pathways that yield broad diversity of aggregates’ morphology. A thorough understanding of this behaviour may be necessary to develop a treatment for Alzheimer’s and related disorders. Unfortunately, our present comprehension of folding and misfolding is limited for want of a physicochemical theory of protein secondary and tertiary structure. Here we demonstrate that electronic configuration and hyperconjugation of the peptide amide bonds ought to be taken into account to advance such a theory. To capture the effect of polarization of peptide linkages on conformational and H-bonding propensity of the polypeptide backbone, we introduce a function of shielding tensors of the Cα atoms. Carrying no information about side chain-side chain interactions, this function nonetheless identifies basic features of the secondary and tertiary structure, establishes sequence correlates of the metamorphic and pH-driven equilibria, relates binding affinities and folding rate constants to secondary structure preferences, and manifests common patterns of backbone density distribution in amyloidogenic regions of Alzheimer’s amyloid β and tau, Parkinson’s α-synuclein and prions. Based on those findings, a split-intein like mechanism of molecular recognition is proposed to underlie dimerization of Aβ, tau, αS and PrPC, and divergent pathways for subsequent association of dimers are outlined; a related mechanism is proposed to underlie formation of PrPSc fibrils. The model does account for: (i) structural features of paranuclei, off-pathway oligomers, non-fibrillar aggregates and fibrils; (ii) effects of incubation conditions, point mutations, isoform lengths, small-molecule assembly modulators and chirality of solid-liquid interface on the rate and morphology of aggregation; (iii) fibril-surface catalysis of secondary nucleation; and (iv) self-propagation of infectious strains of mammalian prions.
Collapse
Affiliation(s)
- Andrzej Stanisław Cieplak
- Department of Chemistry, Bilkent University, Ankara, Turkey
- Department of Chemistry, Yale University, New Haven, Connecticut, United States of America
- Department of Chemistry, Brandeis University, Waltham, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
6
|
Moncea O, Gunawan MA, Poinsot D, Cattey H, Becker J, Yurchenko RI, Butova ED, Hausmann H, Šekutor M, Fokin AA, Hierso JC, Schreiner PR. Defying Stereotypes with Nanodiamonds: Stable Primary Diamondoid Phosphines. J Org Chem 2016; 81:8759-8769. [PMID: 27560114 DOI: 10.1021/acs.joc.6b01219] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Oana Moncea
- Institut
de Chimie Moléculaire de l’Université de Bourgogne
(ICMUB), UMR-CNRS 6302, Université de Bourgogne Franche-Comté, 9 Avenue Alain Savary, 21078 Dijon, France
- Institut
für Organische Chemie, Justus-Liebig-Universität, Heinrich-Buff-Ring 17, 35392 Giessen, Germany
| | - Maria A. Gunawan
- Institut
de Chimie Moléculaire de l’Université de Bourgogne
(ICMUB), UMR-CNRS 6302, Université de Bourgogne Franche-Comté, 9 Avenue Alain Savary, 21078 Dijon, France
- Institut
für Organische Chemie, Justus-Liebig-Universität, Heinrich-Buff-Ring 17, 35392 Giessen, Germany
| | - Didier Poinsot
- Institut
de Chimie Moléculaire de l’Université de Bourgogne
(ICMUB), UMR-CNRS 6302, Université de Bourgogne Franche-Comté, 9 Avenue Alain Savary, 21078 Dijon, France
| | - Hélène Cattey
- Institut
de Chimie Moléculaire de l’Université de Bourgogne
(ICMUB), UMR-CNRS 6302, Université de Bourgogne Franche-Comté, 9 Avenue Alain Savary, 21078 Dijon, France
| | - Jonathan Becker
- Institut
für Anorganische und Analytische Chemie, Justus-Liebig-Universität, Heinrich-Buff-Ring 17, 35392 Giessen, Germany
| | - Raisa I. Yurchenko
- Department
of Organic Chemistry, Kiev Polytechnic Institute, Pr. Pobedy 37, 03056 Kiev, Ukraine
| | - Ekaterina D. Butova
- Department
of Organic Chemistry, Kiev Polytechnic Institute, Pr. Pobedy 37, 03056 Kiev, Ukraine
| | - Heike Hausmann
- Institut
für Organische Chemie, Justus-Liebig-Universität, Heinrich-Buff-Ring 17, 35392 Giessen, Germany
| | - Marina Šekutor
- Institut
für Organische Chemie, Justus-Liebig-Universität, Heinrich-Buff-Ring 17, 35392 Giessen, Germany
| | - Andrey A. Fokin
- Institut
für Organische Chemie, Justus-Liebig-Universität, Heinrich-Buff-Ring 17, 35392 Giessen, Germany
- Department
of Organic Chemistry, Kiev Polytechnic Institute, Pr. Pobedy 37, 03056 Kiev, Ukraine
| | - Jean-Cyrille Hierso
- Institut
de Chimie Moléculaire de l’Université de Bourgogne
(ICMUB), UMR-CNRS 6302, Université de Bourgogne Franche-Comté, 9 Avenue Alain Savary, 21078 Dijon, France
- Institut Universitaire de France (IUF), 103 Bd. Saint Michel, 75005 Paris Cedex 5, France
| | - Peter R. Schreiner
- Institut
für Organische Chemie, Justus-Liebig-Universität, Heinrich-Buff-Ring 17, 35392 Giessen, Germany
| |
Collapse
|