1
|
Esposito J, Kakar J, Khokhar T, Noll-Walker T, Omar F, Christen A, James Cleaves H, Sandora M. Comparing the complexity of written and molecular symbolic systems. Biosystems 2024; 244:105297. [PMID: 39154841 DOI: 10.1016/j.biosystems.2024.105297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2024] [Revised: 08/11/2024] [Accepted: 08/11/2024] [Indexed: 08/20/2024]
Abstract
Symbolic systems (SSs) are uniquely products of living systems, such that symbolism and life may be inextricably intertwined phenomena. Within a given SS, there is a range of symbol complexity over which signaling is functionally optimized. This range exists relative to a complex and potentially infinitely large background of latent, unused symbol space. Understanding how symbol sets sample this latent space is relevant to diverse fields including biochemistry and linguistics. We quantitatively explored the graphic complexity of two biosemiotic systems: genetically encoded amino acids (GEAAs) and written language. Molecular and graphical notions of complexity are highly correlated for GEAAs and written language. Symbol sets are generally neither minimally nor maximally complex relative to their latent spaces, but exist across an objectively definable distribution, with the GEAAs having especially low complexity. The selection pressures guiding these disparate systems are explicable by symbol production and disambiguation efficiency. These selection pressures may be universal, offer a quantifiable metric for comparison, and suggest that all life in the Universe may discover optimal symbol set complexity distributions with respect to their latent spaces. If so, the "complexity" of individual components of SSs may not be as strong a biomarker as symbol set complexity distribution.
Collapse
Affiliation(s)
- Julia Esposito
- Blue Marble Space Institute of Science, Seattle, WA, USA
| | - Jyotika Kakar
- Blue Marble Space Institute of Science, Seattle, WA, USA; Department of Computer Engineering, University of Mumbai, MH, India
| | - Tasneem Khokhar
- Blue Marble Space Institute of Science, Seattle, WA, USA; Department of Physics and Astronomy, University of California, Irvine, CA, USA
| | | | - Fatima Omar
- Blue Marble Space Institute of Science, Seattle, WA, USA; Jodrell Bank Centre for Astrophysics, The University of Manchester, Oxford Road, Manchester, M13 9PL, UK
| | - Anna Christen
- Blue Marble Space Institute of Science, Seattle, WA, USA
| | - H James Cleaves
- Department of Chemistry, Howard University, Washington, DC, 20059, USA; Blue Marble Space Institute of Science, Seattle, WA, USA; Earth-Life Science Institute, Tokyo Institute of Technology, Meguro-ku, Tokyo, Japan.
| | | |
Collapse
|
2
|
Ameta D, Behera L, Chakraborty A, Sandhan T. Predicting odor from vibrational spectra: a data-driven approach. Sci Rep 2024; 14:20321. [PMID: 39223164 PMCID: PMC11369114 DOI: 10.1038/s41598-024-70696-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Accepted: 08/20/2024] [Indexed: 09/04/2024] Open
Abstract
This study investigates olfaction, a complex and not well-understood sensory modality. The chemical mechanism behind smell can be described by so far proposed two theories: vibrational and docking theories. The vibrational theory has been gaining acceptance lately but needs more extensive validation. To fill this gap for the first time, we, with the help of data-driven classification, clustering, and Explainable AI techniques, systematically analyze a large dataset of vibrational spectra (VS) of 3018 molecules obtained from the atomistic simulation. The study utlizes image representations of VS using Gramian Angular Fields and Markov Transition Fields, allowing computer vision techniques to be applied for better feature extraction and improved odor classification. Furthermore, we fuse the PCA-reduced fingerprint features with image features, which show additional improvement in classification results. We use two clustering methods, agglomerative hierarchical (AHC) and k-means, on dimensionality reduced (UMAP, MDS, t-SNE, and PCA) VS and image features, which shed further insight into the connections between molecular structure, VS, and odor. Additionally, we contrast our method with an earlier work that employed traditional machine learning on fingerprint features for the same dataset, and demonstrate that even with a representative subset of 3018 molecules, our deep learning model outperforms previous results. This comprehensive and systematic analysis highlights the potential of deep learning in furthering the field of olfactory research while confirming the vibrational theory of olfaction.
Collapse
Affiliation(s)
- Durgesh Ameta
- Indian Knowledge System and Mental Health Applications Centre, Indian Institute of Technology, Mandi, 175005, India
- Indian Knowledge System Centre, ISS, Delhi, 110065, India
| | - Laxmidhar Behera
- Indian Knowledge System and Mental Health Applications Centre, Indian Institute of Technology, Mandi, 175005, India
- Department of Electrical Engineering, Indian Institute of Technology, Kanpur, 208016, India
| | | | - Tushar Sandhan
- Department of Electrical Engineering, Indian Institute of Technology, Kanpur, 208016, India.
| |
Collapse
|
3
|
He Y, Huang R, Zhang R, He F, Han L, Han W. PredCoffee: A binary classification approach specifically for coffee odor. iScience 2024; 27:110041. [PMID: 38868178 PMCID: PMC11167484 DOI: 10.1016/j.isci.2024.110041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 04/26/2024] [Accepted: 05/16/2024] [Indexed: 06/14/2024] Open
Abstract
Compared to traditional methods, using machine learning to assess or predict the odor of molecules can save costs in various aspects. Our research aims to collect molecules with coffee odor and summarize the regularity of these molecules, ultimately creating a binary classifier that can determine whether a molecule has a coffee odor. In this study, a total of 371 coffee-odor molecules and 9,700 non-coffee-odor molecules were collected. The Knowledge-guided Pre-training of Graph Transformer (KPGT), support vector machine (SVM), random forest (RF), multi-layer perceptron (MLP), and message-passing neural networks (MPNN) were used to train the data. The model with the best performance was selected as the basis of the predictor. The prediction accuracy value of the KPGT model exceeded 0.84 and the predictor has been deployed as a webserver PredCoffee.
Collapse
Affiliation(s)
- Yi He
- Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, 2699 Qianjin Street, Changchun 130012, China
| | - Ruirui Huang
- Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, 2699 Qianjin Street, Changchun 130012, China
| | - Ruoyu Zhang
- Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, 2699 Qianjin Street, Changchun 130012, China
| | - Fei He
- Department of Electrical Engineer and Computer Science, Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA
| | - Lu Han
- Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, 2699 Qianjin Street, Changchun 130012, China
| | - Weiwei Han
- Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, 2699 Qianjin Street, Changchun 130012, China
| |
Collapse
|
4
|
Nam SH, Lee J, Kim E, Koo JW, Shin Y, Hwang TM. Electronic tongue for the simple and rapid determination of taste and odor compounds in water. CHEMOSPHERE 2023; 338:139511. [PMID: 37478991 DOI: 10.1016/j.chemosphere.2023.139511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 06/25/2023] [Accepted: 07/13/2023] [Indexed: 07/23/2023]
Abstract
Taste and odor (T&O) compounds present in natural water bodies could originate from algae. In this study, alga-generated compounds that can cause T&O issues in water, such as geosmin (GE), 2-Methylisoborneol (MIB), 2,4,6-Trichloroanisole (TCA), 2-Methylbenzofuran (MB), 2-Isopropyl-3-methoxypyrazine (IPMP), 2-Isobutyl-3-methoxypyrazine (IBMP), cis-3-Hexenyl acetate (HA), trans,trans-2,4-Heptadienal (HD), trans,cis-2,6-Nonadienal (ND), and trans-2-Decenal (DN), were determined through solid-phase microextraction coupled with gas chromatography/mass spectrometry (HS-SPME GC/MS) and electronic tongue (E-tongue), and the results from the two techniques were compared. Although HS-SPME GC/MS facilitates the detection and quantification of T&O compounds with high precision and accuracy, the sample preparation and handling is difficult and the analysis time (1 h) is longer than those of other analytical methods. E-tongue can be used as an alternative analytical method for water quality analysis and risk management because it enables controlled and rapid analysis (3 min) of T&O compounds in water at a low cost. Notably, principal component analysis indicated that E-tongue can discriminate and quantify eight T&O compounds at as low as 0.02 μg L-1 concentration. Further, partial least squares analysis confirmed that the sensor exhibits high sensitivity to concentration changes. The sensors with the highest variable importance in projection scores were determined to be SCS (1.39 and 1.38) for GE and MIB, CTS (1.34) for IPMP, CPS (1.33) for IBMP, AHS (1.42) for HA, ANS (1.22) for HD, and NMS (1.14 and 1.19) for ND and DN.
Collapse
Affiliation(s)
- Sook-Hyun Nam
- Korea Institute of Civil Engineering and Building Technology, 283 Goyangdar-Ro, Ilsan-Gu, Goyang-Si, Gyeonggi-Do, 411-712, Republic of Korea.
| | - Juwon Lee
- Korea Institute of Civil Engineering and Building Technology, 283 Goyangdar-Ro, Ilsan-Gu, Goyang-Si, Gyeonggi-Do, 411-712, Republic of Korea; Korea University of Science & Technology, 217 Gajung-to Yuseong-gu, Daejeon, 305-333, Republic of Korea
| | - Eunju Kim
- Korea Institute of Civil Engineering and Building Technology, 283 Goyangdar-Ro, Ilsan-Gu, Goyang-Si, Gyeonggi-Do, 411-712, Republic of Korea
| | - Jae-Wuk Koo
- Korea Institute of Civil Engineering and Building Technology, 283 Goyangdar-Ro, Ilsan-Gu, Goyang-Si, Gyeonggi-Do, 411-712, Republic of Korea
| | - Yonghyun Shin
- Korea Institute of Civil Engineering and Building Technology, 283 Goyangdar-Ro, Ilsan-Gu, Goyang-Si, Gyeonggi-Do, 411-712, Republic of Korea
| | - Tae-Mun Hwang
- Korea Institute of Civil Engineering and Building Technology, 283 Goyangdar-Ro, Ilsan-Gu, Goyang-Si, Gyeonggi-Do, 411-712, Republic of Korea; Korea University of Science & Technology, 217 Gajung-to Yuseong-gu, Daejeon, 305-333, Republic of Korea.
| |
Collapse
|
5
|
Ribeiro SG, Martins C, Tavares T, Rudnitskaya A, Alves F, Rocha SM. Volatile Composition of Fortification Grape Spirit and Port Wine: Where Do We Stand? Foods 2023; 12:2432. [PMID: 37372643 DOI: 10.3390/foods12122432] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 05/15/2023] [Accepted: 06/13/2023] [Indexed: 06/29/2023] Open
Abstract
Port wine's prominence worldwide is unequivocal and the grape spirit, which comprises roughly one fifth of the total volume of this fortified wine, is also a contributor to the recognized quality of this beverage. Nonetheless, information about the influence of the grape spirit on the final aroma of Port wine, as well as its volatile composition, is extremely limited. Moreover, the aroma characteristics of Port wines are modulated mainly by their volatile profiles. Hence, this review presents a detailed overview of the volatile composition of the fortification spirit and Port wine, along with the methodologies employed for their characterization. Moreover, it gives a general overview of the Douro Demarcated Region (Portugal) and the relevance of fortification spirit to the production of Port wine. As far as we know, this review contains the most extensive database on the volatile composition of grape spirit and Port wine, corresponding to 23 and 208 compounds, respectively. To conclude, the global outlook and future challenges are addressed, with the position of the analytical coverage of the chemical data on volatile components discussed as crucial for the innovation centered on consumer preferences.
Collapse
Affiliation(s)
- Sónia Gomes Ribeiro
- Department of Chemistry & LAQV-REQUIMTE, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Cátia Martins
- Department of Chemistry & LAQV-REQUIMTE, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Tiago Tavares
- Department of Chemistry & LAQV-REQUIMTE, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Alisa Rudnitskaya
- Department of Chemistry & CESAM, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Fernando Alves
- Symington Family Estates, Vinhos S.A. Travessa Barão de Forrester, 86, 4400-034 Vila Nova de Gaia, Portugal
| | - Sílvia M Rocha
- Department of Chemistry & LAQV-REQUIMTE, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| |
Collapse
|
6
|
Rugard M, Audouze K, Tromelin A. Combining the Classification and Pharmacophore Approaches to Understand Homogeneous Olfactory Perceptions at Peripheral Level: Focus on Two Aroma Mixtures. Molecules 2023; 28:molecules28104028. [PMID: 37241770 DOI: 10.3390/molecules28104028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 04/20/2023] [Accepted: 05/03/2023] [Indexed: 05/28/2023] Open
Abstract
The mechanisms involved in the homogeneous perception of odorant mixtures remain largely unknown. With the aim of enhancing knowledge about blending and masking mixture perceptions, we focused on structure-odor relationships by combining the classification and pharmacophore approaches. We built a dataset of about 5000 molecules and their related odors and reduced the multidimensional space defined by 1014 fingerprints representing the structures to a tridimensional 3D space using uniform manifold approximation and projection (UMAP). The self-organizing map (SOM) classification was then performed using the 3D coordinates in the UMAP space that defined specific clusters. We explored the allocating in these clusters of the components of two aroma mixtures: a blended mixture (red cordial (RC) mixture, 6 molecules) and a masking binary mixture (isoamyl acetate/whiskey-lactone [IA/WL]). Focusing on clusters containing the components of the mixtures, we looked at the odor notes carried by the molecules belonging to these clusters and also at their structural features by pharmacophore modeling (PHASE). The obtained pharmacophore models suggest that WL and IA could have a common binding site(s) at the peripheral level, but that would be excluded for the components of RC. In vitro experiments will soon be carried out to assess these hypotheses.
Collapse
Affiliation(s)
- Marylène Rugard
- T3S, Inserm UMR S-1124, Université Paris Cité, F-75006 Paris, France
| | - Karine Audouze
- T3S, Inserm UMR S-1124, Université Paris Cité, F-75006 Paris, France
| | - Anne Tromelin
- Centre des Sciences du Goût et de l'Alimentation, CNRS, INRAE, Institut Agro, Université de Bourgogne, F-21000 Dijon, France
| |
Collapse
|
7
|
Schicker D, Singh S, Freiherr J, Grasskamp AT. OWSum: algorithmic odor prediction and insight into structure-odor relationships. J Cheminform 2023; 15:51. [PMID: 37150811 PMCID: PMC10164323 DOI: 10.1186/s13321-023-00722-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 04/16/2023] [Indexed: 05/09/2023] Open
Abstract
We derived and implemented a linear classification algorithm for the prediction of a molecule's odor, called Olfactory Weighted Sum (OWSum). Our approach relies solely on structural patterns of the molecules as features for algorithmic treatment and uses conditional probabilities combined with tf-idf values. In addition to the prediction of molecular odor, OWSum provides insights into properties of the dataset and allows to understand how algorithmic classifications are reached by quantitatively assigning structural patterns to odors. This provides chemists with an intuitive understanding of underlying interactions. To deal with ambiguities of the natural language used to describe odor, we introduced descriptor overlap as a metric for the quantification of semantic overlap between descriptors. Thus, grouping of descriptors and derivation of higher-level descriptors becomes possible. Our approach poses a large leap forward in our capabilities to understand and predict molecular features.
Collapse
Affiliation(s)
- Doris Schicker
- Sensory Analytics and Technologies, Fraunhofer Institute for Process Engineering and Packaging IVV, Giggenhauser Straße 35, 85354, Freising, Germany.
- Department of Psychiatry and Psychotherapy, Friedrich-Alexander-Universität Erlangen-Nürnberg, Schwabachanlage 6, 91054, Erlangen, Germany.
| | - Satnam Singh
- Sensory Analytics and Technologies, Fraunhofer Institute for Process Engineering and Packaging IVV, Giggenhauser Straße 35, 85354, Freising, Germany
- Department of Psychiatry and Psychotherapy, Friedrich-Alexander-Universität Erlangen-Nürnberg, Schwabachanlage 6, 91054, Erlangen, Germany
| | - Jessica Freiherr
- Sensory Analytics and Technologies, Fraunhofer Institute for Process Engineering and Packaging IVV, Giggenhauser Straße 35, 85354, Freising, Germany
- Department of Psychiatry and Psychotherapy, Friedrich-Alexander-Universität Erlangen-Nürnberg, Schwabachanlage 6, 91054, Erlangen, Germany
| | - Andreas T Grasskamp
- Sensory Analytics and Technologies, Fraunhofer Institute for Process Engineering and Packaging IVV, Giggenhauser Straße 35, 85354, Freising, Germany.
| |
Collapse
|
8
|
Ben Khemis I, Noureddine O, Aouaini F, Salamah M. Aljaloud A, Nasr S, Ben Lamine A. Indirect characterizations of mOR-EG: Modeling analysis of five concentration-olfactory response curves via an advanced monolayer adsorption model. Int J Biol Macromol 2022; 222:1277-1286. [DOI: 10.1016/j.ijbiomac.2022.09.251] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 09/18/2022] [Accepted: 09/27/2022] [Indexed: 11/05/2022]
|