1
|
Kneipp J, Seifert S, Gärber F. SERS microscopy as a tool for comprehensive biochemical characterization in complex samples. Chem Soc Rev 2024; 53:7641-7656. [PMID: 38934892 DOI: 10.1039/d4cs00460d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/28/2024]
Abstract
Surface enhanced Raman scattering (SERS) spectra of biomaterials such as cells or tissues can be used to obtain biochemical information from nanoscopic volumes in these heterogeneous samples. This tutorial review discusses the factors that determine the outcome of a SERS experiment in complex bioorganic samples. They are related to the SERS process itself, the possibility to selectively probe certain regions or constituents of a sample, and the retrieval of the vibrational information in order to identify molecules and their interaction. After introducing basic aspects of SERS experiments in the context of biocompatible environments, spectroscopy in typical microscopic settings is exemplified, including the possibilities to combine SERS with other linear and non-linear microscopic tools, and to exploit approaches that improve lateral and temporal resolution. In particular the great variation of data in a SERS experiment calls for robust data analysis tools. Approaches will be introduced that have been originally developed in the field of bioinformatics for the application to omics data and that show specific potential in the analysis of SERS data. They include the use of simulated data and machine learning tools that can yield chemical information beyond achieving spectral classification.
Collapse
Affiliation(s)
- Janina Kneipp
- Department of Chemistry, Humboldt-Universität zu Berlin, Brook-Taylor-Str. 2, 12489 Berlin, Germany.
| | - Stephan Seifert
- Hamburg School of Food Science, Department of Chemistry, Universität Hamburg, Grindelallee 117, 20146 Hamburg, Germany
| | - Florian Gärber
- Hamburg School of Food Science, Department of Chemistry, Universität Hamburg, Grindelallee 117, 20146 Hamburg, Germany
| |
Collapse
|
2
|
Hu Q, Wu HJ. Direct Glycan Analysis of Biological Samples and Intact Glycoproteins by Integrating Machine Learning-Driven Surface-Enhanced Raman Scattering and Boronic Acid Arrays. ACS MEASUREMENT SCIENCE AU 2024; 4:307-314. [PMID: 38910864 PMCID: PMC11191725 DOI: 10.1021/acsmeasuresciau.4c00014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 05/03/2024] [Accepted: 05/06/2024] [Indexed: 06/25/2024]
Abstract
Frequent monitoring of glycan patterns is a critical step in studying glycan-mediated cellular processes. However, the current glycan analysis tools are resource-intensive and less suitable for routine use in standard laboratories. We developed a novel glycan detection platform by integrating surface-enhanced Raman spectroscopy (SERS), boronic acid (BA) receptors, and machine learning tools. This sensor monitors the molecular fingerprint spectra of BA binding to cis-diol-containing glycans. Different types of BA receptors could yield different stereoselective reactions toward different glycans and exhibit unique vibrational spectra. By integration of the Raman spectra collected from different BA receptors, the structural information can be enriched, eventually improving the accuracy of glycan classification and quantification. Here, we established a SERS-based sensor incorporating multiple different BA receptors. This sensing platform could directly analyze the biological samples, including whole milk and intact glycoproteins (fetuin and asialofetuin), without tedious glycan release and purification steps. The results demonstrate the platform's ability to classify milk oligosaccharides with remarkable classification accuracy, despite the presence of other non-glycan constituents in the background. This sensor could also directly quantify sialylation levels of a fetuin/asialofetuin mixture without glycan release procedures. Moreover, by selecting appropriate BA receptors, the sensor exhibits an excellent performance of differentiating between α2,3 and α2,6 linkages of sialic acids. This low-cost, rapid, and highly accessible sensor will provide the scientific community with an invaluable tool for routine glycan screening in standard laboratories.
Collapse
Affiliation(s)
- Qiang Hu
- The Artie McFerrin Department
of Chemical Engineering, Texas A&M University, College Station, Texas 77843, United States
| | - Hung-Jen Wu
- The Artie McFerrin Department
of Chemical Engineering, Texas A&M University, College Station, Texas 77843, United States
| |
Collapse
|
3
|
Jang M, Bae G, Kwon YM, Cho JH, Lee DH, Kang S, Yim S, Myung S, Lim J, Lee SS, Song W, An K. Artificial Q-Grader: Machine Learning-Enabled Intelligent Olfactory and Gustatory Sensing System. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2308976. [PMID: 38582529 PMCID: PMC11186046 DOI: 10.1002/advs.202308976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 01/04/2024] [Indexed: 04/08/2024]
Abstract
Portable and personalized artificial intelligence (AI)-driven sensors mimicking human olfactory and gustatory systems have immense potential for the large-scale deployment and autonomous monitoring systems of Internet of Things (IoT) devices. In this study, an artificial Q-grader comprising surface-engineered zinc oxide (ZnO) thin films is developed as the artificial nose, tongue, and AI-based statistical data analysis as the artificial brain for identifying both aroma and flavor chemicals in coffee beans. A poly(vinylidene fluoride-co-hexafluoropropylene)/ZnO thin film transistor (TFT)-based liquid sensor is the artificial tongue, and an Au, Ag, or Pd nanoparticles/ZnO nanohybrid gas sensor is the artificial nose. In order to classify the flavor of coffee beans (acetic acid (sourness), ethyl butyrate and 2-furanmethanol (sweetness), caffeine (bitterness)) and the origin of coffee beans (Papua New Guinea, Brazil, Ethiopia, and Colombia-decaffeine), rational combination of TFT transfer and dynamic response curves capture the liquids and gases-dependent electrical transport behavior and principal component analysis (PCA)-assisted machine learning (ML) is implemented. A PCA-assisted ML model distinguished the four target flavors with >92% prediction accuracy. ML-based regression model predicts the flavor chemical concentrations with >99% accuracy. Also, the classification model successfully distinguished four different types of coffee-bean with 100% accuracy.
Collapse
Affiliation(s)
- Moonjeong Jang
- Thin Film Materials Research CenterKorea Research Institute of Chemical TechnologyDaejeon34114Republic of Korea
- National Nano Fab Center (NNFC)Daejeon34141Republic of Korea
| | - Garam Bae
- Thin Film Materials Research CenterKorea Research Institute of Chemical TechnologyDaejeon34114Republic of Korea
- Department of Medical Artificial IntelligenceKonyang UniversityDaejeon35365Republic of Korea
| | - Yeong Min Kwon
- Thin Film Materials Research CenterKorea Research Institute of Chemical TechnologyDaejeon34114Republic of Korea
| | - Jae Hee Cho
- Thin Film Materials Research CenterKorea Research Institute of Chemical TechnologyDaejeon34114Republic of Korea
| | - Do Hyung Lee
- Thin Film Materials Research CenterKorea Research Institute of Chemical TechnologyDaejeon34114Republic of Korea
| | - Saewon Kang
- Thin Film Materials Research CenterKorea Research Institute of Chemical TechnologyDaejeon34114Republic of Korea
| | - Soonmin Yim
- Thin Film Materials Research CenterKorea Research Institute of Chemical TechnologyDaejeon34114Republic of Korea
| | - Sung Myung
- Thin Film Materials Research CenterKorea Research Institute of Chemical TechnologyDaejeon34114Republic of Korea
| | - Jongsun Lim
- Thin Film Materials Research CenterKorea Research Institute of Chemical TechnologyDaejeon34114Republic of Korea
| | - Sun Sook Lee
- Thin Film Materials Research CenterKorea Research Institute of Chemical TechnologyDaejeon34114Republic of Korea
| | - Wooseok Song
- Thin Film Materials Research CenterKorea Research Institute of Chemical TechnologyDaejeon34114Republic of Korea
- School of Electronic and Electrical EngineeringSunkyunkwan UniversitySuwon16419Republic of Korea
| | - Ki‐Seok An
- Thin Film Materials Research CenterKorea Research Institute of Chemical TechnologyDaejeon34114Republic of Korea
| |
Collapse
|
4
|
Li JQ, Neng-Wang H, Canning AJ, Gaona A, Crawford BM, Garman KS, Vo-Dinh T. Surface-Enhanced Raman Spectroscopy-Based Detection of Micro-RNA Biomarkers for Biomedical Diagnosis Using a Comparative Study of Interpretable Machine Learning Algorithms. APPLIED SPECTROSCOPY 2024; 78:84-98. [PMID: 37908079 DOI: 10.1177/00037028231209053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Surface-enhanced Raman spectroscopy (SERS) has wide diagnostic applications due to narrow spectral features that allow multiplex analysis. We have previously developed a multiplexed, SERS-based nanosensor for micro-RNA (miRNA) detection called the inverse molecular sentinel (iMS). Machine learning (ML) algorithms have been increasingly adopted for spectral analysis due to their ability to discover underlying patterns and relationships within large and complex data sets. However, the high dimensionality of SERS data poses a challenge for traditional ML techniques, which can be prone to overfitting and poor generalization. Non-negative matrix factorization (NMF) reduces the dimensionality of SERS data while preserving information content. In this paper, we compared the performance of ML methods including convolutional neural network (CNN), support vector regression, and extreme gradient boosting combined with and without NMF for spectral unmixing of four-way multiplexed SERS spectra from iMS assays used for miRNA detection. CNN achieved high accuracy in spectral unmixing. Incorporating NMF before CNN drastically decreased memory and training demands without sacrificing model performance on SERS spectral unmixing. Additionally, models were interpreted using gradient class activation maps and partial dependency plots to understand predictions. These models were used to analyze clinical SERS data from single-plexed iMS in RNA extracted from 17 endoscopic tissue biopsies. CNN and CNN-NMF, trained on multiplexed data, performed most accurately with RMSElabel = 0.101 and 9.68 × 10-2, respectively. We demonstrated that CNN-based ML shows great promise in spectral unmixing of multiplexed SERS spectra, and the effect of dimensionality reduction on performance and training speed.
Collapse
Affiliation(s)
- Joy Q Li
- Fitzpatrick Institute for Photonics, Durham, North Carolina, USA
- Department of Biomedical Engineering, Duke University, Durham, North Carolina, USA
| | - Hsin Neng-Wang
- Fitzpatrick Institute for Photonics, Durham, North Carolina, USA
- Department of Biomedical Engineering, Duke University, Durham, North Carolina, USA
| | - Aidan J Canning
- Fitzpatrick Institute for Photonics, Durham, North Carolina, USA
- Department of Biomedical Engineering, Duke University, Durham, North Carolina, USA
| | - Alejandro Gaona
- Fitzpatrick Institute for Photonics, Durham, North Carolina, USA
- Department of Biomedical Engineering, Duke University, Durham, North Carolina, USA
| | - Bridget M Crawford
- Fitzpatrick Institute for Photonics, Durham, North Carolina, USA
- Department of Biomedical Engineering, Duke University, Durham, North Carolina, USA
| | - Katherine S Garman
- Division of Gastroenterology, Department of Medicine, Duke University, Durham, North Carolina, USA
| | - Tuan Vo-Dinh
- Fitzpatrick Institute for Photonics, Durham, North Carolina, USA
- Department of Biomedical Engineering, Duke University, Durham, North Carolina, USA
- Department of Chemistry, Duke University, Durham, North Carolina, USA
| |
Collapse
|
5
|
Ju Y, Neumann O, Bajomo M, Zhao Y, Nordlander P, Halas NJ, Patel A. Identifying Surface-Enhanced Raman Spectra with a Raman Library Using Machine Learning. ACS NANO 2023; 17:21251-21261. [PMID: 37910670 DOI: 10.1021/acsnano.3c05510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/03/2023]
Abstract
Since its discovery, surface-enhanced Raman spectroscopy (SERS) has shown outstanding promise of identifying trace amounts of unknown molecules in rapid, portable formats. However, the many different types of nanoparticles or nanostructured metallic SERS substrates created over the past few decades show substantial variability in the SERS spectra they provide. These inconsistencies have even raised speculation that substrate-specific SERS spectral libraries must be compiled for practical use of this type of spectroscopy. Here, we report a machine learning (ML) algorithm that can identify chemicals by matching their SERS spectra to those of a standard Raman spectral library. We use an approach analogous to facial recognition that utilizes feature extraction in the presence of multiple nuisance variables for spectral recognition. The key element is a metric we call "Characteristic Peak Similarity" (CaPSim) that focuses on the characteristic peaks in the SERS spectra. It has the flexibility to accommodate substrate-specific variability when quantifying the degree of similarity to a Raman spectrum. Analysis shows that CaPSim substantially outperforms existing spectral matching algorithms in terms of accuracy. This ML-based approach could greatly facilitate the spectroscopic identification of molecules in fieldable SERS applications.
Collapse
Affiliation(s)
| | | | | | - Yiping Zhao
- Department of Physics and Astronomy, University of Georgia, Athens, Georgia 30602, United States
| | | | | | - Ankit Patel
- Department of Neuroscience, Baylor College of Medicine, Houston, Texas 77030, United States
| |
Collapse
|
6
|
Yousuff M, Babu R. Enhancing the classification metrics of spectroscopy spectrums using neural network based low dimensional space. EARTH SCIENCE INFORMATICS 2022; 16:825-844. [PMID: 36575666 PMCID: PMC9782283 DOI: 10.1007/s12145-022-00917-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 12/04/2022] [Indexed: 06/17/2023]
Abstract
Spectroscopy is a methodology for gaining knowledge of particles, especially biomolecules, by quantifying the interactions between matter and light. By examining the level of light absorbed, reflected or released by a specimen, its constituents, properties, and volume can be determined. Spectra obtained through spectroscopy procedures are quick, harmless and contactless; hence nowadays preferred in chemometrics. Due to the high dimensional nature of the spectra, it is challenging to build a robust classifier with good performance metrics. Many linear and nonlinear dimensionality reduction-based classification models have been previously implemented to overcome this issue. However, they lack in capturing the subtle details of the spectra into the low dimension space or cannot efficiently handle the nonlinearity present in the spectral data. We propose a graph-based neural network embedding approach to extract appropriate features into latent space and circumvent the spectrums' nonlinearity problem. Our approach performs dimensionality reduction into two phases: constructing a nearest neighbor graph and producing almost linear embedding using a fully connected neural network. Further, the low dimensional embedding is subjected to classification using the Random Forest algorithm. In this paper, we have implemented and compared our technique with four nonlinear dimensionality techniques widely used for spectral data analysis. In this study, we have considered five different spectral datasets belonging to specific applications. The various classification performance metrics of all the techniques are evaluated. The proposed approach is able to perform competitively well on six different low-dimensional spaces for each dataset with an accuracy score above 95% and Matthew's correlation coefficient value close to 1. The trustworthiness score of almost 1 show that the presented dimensionality reduction approach preserves the closest neighbor structure of high dimensional spectral inputs into latent space.
Collapse
Affiliation(s)
- Mohamed Yousuff
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore Campus, Vellore, 632014 Tamilnadu India
| | - Rajasekhara Babu
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore Campus, Vellore, 632014 Tamilnadu India
| |
Collapse
|