1
|
Shamraeva MA, Visvikis T, Zoidis S, Anthony IGM, Van Nuffel S. The Application of a Random Forest Classifier to ToF-SIMS Imaging Data. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024. [PMID: 39455427 DOI: 10.1021/jasms.4c00324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/28/2024]
Abstract
Time-of-flight secondary ion mass spectrometry (ToF-SIMS) imaging is a potent analytical tool that provides spatially resolved chemical information on surfaces at the microscale. However, the hyperspectral nature of ToF-SIMS datasets can be challenging to analyze and interpret. Both supervised and unsupervised machine learning (ML) approaches are increasingly useful to help analyze ToF-SIMS data. Random Forest (RF) has emerged as a robust and powerful algorithm for processing mass spectrometry data. This machine learning approach offers several advantages, including accommodating nonlinear relationships, robustness to outliers in the data, managing the high-dimensional feature space, and mitigating the risk of overfitting. The application of RF to ToF-SIMS imaging facilitates the classification of complex chemical compositions and the identification of features contributing to these classifications. This tutorial aims to assist nonexperts in either machine learning or ToF-SIMS to apply Random Forest to complex ToF-SIMS datasets.
Collapse
Affiliation(s)
- Mariya A Shamraeva
- Maastricht MultiModal Molecular Imaging Institute (M4i), Maastricht University, Universiteitssingel 50, 6229 ER Maastricht, The Netherlands
| | - Theodoros Visvikis
- Faculty of Science and Engineering, Maastricht University, Paul-Henri Spaaklaan 1, Maastricht 6229EN, The Netherlands
| | - Stefanos Zoidis
- Faculty of Science and Engineering, Maastricht University, Paul-Henri Spaaklaan 1, Maastricht 6229EN, The Netherlands
| | - Ian G M Anthony
- Maastricht MultiModal Molecular Imaging Institute (M4i), Maastricht University, Universiteitssingel 50, 6229 ER Maastricht, The Netherlands
| | - Sebastiaan Van Nuffel
- Maastricht MultiModal Molecular Imaging Institute (M4i), Maastricht University, Universiteitssingel 50, 6229 ER Maastricht, The Netherlands
- Faculty of Science and Engineering, Maastricht University, Paul-Henri Spaaklaan 1, Maastricht 6229EN, The Netherlands
| |
Collapse
|
2
|
Aoyagi S, Fujita M, Itoh H, Itoh H, Nagatomi T, Okamoto M, Ueno T. Development of Peptide Identification System for ToF-SIMS Spectra Using Supervised Machine Learning. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024. [PMID: 39395019 DOI: 10.1021/jasms.4c00310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/14/2024]
Abstract
Time-of-flight secondary ion mass spectrometry (ToF-SIMS) data interpretation for organic materials is complicated because of various fragment ions produced from each molecule and the overlapping of certain mass peaks from different molecules. Fragmentation mechanisms in SIMS are complex because different sputtering and ionization processes can simultaneously occur. Therefore, a prediction system that can identify materials in a sample is required. A novel prediction system for peptides based on ToF-SIMS and amino-acid-based teaching information (labels) for supervised machine learning was developed. To develop the prediction system for general organic materials, the annotation of materials is crucial to creating effective labels for supervised learning. Peptides are composed of 20 amino acid residues, which can be used as labels. We previously developed a peptide prediction system using Random Forest, a supervised machine-learning method. However, only the amino acids contained in the target peptide were predicted, and the amino acid sequence was unable to be assumed. In this study, the amino acid sequence of the test peptide was determined by adding the information on two adjacent amino acids to the labels. Once the prediction system learned the target peptide spectra, the peptides in the newly obtained ToF-SIMS spectra could be identified. The new prediction system also provides useful information for the identification of unknown peptides. The prediction results indicate that two adjacent permutations of amino acids are effective pieces of teaching information for expressing the amino acid sequence of a peptide.
Collapse
Affiliation(s)
- Satoka Aoyagi
- Faculty of Science and Technology, Seikei University, Musashino, Tokyo 180-8633, Japan
| | - Miya Fujita
- JSR Corporation, 100 Kawajiri-Cho, Yokkaichi, Mie 510-8552, Japan
| | - Hidemi Itoh
- Platform Laboratory for Science and Technology, Asahi Kasei Corporation, 2-1 Samejima, Fuji, Shizuoka 416-8501, Japan
| | - Hiroto Itoh
- Material Science Group, Data Generation Division, Data Science Center, Technology Development Headquarters, Konica Minolta, Inc., Tokyo 100-7015, Japan
| | - Takaharu Nagatomi
- Platform Laboratory for Science and Technology, Asahi Kasei Corporation, 2-1 Samejima, Fuji, Shizuoka 416-8501, Japan
| | - Masayuki Okamoto
- Analytical Science Research Laboratory, Kao Corp., Minato 1334, Wakayama-shi, Wakayama 640-8580, Japan
| | - Tomikazu Ueno
- JSR Corporation, 100 Kawajiri-Cho, Yokkaichi, Mie 510-8552, Japan
| |
Collapse
|
3
|
Bamford SE, Gardner W, Winkler DA, Muir BW, Alahakoon D, Pigram PJ. Self-Organizing Maps for Secondary Ion Mass Spectrometry. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:2516-2528. [PMID: 39307990 DOI: 10.1021/jasms.4c00318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/03/2024]
Abstract
Secondary ion mass spectrometry (SIMS) is a powerful analytical technique for characterizing the molecular and elemental composition of surfaces. Individual mass spectra can provide information about the mean surface composition, while spatial mapping can elucidate the spatial distributions of molecular species in 2D and 3D with no prior labeling of molecular targets. The data sets produced by SIMS techniques are large and inherently complex, often containing subtle relationships between spatial and molecular features. Machine learning algorithms are well suited to exploring this complexity, making them ideal for data analysis, interpretation, and visualization of SIMS data sets. One such algorithm, the self-organizing map (SOM), is particularly well suited to clustering similar samples and reducing the dimensionality of hyperspectral data sets. Here, we present an introduction to the SOM, a concise mathematical description, and recent examples of its use in SIMS and other related mass spectrometry techniques. These examples demonstrate how SOMs may be used to interpret high volumes of individual mass spectra, imaging, or depth profiling data sets. This review will be useful for specialists in SIMS and other mass spectral techniques seeking to explore self-organizing maps for data analysis.
Collapse
Affiliation(s)
- Sarah E Bamford
- Centre for Materials and Surface Science and Department of Mathematical and Physical Sciences, La Trobe University, Bundoora, Victoria 3086, Australia
| | - Wil Gardner
- Centre for Materials and Surface Science and Department of Mathematical and Physical Sciences, La Trobe University, Bundoora, Victoria 3086, Australia
| | - David A Winkler
- Department of Biochemistry and Chemistry, La Trobe Institute for Molecular Science, La Trobe University, Bundoora, Victoria 3086, Australia
- Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria 3052, Australia
- School of Pharmacy, University of Nottingham, Nottingham NG7 2RD, United Kingdom
| | | | - Damminda Alahakoon
- Research Centre for Data Analytics and Cognition, La Trobe Business School, La Trobe University, Bundoora, Victoria 3086, Australia
| | - Paul J Pigram
- Centre for Materials and Surface Science and Department of Mathematical and Physical Sciences, La Trobe University, Bundoora, Victoria 3086, Australia
| |
Collapse
|
4
|
Zhao Y, Otto SK, Lombardo T, Henss A, Koeppe A, Selzer M, Janek J, Nestler B. Identification of Lithium Compounds on Surfaces of Lithium Metal Anode with Machine-Learning-Assisted Analysis of ToF-SIMS Spectra. ACS APPLIED MATERIALS & INTERFACES 2023; 15:50469-50478. [PMID: 37852613 PMCID: PMC10623505 DOI: 10.1021/acsami.3c09643] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 09/13/2023] [Indexed: 10/20/2023]
Abstract
Detailed knowledge about contamination and passivation compounds on the surface of lithium metal anodes (LMAs) is essential to enable their use in all-solid-state batteries (ASSBs). Time-of-flight secondary ion mass spectrometry (ToF-SIMS), a highly surface-sensitive technique, can be used to reliably characterize the surface status of LMAs. However, as ToF-SIMS data are usually highly complex, manual data analysis can be difficult and time-consuming. In this study, machine learning techniques, especially logistic regression (LR), are used to identify the characteristic secondary ions of 5 different pure lithium compounds. Furthermore, these models are applied to the mixture and LMA samples to enable identification of their compositions based on the measured ToF-SIMS spectra. This machine-learning-based analysis approach shows good performance in identifying characteristic ions of the analyzed compounds that fit well with their chemical nature. Moreover, satisfying accuracy in identifying the compositions of unseen new samples is achieved. In addition, the scope and limitations of such a strategy in practical applications are discussed. This work presents a robust analytical method that can assist researchers in simplifying the analysis of the studied lithium compound samples, offering the potential for broader applications in other material systems.
Collapse
Affiliation(s)
- Yinghan Zhao
- Institute
for Applied Materials − Microstructure Modelling and Simulation, Karlsruhe Institute of Technology, D-76131 Karlsruhe, Germany
| | - Svenja-K. Otto
- Institute
of Physical Chemistry, Justus-Liebig-Universität
Giessen, D-35392 Giessen, Germany
| | - Teo Lombardo
- Institute
of Physical Chemistry, Justus-Liebig-Universität
Giessen, D-35392 Giessen, Germany
| | - Anja Henss
- Institute
of Physical Chemistry, Justus-Liebig-Universität
Giessen, D-35392 Giessen, Germany
| | - Arnd Koeppe
- Institute
for Applied Materials − Microstructure Modelling and Simulation, Karlsruhe Institute of Technology, D-76131 Karlsruhe, Germany
| | - Michael Selzer
- Institute
for Applied Materials − Microstructure Modelling and Simulation, Karlsruhe Institute of Technology, D-76131 Karlsruhe, Germany
- Institute
for Digital Materials Science, Karlsruhe
University of Applied Sciences, D-76133 Karlsruhe, Germany
| | - Jürgen Janek
- Institute
of Physical Chemistry, Justus-Liebig-Universität
Giessen, D-35392 Giessen, Germany
| | - Britta Nestler
- Institute
for Applied Materials − Microstructure Modelling and Simulation, Karlsruhe Institute of Technology, D-76131 Karlsruhe, Germany
- Institute
for Digital Materials Science, Karlsruhe
University of Applied Sciences, D-76133 Karlsruhe, Germany
| |
Collapse
|
5
|
Bamford SE, Vassileff N, Spiers JG, Gardner W, Winkler DA, Muir BW, Hill AF, Pigram PJ. High resolution imaging and analysis of extracellular vesicles using mass spectral imaging and machine learning. JOURNAL OF EXTRACELLULAR BIOLOGY 2023; 2:e110. [PMID: 38938371 PMCID: PMC11080915 DOI: 10.1002/jex2.110] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Revised: 07/16/2023] [Accepted: 08/22/2023] [Indexed: 06/29/2024]
Abstract
Extracellular vesicles (EVs) are potentially useful biomarkers for disease detection and monitoring. Development of a label-free technique for imaging and distinguishing small volumes of EVs from different cell types and cell states would be of great value. Here, we have designed a method to explore the chemical changes in EVs associated with neuroinflammation using Time-of-Flight Secondary Ion Mass spectrometry (ToF-SIMS) and machine learning (ML). Mass spectral imaging was able to identify and differentiate EVs released by microglia following lipopolysaccharide (LPS) stimulation compared to a control group. This process requires a much smaller sample size (1 µL) than other molecular analysis methods (up to 50 µL). Conspicuously, we saw a reduction in free cysteine thiols (a marker of cellular oxidative stress associated with neuroinflammation) in EVs from microglial cells treated with LPS, consistent with the reduced cellular free thiol levels measured experimentally. This validates the synergistic combination of ToF-SIMS and ML as a sensitive and valuable technique for collecting and analysing molecular data from EVs at high resolution.
Collapse
Affiliation(s)
- Sarah Elizabeth Bamford
- Centre for Materials and Surface Science and Department of Mathematical and Physical SciencesLa Trobe UniversityBundooraVictoriaAustralia
| | - Natasha Vassileff
- The Department of Biochemistry and ChemistryLa Trobe Institute for Molecular ScienceLa Trobe UniversityBundooraVictoriaAustralia
| | - Jereme G. Spiers
- The Department of Biochemistry and ChemistryLa Trobe Institute for Molecular ScienceLa Trobe UniversityBundooraVictoriaAustralia
- Clear Vision Research, Eccles Institute of Neuroscience, John Curtin School of Medical Research, College of Health and MedicineThe Australian National UniversityActonACTAustralia
- School of Medicine and Psychology, College of Health and MedicineThe Australian National UniversityActonACTAustralia
| | - Wil Gardner
- Centre for Materials and Surface Science and Department of Mathematical and Physical SciencesLa Trobe UniversityBundooraVictoriaAustralia
| | - David A. Winkler
- The Department of Biochemistry and ChemistryLa Trobe Institute for Molecular ScienceLa Trobe UniversityBundooraVictoriaAustralia
- Monash Institute of Pharmaceutical SciencesMonash UniversityParkvilleVictoriaAustralia
- School of PharmacyUniversity of NottinghamNottinghamUK
| | | | - Andrew F. Hill
- The Department of Biochemistry and ChemistryLa Trobe Institute for Molecular ScienceLa Trobe UniversityBundooraVictoriaAustralia
- Institute for Health and SportVictoria UniversityVictoriaAustralia
| | - Paul J. Pigram
- Centre for Materials and Surface Science and Department of Mathematical and Physical SciencesLa Trobe UniversityBundooraVictoriaAustralia
| |
Collapse
|
6
|
Parker GD, Hanley L, Yu XY. Mass Spectral Imaging to Map Plant-Microbe Interactions. Microorganisms 2023; 11:2045. [PMID: 37630605 PMCID: PMC10459445 DOI: 10.3390/microorganisms11082045] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2023] [Revised: 07/23/2023] [Accepted: 08/08/2023] [Indexed: 08/27/2023] Open
Abstract
Plant-microbe interactions are of rising interest in plant sustainability, biomass production, plant biology, and systems biology. These interactions have been a challenge to detect until recent advancements in mass spectrometry imaging. Plants and microbes interact in four main regions within the plant, the rhizosphere, endosphere, phyllosphere, and spermosphere. This mini review covers the challenges within investigations of plant and microbe interactions. We highlight the importance of sample preparation and comparisons among time-of-flight secondary ion mass spectroscopy (ToF-SIMS), matrix-assisted laser desorption/ionization (MALDI), laser desorption ionization (LDI/LDPI), and desorption electrospray ionization (DESI) techniques used for the analysis of these interactions. Using mass spectral imaging (MSI) to study plants and microbes offers advantages in understanding microbe and host interactions at the molecular level with single-cell and community communication information. More research utilizing MSI has emerged in the past several years. We first introduce the principles of major MSI techniques that have been employed in the research of microorganisms. An overview of proper sample preparation methods is offered as a prerequisite for successful MSI analysis. Traditionally, dried or cryogenically prepared, frozen samples have been used; however, they do not provide a true representation of the bacterial biofilms compared to living cell analysis and chemical imaging. New developments such as microfluidic devices that can be used under a vacuum are highly desirable for the application of MSI techniques, such as ToF-SIMS, because they have a subcellular spatial resolution to map and image plant and microbe interactions, including the potential to elucidate metabolic pathways and cell-to-cell interactions. Promising results due to recent MSI advancements in the past five years are selected and highlighted. The latest developments utilizing machine learning are captured as an important outlook for maximal output using MSI to study microorganisms.
Collapse
Affiliation(s)
- Gabriel D. Parker
- Department of Chemistry, University of Illinois Chicago, Chicago, IL 60607, USA
| | - Luke Hanley
- Department of Chemistry, University of Illinois Chicago, Chicago, IL 60607, USA
| | - Xiao-Ying Yu
- Materials Science and Technology Division, Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA
| |
Collapse
|
7
|
Aoyagi S, Matsuda K. Quantitative analysis of ToF-SIMS data of a two organic compound mixture using an autoencoder and simple artificial neural networks. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2023; 37:e9445. [PMID: 36457202 DOI: 10.1002/rcm.9445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 11/22/2022] [Accepted: 11/22/2022] [Indexed: 06/17/2023]
Abstract
RATIONALE Matrix effects cause a nonlinear relationship between ion intensities and concentrations in mass spectrometry, including time-of-flight secondary ion mass spectrometry (ToF-SIMS). Here, two artificial neural network (ANN)-based methods, autoencoder-based and simple ANN methods, were employed for the quantitative and qualitative analyses of a two organic compound mixture via ToF-SIMS. METHODS The multilayer model sample contained a mixture of Irganox 1010 and Fmoc-pentafluoro-L-phenylalanine (Fmoc-PFLPA). The sample's positive and negative ion depth profiles were collected through ToF-SIMS. ToF-SIMS-derived cross-sectional image datasets were analyzed using three unsupervised methods, namely principal component analysis (PCA), multivariate curve resolution (MCR), and use of a sparse autoencoder (SAE). The supervised simple ANN method was optimized based on the spectra and validated by predicting the test dataset ratios of Irganox 1010. RESULTS The results obtained using the SAE demonstrated linear calibration curves and appropriate material distribution images. The Irganox 1010 and Fmoc-PFLPA positive and negative ion datasets exhibited >0.97 correlation coefficients. The PCA and MCR results demonstrated lower linearity than that of SAE. Moreover, SAE weights indicated the ions important for each organic compound. The simple ANN method accurately predicted the ratios in the test dataset and indicated the important ions. CONCLUSIONS Both the supervised and unsupervised methods based on ANN, which were employed in regulating nonlinear relationships, were effective in the quantitative and qualitative analyses of the ToF-SIMS data of the two organic compound mixtures. Regarding qualitative analysis, both ANN-based methods indicated specific ions from the molecules in the sample.
Collapse
Affiliation(s)
- Satoka Aoyagi
- Faculty of Science and Technology, Seikei University, Tokyo, Japan
| | - Kazuhiro Matsuda
- Surface Science Laboratories, Toray Research Center, Inc., Otsu, Shiga, Japan
| |
Collapse
|
8
|
Forbes TP, Gillen JG, Souna AJ, Lawrence J. Unsupervised Pharmaceutical Polymorph Identification and Multicomponent Particle Mapping of ToF-SIMS Data by Non-Negative Matrix Factorization. Anal Chem 2022; 94:16443-16450. [DOI: 10.1021/acs.analchem.2c03913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Affiliation(s)
- Thomas P. Forbes
- Materials Measurement Science Division, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, United States
| | - John Greg Gillen
- Materials Measurement Science Division, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, United States
| | - Amanda J. Souna
- Materials Science and Engineering Division, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, United States
- Department of Chemistry and Biochemistry, University of Maryland, College Park, Maryland 20742, United States
| | - Jeffrey Lawrence
- Materials Measurement Science Division, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, United States
| |
Collapse
|
9
|
Zushi Y. Direct Prediction of Physicochemical Properties and Toxicities of Chemicals from Analytical Descriptors by GC-MS. Anal Chem 2022; 94:9149-9157. [PMID: 35700270 PMCID: PMC9246259 DOI: 10.1021/acs.analchem.2c01667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
![]()
With advances in
machine learning (ML) techniques, the quantitative
structure–activity relationship (QSAR) approach is becoming
popular for evaluating chemicals. However, the QSAR approach requires
that the chemical structure of the target compound is known and that
it should be convertible to molecular descriptors. These requirements
lead to limitations in predicting the properties and toxicities of
chemicals distributed in the environment as in the PubChem database;
the structural information on only 14% of compounds is available.
This study proposes a new ML-based QSAR approach that can predict
the properties and toxicities of compounds using analytical descriptors
of mass spectrum and retention index obtained via gas chromatography–mass
spectrometry without requiring exact structural information. The model
was developed based on the XGBoost ML method. The root-mean-square
errors (RMSEs) for log Ko-w, log (molecular weight), melting point,
boiling point, log (vapor pressure), log (water solubility), log (LD50) (rat, oral), and log (LD50) (mouse, oral) are
0.97, 0.052, 51, 23, 0.74, 1.1, 0.74, and 0.6, respectively. The model
performed well on a chemical standard mixture measurement, with similar
results to those of model validation. It also performed well on a
measurement of contaminated oil with spectral deconvolution. These
results indicate that the model is suitable for investigating unknown-structured
chemicals detected in measurements. Any online user can execute the
model through a web application named Detective-QSAR (http://www.mixture-platform.net/Detective_QSAR_Med_Open/). The analytical descriptor-based approach is expected to create
new opportunities for the evaluation of unknown chemicals around us.
Collapse
Affiliation(s)
- Yasuyuki Zushi
- Research Institute of Science for Safety and Sustainability, National Institute of Advanced Industrial Science and Technology, 16-1 Onogawa, Tsukuba, Ibaraki 305-8506, Japan.,Graduate School of Science and Technology, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki 305-8577, Japan
| |
Collapse
|
10
|
Gardner W, Winkler DA, Cutts SM, Torney SA, Pietersz GA, Muir BW, Pigram PJ. Two-Dimensional and Three-Dimensional Time-of-Flight Secondary Ion Mass Spectrometry Image Feature Extraction Using a Spatially Aware Convolutional Autoencoder. Anal Chem 2022; 94:7804-7813. [PMID: 35616489 DOI: 10.1021/acs.analchem.1c05453] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Feature extraction algorithms are an important class of unsupervised methods used to reduce data dimensionality. They have been applied extensively for time-of-flight secondary ion mass spectrometry (ToF-SIMS) imaging─commonly, matrix factorization (MF) techniques such as principal component analysis have been used. A limitation of MF is the assumption of linearity, which is generally not accurate for ToF-SIMS data. Recently, nonlinear autoencoders have been shown to outperform MF techniques for ToF-SIMS image feature extraction. However, another limitation of most feature extraction methods (including autoencoders) that is particularly important for hyperspectral data is that they do not consider spatial information. To address this limitation, we describe the application of the convolutional autoencoder (CNNAE) to hyperspectral ToF-SIMS imaging data. The CNNAE is an artificial neural network developed specifically for hyperspectral data that uses convolutional layers for image encoding, thereby explicitly incorporating pixel neighborhood information. We compared the performance of the CNNAE with other common feature extraction algorithms for two biological ToF-SIMS imaging data sets. We investigated the extracted features and used the dimensionality-reduced data to train additional ML algorithms. By converting two-dimensional convolutional layers to three-dimensional (3D), we also showed how the CNNAE can be extended to 3D ToF-SIMS images. In general, the CNNAE produced features with significantly higher contrast and autocorrelation than other techniques. Furthermore, histologically recognizable features in the data were more accurately represented. The extension of the CNNAE to 3D data also provided an important proof of principle for the analysis of more complex 3D data sets.
Collapse
Affiliation(s)
- Wil Gardner
- Centre for Materials and Surface Science and Department of Chemistry and Physics, La Trobe University, Bundoora, Victoria 3086, Australia.,La Trobe Institute for Molecular Sciences, La Trobe University, Bundoora, Victoria 3086, Australia.,CSIRO Manufacturing, Clayton, Victoria 3168, Australia
| | - David A Winkler
- La Trobe Institute for Molecular Sciences, La Trobe University, Bundoora, Victoria 3086, Australia.,Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria 3052, Australia.,School of Pharmacy, University of Nottingham, Nottingham NG7 2RD, U.K
| | - Suzanne M Cutts
- La Trobe Institute for Molecular Sciences, La Trobe University, Bundoora, Victoria 3086, Australia
| | - Steven A Torney
- La Trobe Institute for Molecular Sciences, La Trobe University, Bundoora, Victoria 3086, Australia
| | - Geoffrey A Pietersz
- Immune Therapies Laboratory, Burnet Institute, Melbourne, Victoria 3004, Australia.,Atherothrombosis and Vascular Biology Laboratory, Baker Heart and Diabetes Institute, Melbourne, Victoria 3004, Australia
| | | | - Paul J Pigram
- Centre for Materials and Surface Science and Department of Chemistry and Physics, La Trobe University, Bundoora, Victoria 3086, Australia
| |
Collapse
|
11
|
Applications of multivariate analysis and unsupervised machine learning to ToF-SIMS images of organic, bioorganic, and biological systems. Biointerphases 2022; 17:020802. [DOI: 10.1116/6.0001590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Time-of-flight secondary ion mass spectrometry (ToF-SIMS) imaging offers a powerful, label-free method for exploring organic, bioorganic, and biological systems. The technique is capable of very high spatial resolution, while also producing an enormous amount of information about the chemical and molecular composition of a surface. However, this information is inherently complex, making interpretation and analysis of the vast amount of data produced by a single ToF-SIMS experiment a considerable challenge. Much research over the past few decades has focused on the application and development of multivariate analysis (MVA) and machine learning (ML) techniques that find meaningful patterns and relationships in these datasets. Here, we review the unsupervised algorithms—that is, algorithms that do not require ground truth labels—that have been applied to ToF-SIMS images, as well as other algorithms and approaches that have been used in the broader family of mass spectrometry imaging (MSI) techniques. We first give a nontechnical overview of several commonly used classes of unsupervised algorithms, such as matrix factorization, clustering, and nonlinear dimensionality reduction. We then review the application of unsupervised algorithms to various organic, bioorganic, and biological systems including cells and tissues, organic films, residues and coatings, and spatially structured systems such as polymer microarrays. We then cover several novel algorithms employed for other MSI techniques that have received little attention from ToF-SIMS imaging researchers. We conclude with a brief outline of potential future directions for the application of MVA and ML algorithms to ToF-SIMS images.
Collapse
|
12
|
Lang Y, Zhou L, Imamura Y. Development of Machine-Learning Techniques for Time-of-Flight Secondary Ion Mass Spectrometry Spectral Analysis: Application for the Identification of Silane Coupling Agents in Multicomponent Films. Anal Chem 2022; 94:2546-2553. [DOI: 10.1021/acs.analchem.1c04436] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Yusheng Lang
- Innovative Technology Laboratories, AGC Incorporated, Yokohama 230-0045, Japan
| | - Lilin Zhou
- Innovative Technology Laboratories, AGC Incorporated, Yokohama 230-0045, Japan
| | - Yutaka Imamura
- Innovative Technology Laboratories, AGC Incorporated, Yokohama 230-0045, Japan
| |
Collapse
|
13
|
Aoyagi S, Fujiwara Y, Takano A, Vorng JL, Gilmore IS, Wang YC, Tallarek E, Hagenhoff B, Iida SI, Luch A, Jungnickel H, Lang Y, Shon HK, Lee TG, Li Z, Matsuda K, Mihara I, Miisho A, Murayama Y, Nagatomi T, Ikeda R, Okamoto M, Saiga K, Tsuchiya T, Uemura S. Evaluation of Time-of-Flight Secondary Ion Mass Spectrometry Spectra of Peptides by Random Forest with Amino Acid Labels: Results from a Versailles Project on Advanced Materials and Standards Interlaboratory Study. Anal Chem 2021; 93:4191-4197. [PMID: 33635050 DOI: 10.1021/acs.analchem.0c04577] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We report the results of a VAMAS (Versailles Project on Advanced Materials and Standards) interlaboratory study on the identification of peptide sample TOF-SIMS spectra by machine learning. More than 1000 time-of-flight secondary ion mass spectrometry (TOF-SIMS) spectra of six peptide model samples (one of them was a test sample) were collected using 27 TOF-SIMS instruments from 25 institutes of six countries, the U. S., the U. K., Germany, China, South Korea, and Japan. Because peptides have systematic and simple chemical structures, they were selected as model samples. The intensity of peaks in every TOF-SIMS spectrum was extracted using the same peak list and normalized to the total ion count. The spectra of the test peptide sample were predicted by Random Forest with 20 amino acid labels. The accuracy of the prediction for the test spectra was 0.88. Although the prediction of an unknown peptide was not perfect, it was shown that all of the amino acids in an unknown peptide can be determined by Random Forest prediction and the TOF-SIMS spectra. Moreover, the prediction of peptides, which are included in the training spectra, was almost perfect. Random Forest also suggests specific fragment ions from an amino acid residue Q, whose fragment ions detected by TOF-SIMS have not been reported, in the important features. This study indicated that the analysis using Random Forest, which enables translation of the mathematical relationships to chemical relationships, and the multi labels representing monomer chemical structures, is useful to predict the TOF-SIMS spectra of an unknown peptide.
Collapse
Affiliation(s)
- Satoka Aoyagi
- Faculty of Science and Technology, Seikei University, Musashino, Tokyo 180-8633, Japan
| | - Yukio Fujiwara
- National Metrology Institute of Japan (NMIJ), National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Umezono, Tsukuba, Ibaraki 305-8568, Japan
| | - Akio Takano
- Toyama Co., Ltd., 3816-1 Kishi, Yamakita-machi, Ashigarakami-gun, Kanagawa 258-0112, Japan
| | - Jean-Luc Vorng
- National Physical Laboratory, Hampton Road, Teddington, Middlesex TW11 0LW, UK
| | - Ian S Gilmore
- National Physical Laboratory, Hampton Road, Teddington, Middlesex TW11 0LW, UK
| | - Yung-Chen Wang
- Medtronic, Corporate Science & Technology, 710 Medtronic Parkway, Mailstop LT240, Minneapolis Minnesota 55432, United States
| | | | | | - Shin-Ichi Iida
- ULVAC-PHI, Inc., 2500 Hagisono, Chigasaki, Kanagawa 253-8522, Japan
| | - Andreas Luch
- Department of Chemical and Product Safety, German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Strasse 8-10, Berlin 10589, Germany
| | - Harald Jungnickel
- Department of Chemical and Product Safety, German Federal Institute for Risk Assessment (BfR), Max-Dohrn-Strasse 8-10, Berlin 10589, Germany
| | - Yusheng Lang
- Analytical Science Team, Common Base Technology Division, Innovative Technology Laboratories, AGC Inc., 1150 Hazawa-cho, Kanagawa-ku, Yokohama-shi, Kanagawa 221-8755, Japan
| | - Hyun Kyong Shon
- Bio-imaging Team, Korea Research Institute of Standards and Science (KRISS), Daejeon 34113, South Korea
| | - Tae Geol Lee
- Bio-imaging Team, Korea Research Institute of Standards and Science (KRISS), Daejeon 34113, South Korea
| | - Zhanping Li
- Department of Chemistry, Tsinghua University, No. 30, Shuangqing Road, Haidian District, Beijing 100084, China
| | - Kazuhiro Matsuda
- Faculty of Science and Technology, Seikei University, Musashino, Tokyo 180-8633, Japan.,Surface Science Laboratories, Toray Research Center, Inc., 3-3-7, Sonoyama, Otsu, Shiga 520-8567, Japan
| | - Ichiro Mihara
- Analytical Technology and Solutions Laboratory, Kurashiki Research Center, KURARAY CO., LTD, 2045-1, Sakazu, Kurashiki, Okayama 710-0801, Japan
| | - Ako Miisho
- KOBELCO RESEARCH INSTITUTE, INC., 1-5-5, Takatsukadai, Nishi-ku, Kobe, Hyogo 651-2271, Japan
| | - Yohei Murayama
- Specialty Chemicals Development Center, Peripheral Products Operations, Canon Inc., 4202, Fukara, Susono, Shizuoka 410-1196, Japan
| | - Takaharu Nagatomi
- Platform Laboratory for Science and Technology, Asahi Kasei Corporation, 2-1 Samejima, Fuji, Shizuoka 416-8501, Japan
| | - Reiko Ikeda
- Analytical Science Research Laboratory, Kao Corp., Minato 1334. Wakayama-shi, Wakayama 640-8580, Japan
| | - Masayuki Okamoto
- Analytical Science Research Laboratory, Kao Corp., Minato 1334. Wakayama-shi, Wakayama 640-8580, Japan
| | - Kunio Saiga
- Mitsui Chemical Analysis & Consulting Service Inc., 580-32 Nagaura, Sodegaura, Chiba 299-0265, Japan
| | - Toshihiko Tsuchiya
- Mitsui Chemical Analysis & Consulting Service Inc., 580-32 Nagaura, Sodegaura, Chiba 299-0265, Japan
| | - Shigeaki Uemura
- Sumitomo Electric Industries, Ltd., 1-1-1, Koyakita, Itami, Hyogo 664-0016, Japan
| |
Collapse
|
14
|
Tuck M, Blanc L, Touti R, Patterson NH, Van Nuffel S, Villette S, Taveau JC, Römpp A, Brunelle A, Lecomte S, Desbenoit N. Multimodal Imaging Based on Vibrational Spectroscopies and Mass Spectrometry Imaging Applied to Biological Tissue: A Multiscale and Multiomics Review. Anal Chem 2020; 93:445-477. [PMID: 33253546 DOI: 10.1021/acs.analchem.0c04595] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Michael Tuck
- Institut de Chimie & Biologie des Membranes & des Nano-objets, CBMN UMR 5248, CNRS, Université de Bordeaux, 1 Allée Geoffroy Saint-Hilaire, 33600 Pessac, France
| | - Landry Blanc
- Institut de Chimie & Biologie des Membranes & des Nano-objets, CBMN UMR 5248, CNRS, Université de Bordeaux, 1 Allée Geoffroy Saint-Hilaire, 33600 Pessac, France
| | - Rita Touti
- Institut de Chimie & Biologie des Membranes & des Nano-objets, CBMN UMR 5248, CNRS, Université de Bordeaux, 1 Allée Geoffroy Saint-Hilaire, 33600 Pessac, France
| | - Nathan Heath Patterson
- Mass Spectrometry Research Center, Department of Biochemistry, Vanderbilt University, Nashville, Tennessee 37232-8575, United States
| | - Sebastiaan Van Nuffel
- Materials Research Institute, The Pennsylvania State University, University Park, Pennsylvania 16802, United States
| | - Sandrine Villette
- Institut de Chimie & Biologie des Membranes & des Nano-objets, CBMN UMR 5248, CNRS, Université de Bordeaux, 1 Allée Geoffroy Saint-Hilaire, 33600 Pessac, France
| | - Jean-Christophe Taveau
- Institut de Chimie & Biologie des Membranes & des Nano-objets, CBMN UMR 5248, CNRS, Université de Bordeaux, 1 Allée Geoffroy Saint-Hilaire, 33600 Pessac, France
| | - Andreas Römpp
- Bioanalytical Sciences and Food Analysis, University of Bayreuth, Universitätsstraße 30, 95440 Bayreuth, Germany
| | - Alain Brunelle
- Laboratoire d'Archéologie Moléculaire et Structurale, LAMS UMR 8220, CNRS, Sorbonne Université, 4 Place Jussieu, 75005 Paris, France
| | - Sophie Lecomte
- Institut de Chimie & Biologie des Membranes & des Nano-objets, CBMN UMR 5248, CNRS, Université de Bordeaux, 1 Allée Geoffroy Saint-Hilaire, 33600 Pessac, France
| | - Nicolas Desbenoit
- Institut de Chimie & Biologie des Membranes & des Nano-objets, CBMN UMR 5248, CNRS, Université de Bordeaux, 1 Allée Geoffroy Saint-Hilaire, 33600 Pessac, France
| |
Collapse
|
15
|
Analyzing 3D hyperspectral TOF-SIMS depth profile data using self-organizing map-relational perspective mapping. Biointerphases 2020; 15:061004. [PMID: 33198474 DOI: 10.1116/6.0000614] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
The advantages of applying multivariate analysis to mass spectrometry imaging (MSI) data have been thoroughly demonstrated in recent decades. The identification and visualization of complex relationships between pixels in a hyperspectral data set can provide unique insights into the underlying surface chemistry. It is now recognized that most MSI data contain nonlinear relationships, which has led to increased application of machine learning approaches. Previously, we exemplified the use of the self-organizing map (SOM), a type of artificial neural network, for analyzing time-of-flight secondary ion mass spectrometry (TOF-SIMS) hyperspectral images. Recently, we developed a novel methodology, SOM-relational perspective mapping (RPM), which incorporates the algorithm RPM to improve visualization of the SOM for 2D TOF-SIMS images. Here, we use SOM-RPM to characterize and interpret 3D TOF-SIMS depth profile data, voxel-by-voxel. An organic Irganox™ multilayer standard sample was depth profiled using TOF-SIMS, and SOM-RPM was used to create 3D similarity maps of the depth-profiled sample, in which the mass spectral similarity of individual voxels is modeled with color similarity. We used this similarity map to segment the data into spatial features, demonstrating that the unsupervised method meaningfully differentiated between Irganox-3114 and Irganox-1010 nanometer-thin multilayer films. The method also identified unique clusters at the surface associated with environmental exposure and sample degradation. Key fragment ions characteristic of each cluster were identified, tying clusters to their underlying chemistries. SOM-RPM has the demonstrable ability to reduce vast data sets to simple 3D visualizations that can be used for clustering data and visualizing the complex relationships within.
Collapse
|
16
|
Gardner W, Cutts SM, Phillips DR, Pigram PJ. Understanding mass spectrometry images: complexity to clarity with machine learning. Biopolymers 2020; 112:e23400. [PMID: 32937683 DOI: 10.1002/bip.23400] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 08/25/2020] [Accepted: 08/26/2020] [Indexed: 11/08/2022]
Abstract
The application of artificial intelligence and machine learning to hyperspectral mass spectrometry imaging (MSI) data has received considerable attention over recent years. Various methodologies have shown great promise in their ability to handle the complexity and size of MSI data sets. Advances in this area have been particularly appealing for MSI of biological samples, which typically produce highly complicated data with often subtle relationships between features. There are many different machine learning approaches that have been applied to MSI data over the past two decades. In this review, we focus on a subset of non-linear machine learning techniques that have mostly only been applied in the past 5 years. Specifically, we review the use of the self-organizing map (SOM), SOM with relational perspective mapping (SOM-RPM), t-distributed stochastic neighbor embedding (t-SNE) and uniform manifold approximation and projection (UMAP). While not their only functionality, we have grouped these techniques based on their ability to produce what we refer to as similarity maps. Similarity maps are color representations of hyperspectral data, in which spectral similarity between pixels-that is, their distance in high-dimensional space-is represented by relative color similarity. In discussing these techniques, we describe, briefly, their associated algorithms and functionalities, and also outline applications in MSI research with a strong focus on biological sample types. The aim of this review is therefore to introduce this relatively recent paradigm for visualizing and exploring hyperspectral MSI, while also providing a comparison between each technique discussed.
Collapse
Affiliation(s)
- Wil Gardner
- Centre for Materials and Surface Science and Department of Chemistry and Physics, La Trobe University, Melbourne, Victoria, Australia.,La Trobe Institute for Molecular Sciences, La Trobe University, Melbourne, Victoria, Australia.,CSIRO Manufacturing, Clayton, Victoria, Australia
| | - Suzanne M Cutts
- La Trobe Institute for Molecular Sciences, La Trobe University, Melbourne, Victoria, Australia
| | - Don R Phillips
- La Trobe Institute for Molecular Sciences, La Trobe University, Melbourne, Victoria, Australia
| | - Paul J Pigram
- Centre for Materials and Surface Science and Department of Chemistry and Physics, La Trobe University, Melbourne, Victoria, Australia
| |
Collapse
|
17
|
Gardner W, Maliki R, Cutts SM, Muir BW, Ballabio D, Winkler DA, Pigram PJ. Self-Organizing Map and Relational Perspective Mapping for the Accurate Visualization of High-Dimensional Hyperspectral Data. Anal Chem 2020; 92:10450-10459. [DOI: 10.1021/acs.analchem.0c00986] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Wil Gardner
- Centre for Materials and Surface Science and Department of Chemistry and Physics, La Trobe University, Melbourne, Victoria 3086, Australia
- La Trobe Institute for Molecular Sciences, La Trobe University, Melbourne, Victoria 3086, Australia
- CSIRO Manufacturing, Clayton, Victoria 3168, Australia
| | - Ruqaya Maliki
- Centre for Materials and Surface Science and Department of Chemistry and Physics, La Trobe University, Melbourne, Victoria 3086, Australia
- La Trobe Institute for Molecular Sciences, La Trobe University, Melbourne, Victoria 3086, Australia
| | - Suzanne M. Cutts
- La Trobe Institute for Molecular Sciences, La Trobe University, Melbourne, Victoria 3086, Australia
| | | | - Davide Ballabio
- Milano Chemometrics and QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Piazza della Scienza 1, 20126, Milano, Italy
| | - David A. Winkler
- La Trobe Institute for Molecular Sciences, La Trobe University, Melbourne, Victoria 3086, Australia
- Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria 3052, Australia
- School of Pharmacy, University of Nottingham, Nottingham NG7 2RD, United Kingdom
- CSIRO Data61, Melbourne, Victoria 3008, Australia
| | - Paul J. Pigram
- Centre for Materials and Surface Science and Department of Chemistry and Physics, La Trobe University, Melbourne, Victoria 3086, Australia
| |
Collapse
|