1
|
Verbeeck N, Caprioli RM, Van de Plas R. Unsupervised machine learning for exploratory data analysis in imaging mass spectrometry. MASS SPECTROMETRY REVIEWS 2020; 39:245-291. [PMID: 31602691 PMCID: PMC7187435 DOI: 10.1002/mas.21602] [Citation(s) in RCA: 118] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Accepted: 08/27/2018] [Indexed: 05/20/2023]
Abstract
Imaging mass spectrometry (IMS) is a rapidly advancing molecular imaging modality that can map the spatial distribution of molecules with high chemical specificity. IMS does not require prior tagging of molecular targets and is able to measure a large number of ions concurrently in a single experiment. While this makes it particularly suited for exploratory analysis, the large amount and high-dimensional nature of data generated by IMS techniques make automated computational analysis indispensable. Research into computational methods for IMS data has touched upon different aspects, including spectral preprocessing, data formats, dimensionality reduction, spatial registration, sample classification, differential analysis between IMS experiments, and data-driven fusion methods to extract patterns corroborated by both IMS and other imaging modalities. In this work, we review unsupervised machine learning methods for exploratory analysis of IMS data, with particular focus on (a) factorization, (b) clustering, and (c) manifold learning. To provide a view across the various IMS modalities, we have attempted to include examples from a range of approaches including matrix assisted laser desorption/ionization, desorption electrospray ionization, and secondary ion mass spectrometry-based IMS. This review aims to be an entry point for both (i) analytical chemists and mass spectrometry experts who want to explore computational techniques; and (ii) computer scientists and data mining specialists who want to enter the IMS field. © 2019 The Authors. Mass Spectrometry Reviews published by Wiley Periodicals, Inc. Mass SpecRev 00:1-47, 2019.
Collapse
Affiliation(s)
- Nico Verbeeck
- Delft Center for Systems and ControlDelft University of Technology ‐ TU DelftDelftThe Netherlands
- Aspect Analytics NVGenkBelgium
- STADIUS Center for Dynamical Systems, Signal Processing, and Data Analytics, Department of Electrical Engineering (ESAT)KU LeuvenLeuvenBelgium
| | - Richard M. Caprioli
- Mass Spectrometry Research CenterVanderbilt UniversityNashvilleTN
- Department of BiochemistryVanderbilt UniversityNashvilleTN
- Department of ChemistryVanderbilt UniversityNashvilleTN
- Department of PharmacologyVanderbilt UniversityNashvilleTN
- Department of MedicineVanderbilt UniversityNashvilleTN
| | - Raf Van de Plas
- Delft Center for Systems and ControlDelft University of Technology ‐ TU DelftDelftThe Netherlands
- Mass Spectrometry Research CenterVanderbilt UniversityNashvilleTN
- Department of BiochemistryVanderbilt UniversityNashvilleTN
| |
Collapse
|
2
|
Eriksson JO, Rezeli M, Hefner M, Marko-Varga G, Horvatovich P. Clusterwise Peak Detection and Filtering Based on Spatial Distribution To Efficiently Mine Mass Spectrometry Imaging Data. Anal Chem 2019; 91:11888-11896. [PMID: 31403280 PMCID: PMC6751525 DOI: 10.1021/acs.analchem.9b02637] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
![]()
Mass
spectrometry imaging (MSI) has the potential to reveal the
localization of thousands of biomolecules such as metabolites and
lipids in tissue sections. The increase in both mass and spatial resolution
of today’s instruments brings on considerable challenges in
terms of data processing; accurately extracting meaningful signals
from the large data sets generated by MSI without losing information
that could be clinically relevant is one of the most fundamental tasks
of analysis software. Ion images of the biomolecules are generated
by visualizing their intensities in 2-D space using mass spectra collected
across the tissue section. The intensities are often calculated by
summing each compound’s signal between predefined sets of borders
(bins) in the m/z dimension. This
approach, however, can result in mixed signals from different compounds
in the same bin or splitting the signal from one compound between
two adjacent bins, leading to low quality ion images. To remedy this
problem, we propose a novel data processing approach. Our approach
consists of a sensitive peak detection method able to discover both
faint and localized signals by utilizing clusterwise kernel density
estimates (KDEs) of peak distributions. We show that our method can
recall more ground-truth molecules, molecule fragments, and isotopes
than existing methods based on binning. Furthermore, it automatically
detects previously reported molecular ions of lipids, including those
close in m/z, in an experimental
data set.
Collapse
Affiliation(s)
| | - Melinda Rezeli
- Lund University , Department of Biomedical Engineering , Lund , Sweden
| | - Max Hefner
- Lund University , Department of Biomedical Engineering , Lund , Sweden
| | | | - Peter Horvatovich
- Lund University , Department of Biomedical Engineering , Lund , Sweden.,University of Groningen, Department of Analytical Biochemistry , Groningen Research Institute of Pharmacy , Antonius Deusinglaan 1 , 9713 AV Groningen , The Netherlands
| |
Collapse
|
3
|
Han J, Permentier H, Bischoff R, Groothuis G, Casini A, Horvatovich P. Imaging of protein distribution in tissues using mass spectrometry: An interdisciplinary challenge. Trends Analyt Chem 2019. [DOI: 10.1016/j.trac.2018.12.016] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
4
|
Marcell Szasz A, Malm J, Rezeli M, Sugihara Y, Betancourt LH, Rivas D, Gyorffy B, Marko-Varga G. Challenging the heterogeneity of disease presentation in malignant melanoma-impact on patient treatment. Cell Biol Toxicol 2018; 35:1-14. [PMID: 30357519 PMCID: PMC6514062 DOI: 10.1007/s10565-018-9446-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2018] [Accepted: 08/29/2018] [Indexed: 11/27/2022]
Abstract
There is an increasing global interest to support research areas that can assist in understanding disease and improving patient care. The National Cancer Institute (NIH) has identified precision medicine-based approaches as key research strategies to expedite advances in cancer research. The Cancer Moonshot program ( https://www.cancer.gov/research/key-initiatives/moonshot-cancer-initiative ) is the largest cancer program of all time, and has been launched to accelerate cancer research that aims to increase the availability of therapies to more patients and, ultimately, to eradicate cancer. Mass spectrometry-based proteomics has been extensively used to study the molecular mechanisms of cancer, to define molecular subtypes of tumors, to map cancer-associated protein interaction networks and post-translational modifications, and to aid in the development of new therapeutics and new diagnostic and prognostic tests. To establish the basis for our melanoma studies, we have established the Southern Sweden Malignant Melanoma Biobank. Tissues collected over many years have been accurately characterized with respect to the tumor and patient information. The extreme variability displayed in the protein profiles and the detection of missense mutations has confirmed the complexity and heterogeneity of the disease. It is envisaged that the combined analysis of clinical, histological, and proteomic data will provide patients with a more personalized medical treatment. With respect to disease presentation, targeted treatment and medical mass spectrometry analysis and imaging, this overview report will outline and summarize the current achievements and status within malignant melanoma. We present data generated by our cancer research center in Lund, Sweden, where we have built extensive capabilities in biobanking, proteogenomics, and patient treatments over an extensive time period.
Collapse
Affiliation(s)
- A Marcell Szasz
- Center of Excellence in Biological and Medical Mass Spectrometry, Lund University, BMC D13, 221 84, Lund, Sweden
- Division of Oncology and Pathology, Department of Clinical Sciences Lund, Lund University, 221 85, Lund, Sweden
- Cancer Center, Semmelweis University, Budapest, 1083, Hungary
- MTA-TTK Momentum Oncology Biomarker Research Group, Hungarian Academy of Sciences, Budapest, 1117, Hungary
| | - Johan Malm
- Center of Excellence in Biological and Medical Mass Spectrometry, Lund University, BMC D13, 221 84, Lund, Sweden
- Department of Oncology, Lund University, Skåne University Hospital, 221 85, Lund, Sweden
- Department of Translational Medicine, Section for Clinical Chemistry, Lund University, Skåne University Hospital Malmö, 205 02, Malmö, Sweden
| | - Melinda Rezeli
- Clinical Protein Science and Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84, Lund, Sweden
| | - Yutaka Sugihara
- Center of Excellence in Biological and Medical Mass Spectrometry, Lund University, BMC D13, 221 84, Lund, Sweden
- Clinical Protein Science and Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84, Lund, Sweden
| | - Lazaro H Betancourt
- Center of Excellence in Biological and Medical Mass Spectrometry, Lund University, BMC D13, 221 84, Lund, Sweden
- Clinical Protein Science and Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84, Lund, Sweden
| | - Daniel Rivas
- Institute of Environmental Sciences and Water Research, IDAEA, Spanish Research Council (CSIC), Barcelona, Spain
| | - Balázs Gyorffy
- MTA-TTK Momentum Oncology Biomarker Research Group, Hungarian Academy of Sciences, Budapest, 1117, Hungary
- 2nd Department of Pediatrics, Semmelweis University, Budapest, 1094, Hungary
| | - György Marko-Varga
- Center of Excellence in Biological and Medical Mass Spectrometry, Lund University, BMC D13, 221 84, Lund, Sweden.
- Clinical Protein Science and Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84, Lund, Sweden.
- Division of Life Science and Biotechnology, Yonsei University, Soel, Korea.
| |
Collapse
|
5
|
Martin FC, Ang CS, Gardner DK, Renfree MB, Shaw G. Uterine flushing proteome of the tammar wallaby after reactivation from diapause. Reproduction 2016; 152:491-505. [DOI: 10.1530/rep-16-0154] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Accepted: 08/01/2016] [Indexed: 01/15/2023]
Abstract
The marsupial tammar wallaby has the longest period of embryonic diapause of any mammal, up to 11 months, during which there is no cell division or blastocyst growth. Since the blastocyst in diapause is surrounded by acellular coats, the signals that maintain or terminate diapause involve factors that reside in uterine secretions. The nature of such factors remains to be resolved. In this study, uterine flushings (UFs) were used to assess changes in uterine secretions of tammars using liquid chromatography–mass spectrometry (LC–MS/MS) during diapause (day 0 and 3) and reactivation days (d) 4, 5, 6, 8, 9, 11 and 24 after removal of pouch young (RPY), which initiates embryonic development. This study supports earlier suggestions that the presence of specific factors stimulate reactivation, early embryonic growth and cell proliferation. A mitogen, hepatoma-derived growth factor and soluble epidermal growth factor receptors were observed from d3 until at least d11 RPY when these secreted proteins constituted 21% of the UF proteome. Binding of these factors to specific cellular receptors or growth factors may directly stimulate DNA synthesis and division in endometrial gland cells. Proteins involved in the p53/CDKN1A (p21) cell cycle inhibition pathway were also observed in the diapause samples. Progesterone and most of the oestrogen-regulated proteins were present in the UF after d3, which is concomitant with the start of blastocyst mitoses at d4. We propose that once the p21 inhibition of the cell cycle is lost, growth factors including HDGF and EGFR are responsible for reactivation of the diapausing blastocyst via the uterine secretions.
Collapse
|
6
|
Chang J, Kim Y, Kwon HJ. Advances in identification and validation of protein targets of natural products without chemical modification. Nat Prod Rep 2016; 33:719-30. [DOI: 10.1039/c5np00107b] [Citation(s) in RCA: 79] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
This review focuses on and reports case studies of the latest advances in target protein identification methods for label-free natural products. The integration of newly developed technologies will provide new insights and highlight the value of natural products for use as biological probes and new drug candidates.
Collapse
Affiliation(s)
- J. Chang
- Department of Biotechnology
- Translational Research Center for Protein Function Control
- College of Life Science & Biotechnology
- Yonsei University
- Seoul 120-749
| | - Y. Kim
- Department of Biotechnology
- Translational Research Center for Protein Function Control
- College of Life Science & Biotechnology
- Yonsei University
- Seoul 120-749
| | - H. J. Kwon
- Department of Biotechnology
- Translational Research Center for Protein Function Control
- College of Life Science & Biotechnology
- Yonsei University
- Seoul 120-749
| |
Collapse
|
7
|
Cobice DF, Goodwin RJA, Andren PE, Nilsson A, Mackay CL, Andrew R. Future technology insight: mass spectrometry imaging as a tool in drug research and development. Br J Pharmacol 2015; 172:3266-83. [PMID: 25766375 DOI: 10.1111/bph.13135] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2014] [Revised: 02/09/2015] [Accepted: 03/03/2015] [Indexed: 12/14/2022] Open
Abstract
In pharmaceutical research, understanding the biodistribution, accumulation and metabolism of drugs in tissue plays a key role during drug discovery and development. In particular, information regarding pharmacokinetics, pharmacodynamics and transport properties of compounds in tissues is crucial during early screening. Historically, the abundance and distribution of drugs have been assessed by well-established techniques such as quantitative whole-body autoradiography (WBA) or tissue homogenization with LC/MS analysis. However, WBA does not distinguish active drug from its metabolites and LC/MS, while highly sensitive, does not report spatial distribution. Mass spectrometry imaging (MSI) can discriminate drug and its metabolites and endogenous compounds, while simultaneously reporting their distribution. MSI data are influencing drug development and currently used in investigational studies in areas such as compound toxicity. In in vivo studies MSI results may soon be used to support new drug regulatory applications, although clinical trial MSI data will take longer to be validated for incorporation into submissions. We review the current and future applications of MSI, focussing on applications for drug discovery and development, with examples to highlight the impact of this promising technique in early drug screening. Recent sample preparation and analysis methods that enable effective MSI, including quantitative analysis of drugs from tissue sections will be summarized and key aspects of methodological protocols to increase the effectiveness of MSI analysis for previously undetectable targets addressed. These examples highlight how MSI has become a powerful tool in drug research and development and offers great potential in streamlining the drug discovery process.
Collapse
Affiliation(s)
- D F Cobice
- University/British Heart Foundation Centre for Cardiovascular Science, Queen's Medical Research Institute, University of Edinburgh, Edinburgh, UK
| | - R J A Goodwin
- Drug Metabolism and Distribution, Mass Spectrometry Imaging, AstraZeneca R&D, Macclesfield, UK
| | - P E Andren
- Biomolecular Imaging and Proteomics, National Center for Mass Spectrometry Imaging, Department of Pharmaceutical Biosciences, Uppsala University, Uppsala, Sweden
| | - A Nilsson
- Biomolecular Imaging and Proteomics, National Center for Mass Spectrometry Imaging, Department of Pharmaceutical Biosciences, Uppsala University, Uppsala, Sweden
| | - C L Mackay
- SIRCAMS, School of Chemistry, University of Edinburgh, Edinburgh, UK
| | - R Andrew
- University/British Heart Foundation Centre for Cardiovascular Science, Queen's Medical Research Institute, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
8
|
Jaumot J, Tauler R. Potential use of multivariate curve resolution for the analysis of mass spectrometry images. Analyst 2015; 140:837-46. [DOI: 10.1039/c4an00801d] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
The application of MCR-ALS to mass spectrometry imaging data provides spatial distribution and MS spectra of components, allowing compound identification.
Collapse
Affiliation(s)
- Joaquim Jaumot
- Department of Environmental Chemistry
- IDAEA-CSIC
- Barcelona 08034
- Spain
| | - Romà Tauler
- Department of Environmental Chemistry
- IDAEA-CSIC
- Barcelona 08034
- Spain
| |
Collapse
|
9
|
Crecelius AC, Schubert US, von Eggeling F. MALDI mass spectrometric imaging meets “omics”: recent advances in the fruitful marriage. Analyst 2015; 140:5806-20. [DOI: 10.1039/c5an00990a] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Matrix-assisted laser desorption/ionization mass spectrometric imaging (MALDI MSI) is a method that allows the investigation of the molecular content of surfaces, in particular, tissues, within its morphological context.
Collapse
Affiliation(s)
- A. C. Crecelius
- Laboratory of Organic and Macromolecular Chemistry (IOMC)
- Friedrich Schiller University Jena
- 07743 Jena
- Germany
- Jena Center for Soft Matter (JCSM)
| | - U. S. Schubert
- Laboratory of Organic and Macromolecular Chemistry (IOMC)
- Friedrich Schiller University Jena
- 07743 Jena
- Germany
- Jena Center for Soft Matter (JCSM)
| | - F. von Eggeling
- Jena Center for Soft Matter (JCSM)
- Friedrich Schiller University Jena
- 07743 Jena
- Germany
- Institute of Physical Chemistry
| |
Collapse
|
10
|
Verbeeck N, Yang J, De Moor B, Caprioli R, Waelkens E, Van de Plas R. Automated anatomical interpretation of ion distributions in tissue: linking imaging mass spectrometry to curated atlases. Anal Chem 2014; 86:8974-82. [PMID: 25153352 PMCID: PMC4165455 DOI: 10.1021/ac502838t] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2014] [Accepted: 07/30/2014] [Indexed: 12/17/2022]
Abstract
Imaging mass spectrometry (IMS) has become a prime tool for studying the distribution of biomolecules in tissue. Although IMS data sets can become very large, computational methods have made it practically feasible to search these experiments for relevant findings. However, these methods lack access to an important source of information that many human interpretations rely upon: anatomical insight. In this work, we address this need by (1) integrating a curated anatomical data source with an empirically acquired IMS data source, establishing an algorithm-accessible link between them and (2) demonstrating the potential of such an IMS-anatomical atlas link by applying it toward automated anatomical interpretation of ion distributions in tissue. The concept is demonstrated in mouse brain tissue, using the Allen Mouse Brain Atlas as the curated anatomical data source that is linked to MALDI-based IMS experiments. We first develop a method to spatially map the anatomical atlas to the IMS data sets using nonrigid registration techniques. Once a mapping is established, a second computational method, called correlation-based querying, gives an elementary demonstration of the link by delivering basic insight into relationships between ion images and anatomical structures. Finally, a third algorithm moves further beyond both registration and correlation by providing automated anatomical interpretation of ion images. This task is approached as an optimization problem that deconstructs ion distributions as combinations of known anatomical structures. We demonstrate that establishing a link between an IMS experiment and an anatomical atlas enables automated anatomical annotation, which can serve as an important accelerator both for human and machine-guided exploration of IMS experiments.
Collapse
Affiliation(s)
- Nico Verbeeck
- Department
of Electrical Engineering (ESAT), STADIUS Center for
Dynamical Systems, Signal Processing, and Data Analytics, KU Leuven, Kasteelpark
Arenberg 10, Box 2446, 3001 Leuven, Belgium
- iMinds
Medical IT, Kasteelpark
Arenberg 10, Box 2446, 3001 Leuven, Belgium
| | - Junhai Yang
- Mass
Spectrometry Research Center and Departments of Biochemistry, Chemistry,
Pharmacology, and Medicine, Vanderbilt University, 465 21st Avenue South, MRB III Suite
9160, Nashville, Tennessee 37232, United States
| | - Bart De Moor
- Department
of Electrical Engineering (ESAT), STADIUS Center for
Dynamical Systems, Signal Processing, and Data Analytics, KU Leuven, Kasteelpark
Arenberg 10, Box 2446, 3001 Leuven, Belgium
- iMinds
Medical IT, Kasteelpark
Arenberg 10, Box 2446, 3001 Leuven, Belgium
| | - Richard
M. Caprioli
- Mass
Spectrometry Research Center and Departments of Biochemistry, Chemistry,
Pharmacology, and Medicine, Vanderbilt University, 465 21st Avenue South, MRB III Suite
9160, Nashville, Tennessee 37232, United States
| | - Etienne Waelkens
- Sybioma, KU Leuven, Campus Gasthuisberg O&N 2, Herestraat 49, Box
802, 3000 Leuven, Belgium
- Department
of Cellular and Molecular Medicine, KU Leuven, Campus Gasthuisberg O&N 1, Herestraat
49, Box 901, 3000 Leuven, Belgium
| | - Raf Van de Plas
- Mass
Spectrometry Research Center and Departments of Biochemistry, Chemistry,
Pharmacology, and Medicine, Vanderbilt University, 465 21st Avenue South, MRB III Suite
9160, Nashville, Tennessee 37232, United States
| |
Collapse
|