1
|
Liu J, Bao C, Zhang J, Han Z, Fang H, Lu H. Artificial intelligence with mass spectrometry-based multimodal molecular profiling methods for advancing therapeutic discovery of infectious diseases. Pharmacol Ther 2024; 263:108712. [PMID: 39241918 DOI: 10.1016/j.pharmthera.2024.108712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 07/22/2024] [Accepted: 09/03/2024] [Indexed: 09/09/2024]
Abstract
Infectious diseases, driven by a diverse array of pathogens, can swiftly undermine public health systems. Accurate diagnosis and treatment of infectious diseases-centered around the identification of biomarkers and the elucidation of disease mechanisms-are in dire need of more versatile and practical analytical approaches. Mass spectrometry (MS)-based molecular profiling methods can deliver a wealth of information on a range of functional molecules, including nucleic acids, proteins, and metabolites. While MS-driven omics analyses can yield vast datasets, the sheer complexity and multi-dimensionality of MS data can significantly hinder the identification and characterization of functional molecules within specific biological processes and events. Artificial intelligence (AI) emerges as a potent complementary tool that can substantially enhance the processing and interpretation of MS data. AI applications in this context lead to the reduction of spurious signals, the improvement of precision, the creation of standardized analytical frameworks, and the increase of data integration efficiency. This critical review emphasizes the pivotal roles of MS based omics strategies in the discovery of biomarkers and the clarification of infectious diseases. Additionally, the review underscores the transformative ability of AI techniques to enhance the utility of MS-based molecular profiling in the field of infectious diseases by refining the quality and practicality of data produced from omics analyses. In conclusion, we advocate for a forward-looking strategy that integrates AI with MS-based molecular profiling. This integration aims to transform the analytical landscape and the performance of biological molecule characterization, potentially down to the single-cell level. Such advancements are anticipated to propel the development of AI-driven predictive models, thus improving the monitoring of diagnostics and therapeutic discovery for the ongoing challenge related to infectious diseases.
Collapse
Affiliation(s)
- Jingjing Liu
- School of Chinese Medicine, Hong Kong Traditional Chinese Medicine Phenome Research Center, State Key Laboratory of Environmental and Biological Analysis, Hong Kong Baptist University, Hong Kong 999077, China
| | - Chaohui Bao
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Jiaxin Zhang
- School of Chinese Medicine, Hong Kong Traditional Chinese Medicine Phenome Research Center, State Key Laboratory of Environmental and Biological Analysis, Hong Kong Baptist University, Hong Kong 999077, China
| | - Zeguang Han
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China.
| | - Hai Fang
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China.
| | - Haitao Lu
- School of Chinese Medicine, Hong Kong Traditional Chinese Medicine Phenome Research Center, State Key Laboratory of Environmental and Biological Analysis, Hong Kong Baptist University, Hong Kong 999077, China; Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China; Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China.
| |
Collapse
|
2
|
Russo FF, Nowatzky Y, Jaeger C, Parr MK, Benner P, Muth T, Lisec J. Machine learning methods for compound annotation in non-targeted mass spectrometry-A brief overview of fingerprinting, in silico fragmentation and de novo methods. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2024; 38:e9876. [PMID: 39180507 DOI: 10.1002/rcm.9876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Revised: 07/03/2024] [Accepted: 07/12/2024] [Indexed: 08/26/2024]
Abstract
Non-targeted screenings (NTS) are essential tools in different fields, such as forensics, health and environmental sciences. NTSs often employ mass spectrometry (MS) methods due to their high throughput and sensitivity in comparison to, for example, nuclear magnetic resonance-based methods. As the identification of mass spectral signals, called annotation, is labour intensive, it has been used for developing supporting tools based on machine learning (ML). However, both the diversity of mass spectral signals and the sheer quantity of different ML tools developed for compound annotation present a challenge for researchers in maintaining a comprehensive overview of the field. In this work, we illustrate which ML-based methods are available for compound annotation in non-targeted MS experiments and provide a nuanced comparison of the ML models used in MS data analysis, unravelling their unique features and performance metrics. Through this overview we support researchers to judiciously apply these tools in their daily research. This review also offers a detailed exploration of methods and datasets to show gaps in current methods, and promising target areas, offering a starting point for developers intending to improve existing methodologies.
Collapse
Affiliation(s)
- Francesco F Russo
- Department of Analytical Chemistry and Reference Materials, Organic Trace Analysis and Food Analysis, Bundesanstalt für Materialforschung und -prüfung (BAM), Berlin, Germany
| | - Yannek Nowatzky
- eScience, Bundesanstalt für Materialprüfung und -forschung, Berlin, Germany
| | - Carsten Jaeger
- Department of Analytical Chemistry and Reference Materials, Environmental Analysis, Bundesanstalt für Materialforschung und -prüfung (BAM), Berlin, Germany
| | - Maria K Parr
- Institute of Pharmacy, Pharmaceutical and Medicinal Chemistry (Pharmaceutical Analyses), Freie Universität, Berlin, Germany
| | - Phillipp Benner
- eScience, Bundesanstalt für Materialprüfung und -forschung, Berlin, Germany
| | - Thilo Muth
- Department MF 2, Domain Specific Data Competence Centre, Robert Koch Institut, Berlin, Germany
| | - Jan Lisec
- Department of Analytical Chemistry and Reference Materials, Organic Trace Analysis and Food Analysis, Bundesanstalt für Materialforschung und -prüfung (BAM), Berlin, Germany
| |
Collapse
|
3
|
Yang C, Liu YH, Zheng HK. Identification of metabolic biomarkers in idiopathic pulmonary arterial hypertension using targeted metabolomics and bioinformatics analysis. Sci Rep 2024; 14:25283. [PMID: 39455660 PMCID: PMC11511845 DOI: 10.1038/s41598-024-76514-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2024] [Accepted: 10/14/2024] [Indexed: 10/28/2024] Open
Abstract
Pulmonary arterial hypertension (PAH) is a life-threatening disease with a poor prognosis, and metabolic abnormalities play a critical role in its development. This study used metabolomics, machine learning algorithms and bioinformatics to screen for potential metabolic biomarkers associated with the diagnosis of PAH. In this study, plasma samples were collected from 17 patients diagnosed with idiopathic pulmonary arterial hypertension (IPAH) and 20 healthy controls. Plasma metabolomic profiling was performed by high-performance liquid chromatography-mass spectrometry. Gene profiles of PAH patients were obtained from the GEO database. Key differentially expressed metabolites (DEMs) and metabolism-related genes were subsequently identified using machine learning algorithms. Twenty differential plasma metabolites associated with IPAH were identified (VIP score > 1 and p < 0 0.05), and enrichment analysis revealed the arginine biosynthesis pathway as the most altered pathway. Using machine learning models, including least absolute shrinkage and selection operator (LASSO), random forest (RF) and support vector machine (SVM), we extracted key metabolites that correlated with clinical phenotypes. Our results suggested that five metabolites, kynurenine, homoserine, tryptophan, AMP, and spermine, are potential biomarkers for IPAH. Bioinformatics analysis also identified 3 metabolism-related genes, MAPK6, SLC7A11 and CDC42BPA, that are strongly correlated with pulmonary hypertension, demonstrating strong predictive power and clinical relevance. Our findings revealed some key genes associated with metabolism in PH, and provided crucial information about complex metabolic reprogramming signals and may lead to the identification of useful metabolic biomarkers for the diagnosis of PAH.
Collapse
Affiliation(s)
- Chuang Yang
- Department of cardiology, The Second Hospital of Jilin University, No.218 Ziqiang Street, Changchun, 130000, China
| | - Yi-Hang Liu
- Department of cardiology, The Second Hospital of Jilin University, No.218 Ziqiang Street, Changchun, 130000, China
| | - Hai-Kuo Zheng
- Department of cardiology, China-Japan Union Hospital of Jilin University, No.126, Xiantan Street, Changchun, 130033, China.
| |
Collapse
|
4
|
Bennett AA, Steininger-Mairinger T, Eroğlu ÇG, Gfeller A, Wirth J, Puschenreiter M, Hann S. Dual column chromatography combined with high-resolution mass spectrometry improves coverage of non-targeted analysis of plant root exudates. Anal Chim Acta 2024; 1327:343126. [PMID: 39266059 DOI: 10.1016/j.aca.2024.343126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 08/12/2024] [Accepted: 08/18/2024] [Indexed: 09/14/2024]
Abstract
BACKGROUND Within the plant kingdom, there is an exceptional amount of chemical diversity that has yet to be annotated. It is for this reason that non-targeted analysis is of interest for those working in novel natural products. To increase the number and diversity of compounds observable in root exudate extracts, several workflows which differ at three key stages were compared: 1) sample extraction, 2) chromatography, and 3) data preprocessing. RESULTS Plants were grown in Hoagland's solution for two weeks, and exudates were initially extracted with water, followed by a 24-h regeneration period with subsequent extraction using methanol. Utilizing the second extraction showed improved results with less ion suppression and reduced retention time shifting compared to the first extraction. A single column method, utilizing a pentafluorophenyl column, paired with high-resolution mass spectrometry ionized and correctly identified 34 mock root exudate compounds, while the dual column method, incorporating a pentafluorophenyl column and a porous graphitic carbon column, retained and identified 43 compounds. In a pooled quality control sample of exudate extracts, the single column method detected 1,444 compounds. While the dual method detected fewer compounds overall (1,050), it revealed a larger number of small polar compounds. Three preprocessing methods (targeted, proprietary, and open source) successfully identified 43, 31, and 38 mock root exudate compounds to confidence level 1, respectively. SIGNIFICANCE Enhancing signal strength and analytical method stability involves removing the high ionic strength nutrient solution before sampling root exudate extracts. Despite signal intensity loss, a dual column method enhances compound coverage, particularly for small polar metabolites. Open-source software proves a viable alternative for non-targeted analysis, even surpassing proprietary software in peak picking.
Collapse
Affiliation(s)
- Alexandra A Bennett
- BOKU University, Department of Chemistry, Institute of Analytical Chemistry, 1190, Vienna, Austria
| | | | - Çağla Görkem Eroğlu
- Agroscope, Herbology in Field Crops, Plant Production Systems, Nyon, Switzerland
| | - Aurélie Gfeller
- Agroscope, Herbology in Field Crops, Plant Production Systems, Nyon, Switzerland
| | - Judith Wirth
- Agroscope, Herbology in Field Crops, Plant Production Systems, Nyon, Switzerland
| | - Markus Puschenreiter
- BOKU University, Department of Forest and Soil Sciences, Institute of Soil Research, 3430, Tulln, Austria
| | - Stephan Hann
- BOKU University, Department of Chemistry, Institute of Analytical Chemistry, 1190, Vienna, Austria
| |
Collapse
|
5
|
Zhang H, Yang Q, Xie T, Wang Y, Zhang Z, Lu H. MSBERT: Embedding Tandem Mass Spectra into Chemically Rational Space by Mask Learning and Contrastive Learning. Anal Chem 2024; 96:16599-16608. [PMID: 39397717 DOI: 10.1021/acs.analchem.4c02426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2024]
Abstract
Tandem mass spectrometry (MS/MS) is a powerful technique for chemical analysis in many areas of science. The vast MS/MS spectral data generated in liquid chromatography-mass spectrometry (LC-MS) experiments require efficient analysis and interpretation methods for the following compound identification. In this study, we propose MSBERT based on self-supervised learning strategies to embed MS/MS spectra into reasonable embeddings for efficient compound identification. It adopts the transformer encoder as the backbone for mask learning and uses the same spectra with different masks for contrastive learning. MSBERT is trained on the GNPS data set and tested on the GNPS data set, the MoNA data set, and the MTBLS1572 data set. It exhibits enhanced library matching and analogous compound searching capabilities compared to existing methods. The recalls at 1, 5, and 10 on a GNPS test subset with structures not in the training set are 0.7871, 0.8950, and 0.9080, respectively. The results are better than those of Spec2Vec with 0.6898, 0.8276, and 0.8620, and DreaMS with 0.7158, 0.8327, and 0.8635. The rationality of embeddings is demonstrated by t-SNE visualization, structural similarity, spectra clustering, compound identification, and analogous compound searching. A user-friendly web server is provided for efficient spectral analysis, and the source code for MSBERT is available at https://github.com/zhanghailiangcsu/MSBERT.
Collapse
Affiliation(s)
- Hailiang Zhang
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China
| | - Qiong Yang
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China
| | - Ting Xie
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China
| | - Yue Wang
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China
| | - Zhimin Zhang
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China
| | - Hongmei Lu
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China
| |
Collapse
|
6
|
Petrova B, Guler AT. Recent Developments in Single-Cell Metabolomics by Mass Spectrometry─A Perspective. J Proteome Res 2024. [PMID: 39437423 DOI: 10.1021/acs.jproteome.4c00646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2024]
Abstract
Recent advancements in single-cell (sc) resolution analyses, particularly in sc transcriptomics and sc proteomics, have revolutionized our ability to probe and understand cellular heterogeneity. The study of metabolism through small molecules, metabolomics, provides an additional level of information otherwise unattainable by transcriptomics or proteomics by shedding light on the metabolic pathways that translate gene expression into functional outcomes. Metabolic heterogeneity, critical in health and disease, impacts developmental outcomes, disease progression, and treatment responses. However, dedicated approaches probing the sc metabolome have not reached the maturity of other sc omics technologies. Over the past decade, innovations in sc metabolomics have addressed some of the practical limitations, including cell isolation, signal sensitivity, and throughput. To fully exploit their potential in biological research, however, remaining challenges must be thoroughly addressed. Additionally, integrating sc metabolomics with orthogonal sc techniques will be required to validate relevant results and gain systems-level understanding. This perspective offers a broad-stroke overview of recent mass spectrometry (MS)-based sc metabolomics advancements, focusing on ongoing challenges from a biologist's viewpoint, aimed at addressing pertinent and innovative biological questions. Additionally, we emphasize the use of orthogonal approaches and showcase biological systems that these sophisticated methodologies are apt to explore.
Collapse
Affiliation(s)
- Boryana Petrova
- Medical University of Vienna, Vienna 1090, Austria
- Department of Pathology, Boston Children's Hospital, Boston, Massachusetts 02115, United States
| | - Arzu Tugce Guler
- Department of Pathology, Boston Children's Hospital, Boston, Massachusetts 02115, United States
- Institute for Experiential AI, Northeastern University, Boston, Massachusetts 02115, United States
| |
Collapse
|
7
|
Nishitsuji R, Nakashima T, Hisamoto H, Endo T. Simultaneous Recognition and Detection of Adenosine Phosphates by Machine Learning Analysis for Surface-Enhanced Raman Scattering Spectral Data. SENSORS (BASEL, SWITZERLAND) 2024; 24:6648. [PMID: 39460128 PMCID: PMC11511347 DOI: 10.3390/s24206648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2024] [Revised: 10/09/2024] [Accepted: 10/11/2024] [Indexed: 10/28/2024]
Abstract
Adenosine phosphates (adenosine 5'-monophosphate (AMP), adenosine 5'-diphosphate (ADP), and adenosine 5'-triphosphate (ATP)) play important roles in energy storage and signal transduction in the human body. Thus, a measurement method that simultaneously recognizes and detects adenosine phosphates is necessary to gain insight into complex energy-relevant biological processes. Surface-enhanced Raman scattering (SERS) is a powerful technique for this purpose. However, the similarities in size, charge, and structure of adenosine phosphates (APs) make their simultaneous recognition and detection difficult. Although approaches that combine SERS and machine learning have been studied, they require massive quantities of training data. In this study, limited AP spectral data were obtained using fabricated gold nanostructures for SERS measurements. The training data were created by feature selection and data augmentation after preprocessing the small amount of acquired spectral data. The performances of several machine learning models trained on these generated training data were compared. Multilayer perceptron model successfully detected the presence of AMP, ADP, and ATP with an accuracy of 0.914. Consequently, this study establishes a new measurement system that enables the highly accurate recognition and detection of adenosine phosphates from limited SERS spectral data.
Collapse
Affiliation(s)
- Ryosuke Nishitsuji
- Department of Information Networking, Graduate School of Information Science and Technology, Osaka University, 2-8 Yamadaoka, Suita 565-0871, Osaka, Japan;
| | - Tomoharu Nakashima
- Department of Interdisciplinary Informatics, Graduate School of Informatics, Osaka Metropolitan University, 1-1 Gakuencho, Nakaku, Sakai 599-8531, Osaka, Japan;
| | - Hideaki Hisamoto
- Department of Applied Chemistry, Osaka Metropolitan University, 1-1 Gakuencho, Nakaku, Sakai 599-8531, Osaka, Japan;
| | - Tatsuro Endo
- Department of Applied Chemistry, Osaka Metropolitan University, 1-1 Gakuencho, Nakaku, Sakai 599-8531, Osaka, Japan;
| |
Collapse
|
8
|
Gessler A, Wieloch T, Saurer M, Lehmann MM, Werner RA, Kammerer B. The marriage between stable isotope ecology and plant metabolomics - new perspectives for metabolic flux analysis and the interpretation of ecological archives. THE NEW PHYTOLOGIST 2024; 244:21-31. [PMID: 39021246 DOI: 10.1111/nph.19973] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Accepted: 07/01/2024] [Indexed: 07/20/2024]
Abstract
Even though they share many thematical overlaps, plant metabolomics and stable isotope ecology have been rather separate fields mainly due to different mass spectrometry demands. New high-resolution bioanalytical mass spectrometers are now not only offering high-throughput metabolite identification but are also suitable for compound- and intramolecular position-specific isotope analysis in the natural isotope abundance range. In plant metabolomics, label-free metabolic pathway and metabolic flux analysis might become possible when applying this new technology. This is because changes in the commitment of substrates to particular metabolic pathways and the activation or deactivation of others alter enzyme-specific isotope effects. This leads to differences in intramolecular and compound-specific isotope compositions. In plant isotope ecology, position-specific isotope analysis in plant archives informed by metabolic pathway analysis could be used to reconstruct and separate environmental impacts on complex metabolic processes. A technology-driven linkage between the two disciplines could allow us to extract information on environment-metabolism interaction from plant archives such as tree rings but also within ecosystems. This would contribute to a holistic understanding of how plants react to environmental drivers, thus also providing helpful information on the trajectories of the vegetation under the conditions to come.
Collapse
Affiliation(s)
- Arthur Gessler
- Institute of Terrestrial Ecosystems, ETH Zurich, 8092, Zurich, Switzerland
- Ecosystem Ecology, Swiss Federal Research Institute WSL, 8903, Birmensdorf, Switzerland
| | - Thomas Wieloch
- Division of Geological and Planetary Sciences, California Institute of Technology, Pasadena, CA, 91125, USA
- Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå Plant Science Centre, 90736, Umeå, Sweden
| | - Matthias Saurer
- Ecosystem Ecology, Swiss Federal Research Institute WSL, 8903, Birmensdorf, Switzerland
| | - Marco M Lehmann
- Ecosystem Ecology, Swiss Federal Research Institute WSL, 8903, Birmensdorf, Switzerland
- Forest Soils and Biogeochemistry, Swiss Federal Research Institute WSL, 8903, Birmensdorf, Switzerland
| | - Roland A Werner
- Institute of Agricultural Sciences, ETH Zurich, 8092, Zurich, Switzerland
| | - Bernd Kammerer
- Core Competence Metabolomics, Albert-Ludwigs-University Freiburg, 79104, Freiburg, Germany
| |
Collapse
|
9
|
Pakkir Shah AK, Walter A, Ottosson F, Russo F, Navarro-Diaz M, Boldt J, Kalinski JCJ, Kontou EE, Elofson J, Polyzois A, González-Marín C, Farrell S, Aggerbeck MR, Pruksatrakul T, Chan N, Wang Y, Pöchhacker M, Brungs C, Cámara B, Caraballo-Rodríguez AM, Cumsille A, de Oliveira F, Dührkop K, El Abiead Y, Geibel C, Graves LG, Hansen M, Heuckeroth S, Knoblauch S, Kostenko A, Kuijpers MCM, Mildau K, Papadopoulos Lambidis S, Portal Gomes PW, Schramm T, Steuer-Lodd K, Stincone P, Tayyab S, Vitale GA, Wagner BC, Xing S, Yazzie MT, Zuffa S, de Kruijff M, Beemelmanns C, Link H, Mayer C, van der Hooft JJJ, Damiani T, Pluskal T, Dorrestein P, Stanstrup J, Schmid R, Wang M, Aron A, Ernst M, Petras D. Statistical analysis of feature-based molecular networking results from non-targeted metabolomics data. Nat Protoc 2024:10.1038/s41596-024-01046-3. [PMID: 39304763 DOI: 10.1038/s41596-024-01046-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 07/02/2024] [Indexed: 09/22/2024]
Abstract
Feature-based molecular networking (FBMN) is a popular analysis approach for liquid chromatography-tandem mass spectrometry-based non-targeted metabolomics data. While processing liquid chromatography-tandem mass spectrometry data through FBMN is fairly streamlined, downstream data handling and statistical interrogation are often a key bottleneck. Especially users new to statistical analysis struggle to effectively handle and analyze complex data matrices. Here we provide a comprehensive guide for the statistical analysis of FBMN results, focusing on the downstream analysis of the FBMN output table. We explain the data structure and principles of data cleanup and normalization, as well as uni- and multivariate statistical analysis of FBMN results. We provide explanations and code in two scripting languages (R and Python) as well as the QIIME2 framework for all protocol steps, from data clean-up to statistical analysis. All code is shared in the form of Jupyter Notebooks ( https://github.com/Functional-Metabolomics-Lab/FBMN-STATS ). Additionally, the protocol is accompanied by a web application with a graphical user interface ( https://fbmn-statsguide.gnps2.org/ ) to lower the barrier of entry for new users and for educational purposes. Finally, we also show users how to integrate their statistical results into the molecular network using the Cytoscape visualization tool. Throughout the protocol, we use a previously published environmental metabolomics dataset for demonstration purposes. Together, the protocol, code and web application provide a complete guide and toolbox for FBMN data integration, cleanup and advanced statistical analysis, enabling new users to uncover molecular insights from their non-targeted metabolomics data. Our protocol is tailored for the seamless analysis of FBMN results from Global Natural Products Social Molecular Networking and can be easily adapted to other mass spectrometry feature detection, annotation and networking tools.
Collapse
Affiliation(s)
- Abzer K Pakkir Shah
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Axel Walter
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
- Applied Bioinformatics, Department of Computer Science, University of Tübingen, Tübingen, Germany
| | - Filip Ottosson
- Section for Clinical Mass Spectrometry, Danish Center for Neonatal Screening, Department of Congenital Disorders, Statens Serum Institut, Copenhagen S, Denmark
| | - Francesco Russo
- Section for Clinical Mass Spectrometry, Danish Center for Neonatal Screening, Department of Congenital Disorders, Statens Serum Institut, Copenhagen S, Denmark
| | - Marcelo Navarro-Diaz
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Judith Boldt
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
- German Center for Infection Research, Partner Site Braunschweig-Hannover, Braunschweig, Germany
| | - Jarmo-Charles J Kalinski
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Department of Biochemistry and Microbiology, Rhodes University, Makhanda, South Africa
| | - Eftychia Eva Kontou
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- The Novo Nordisk Foundation for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
| | - James Elofson
- Department of Chemistry and Biochemistry, University of Denver, Denver, CO, USA
| | - Alexandros Polyzois
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Boyce Thompson Institute and Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY, USA
| | - Carolina González-Marín
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Universidad EAFIT, Medellín, Antioquia, Colombia
| | - Shane Farrell
- Bigelow Laboratory for Ocean Sciences, East Boothbay, ME, USA
- School of Marine Sciences, Darling Marine Center, University of Maine, Walpole, ME, USA
| | - Marie R Aggerbeck
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Department of Environmental Science, Aarhus University, Roskilde, Denmark
| | - Thapanee Pruksatrakul
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- National Center for Genetic Engineering and Biotechnology, National Science and Technology Development Agency, Thailand Science Park, Pathum Thani, Thailand
| | - Nathan Chan
- Department of Computer Science, University of California Riverside, Riverside, CA, USA
| | - Yunshu Wang
- Department of Computer Science, University of California Riverside, Riverside, CA, USA
| | - Magdalena Pöchhacker
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Department of Food Chemistry and Toxicology, University of Vienna, Vienna, Austria
| | - Corinna Brungs
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
| | - Beatriz Cámara
- Laboratorio de Microbiología Molecular y Biotecnología Ambiental, Centro de Biotecnología DAL, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | | | - Andres Cumsille
- Laboratorio de Microbiología Molecular y Biotecnología Ambiental, Centro de Biotecnología DAL, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Fernanda de Oliveira
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
- Department of Biotechnology, Engineering School of Lorena, University of São Paulo, Lorena, São Paulo, Brazil
| | - Kai Dührkop
- Department of Bioinformatics, University of Jena, Jena, Germany
| | - Yasin El Abiead
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
| | - Christian Geibel
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Lana G Graves
- Department of Environmental Systems Analysis, University of Tübingen, Tübingen, Germany
- Leibniz Institute of Freshwater Ecology and Inland Fisheries, Berlin, Germany
| | - Martin Hansen
- Department of Environmental Science, Aarhus University, Roskilde, Denmark
| | - Steffen Heuckeroth
- Institute of Inorganic and Analytical Chemistry, University of Münster, Münster, Germany
| | - Simon Knoblauch
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Anastasiia Kostenko
- Department of Chemistry and Biochemistry, University of Denver, Denver, CO, USA
| | - Mirte C M Kuijpers
- Department of Ecology, Behavior and Evolution, University of California San Diego, San Diego, CA, USA
| | - Kevin Mildau
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Department of Analytical Chemistry, University of Vienna, Vienna, Austria
- Bioinformatics Group, Wageningen University and Research, Wageningen, the Netherlands
| | | | - Paulo Wender Portal Gomes
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
| | - Tilman Schramm
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
- Department of Biochemistry, University of California Riverside, Riverside, CA, USA
| | - Karoline Steuer-Lodd
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
- Department of Biochemistry, University of California Riverside, Riverside, CA, USA
| | - Paolo Stincone
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Sibgha Tayyab
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Giovanni Andrea Vitale
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Berenike C Wagner
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Shipei Xing
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
| | - Marquis T Yazzie
- Department of Chemistry and Biochemistry, University of Denver, Denver, CO, USA
| | - Simone Zuffa
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
| | - Martinus de Kruijff
- Helmholtz Institute for Pharmaceutical Research Saarland, Helmholtz Centre for Infection Research, Saarbrücken, Germany
| | - Christine Beemelmanns
- Helmholtz Institute for Pharmaceutical Research Saarland, Helmholtz Centre for Infection Research, Saarbrücken, Germany
- Saarland University, Saarbrücken, Germany
| | - Hannes Link
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Christoph Mayer
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany
| | - Justin J J van der Hooft
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Bioinformatics Group, Wageningen University and Research, Wageningen, the Netherlands
- Department of Biochemistry, University of Johannesburg, Johannesburg, South Africa
| | - Tito Damiani
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
| | - Tomáš Pluskal
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
| | - Pieter Dorrestein
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego, CA, USA
| | - Jan Stanstrup
- Department of Nutrition, Exercise and Sports, University of Copenhagen, Frederiksberg C, Denmark
| | - Robin Schmid
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
| | - Mingxun Wang
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Department of Computer Science, University of California Riverside, Riverside, CA, USA
| | - Allegra Aron
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA
- Department of Chemistry and Biochemistry, University of Denver, Denver, CO, USA
| | - Madeleine Ernst
- Section for Clinical Mass Spectrometry, Danish Center for Neonatal Screening, Department of Congenital Disorders, Statens Serum Institut, Copenhagen S, Denmark.
| | - Daniel Petras
- Virtual Multi-Omics Laboratory, The Internet, Riverside, CA, USA.
- University of Tübingen, Interfaculty Institute of Microbiology and Infection Medicine, Tübingen, Germany.
- Department of Biochemistry, University of California Riverside, Riverside, CA, USA.
| |
Collapse
|
10
|
Masmali I, Nadeem MF, Mufti ZS, Ahmad A, Koam ANA, Ghazwani H. Data-driven approaches to study the spectral properties of chemical structures. Heliyon 2024; 10:e37459. [PMID: 39290266 PMCID: PMC11407057 DOI: 10.1016/j.heliyon.2024.e37459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 09/03/2024] [Accepted: 09/04/2024] [Indexed: 09/19/2024] Open
Abstract
The molecular energy, which is the sum of all eigenvalues, is crucial in determining the total π-electron energy of conjugated hydrocarbon molecules. We used machine learning techniques to calculate the energy, inertia, nullity, signature, and Estrada index of molecular graphs for bismuth tri-iodide and benzene rings embedded in P-type surfaces within 2D networks. We applied MATLAB to extract the actual eigenvalues from the data and developed general equations for these molecular properties. We then used these equations to estimate the values and compared them to the actual values through graphical analysis. Our results demonstrate the potential of data-driven techniques in predicting molecular properties and enhancing our understanding of spectral theory.
Collapse
Affiliation(s)
- Ibtisam Masmali
- Department of Mathematics, College of Science, Jazan University, Jazan, 45142, Saudi Arabia
| | - Muhammad Faisal Nadeem
- Department of Mathematics, COMSATS University Islamabad Lahore Campus, Lahore, 54000, Pakistan
| | - Zeeshan Saleem Mufti
- Department of Mathematics and Statistics, The University of Lahore, Lahore, 54000, Pakistan
| | - Ali Ahmad
- Department of Computer Science, College of Engineering and Computer Science, Jazan University, Jazan 45142, Kingdom of Saudi Arabia
| | - Ali N A Koam
- Department of Mathematics, College of Science, Jazan University, Jazan, 45142, Saudi Arabia
| | - Haleemah Ghazwani
- Department of Mathematics, College of Science, Jazan University, Jazan, 45142, Saudi Arabia
| |
Collapse
|
11
|
Park S, Kim EK. Machine Learning-Based Plasma Metabolomics in Liraglutide-Treated Type 2 Diabetes Mellitus Patients and Diet-Induced Obese Mice. Metabolites 2024; 14:483. [PMID: 39330490 PMCID: PMC11434292 DOI: 10.3390/metabo14090483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2024] [Revised: 08/30/2024] [Accepted: 08/30/2024] [Indexed: 09/28/2024] Open
Abstract
Liraglutide, a glucagon-like peptide-1 receptor agonist, is effective in the treatment of type 2 diabetes mellitus (T2DM) and obesity. Despite its benefits, including improved glycemic control and weight loss, the common metabolic changes induced by liraglutide and correlations between those in rodents and humans remain unknown. Here, we used advanced machine learning techniques to analyze the plasma metabolomic data in diet-induced obese (DIO) mice and patients with T2DM treated with liraglutide. Among the machine learning models, Support Vector Machine was the most suitable for DIO mice, and Gradient Boosting was the most suitable for patients with T2DM. Through the cross-evaluation of machine learning models, we found that liraglutide promotes metabolic shifts and interspecies correlations in these shifts between DIO mice and patients with T2DM. Our comparative analysis helped identify metabolic correlations influenced by liraglutide between humans and rodents and may guide future therapeutic strategies for T2DM and obesity.
Collapse
Affiliation(s)
- Seokjae Park
- Department of Brain Sciences, Daegu Gyeongbuk Institute of Science and Technology, Daegu 42988, Republic of Korea;
- Neurometabolomics Research Center, Daegu Gyeongbuk Institute of Science and Technology, Daegu 42988, Republic of Korea
| | - Eun-Kyoung Kim
- Department of Brain Sciences, Daegu Gyeongbuk Institute of Science and Technology, Daegu 42988, Republic of Korea;
- Neurometabolomics Research Center, Daegu Gyeongbuk Institute of Science and Technology, Daegu 42988, Republic of Korea
| |
Collapse
|
12
|
Chi J, Shu J, Li M, Mudappathi R, Jin Y, Lewis F, Boon A, Qin X, Liu L, Gu H. Artificial Intelligence in Metabolomics: A Current Review. Trends Analyt Chem 2024; 178:117852. [PMID: 39071116 PMCID: PMC11271759 DOI: 10.1016/j.trac.2024.117852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Metabolomics and artificial intelligence (AI) form a synergistic partnership. Metabolomics generates large datasets comprising hundreds to thousands of metabolites with complex relationships. AI, aiming to mimic human intelligence through computational modeling, possesses extraordinary capabilities for big data analysis. In this review, we provide a recent overview of the methodologies and applications of AI in metabolomics studies in the context of systems biology and human health. We first introduce the AI concept, history, and key algorithms for machine learning and deep learning, summarizing their strengths and weaknesses. We then discuss studies that have successfully used AI across different aspects of metabolomic analysis, including analytical detection, data preprocessing, biomarker discovery, predictive modeling, and multi-omics data integration. Lastly, we discuss the existing challenges and future perspectives in this rapidly evolving field. Despite limitations and challenges, the combination of metabolomics and AI holds great promises for revolutionary advancements in enhancing human health.
Collapse
Affiliation(s)
- Jinhua Chi
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| | - Jingmin Shu
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, AZ 85281, USA
| | - Ming Li
- Phoenix VA Health Care System, Phoenix, AZ 85012, USA
- University of Arizona College of Medicine, Phoenix, AZ 85004, USA
| | - Rekha Mudappathi
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, AZ 85281, USA
| | - Yan Jin
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| | - Freeman Lewis
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| | - Alexandria Boon
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| | - Xiaoyan Qin
- College of Liberal Arts and Sciences, Arizona State University, Tempe, AZ 85281, USA
| | - Li Liu
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, AZ 85281, USA
| | - Haiwei Gu
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| |
Collapse
|
13
|
Kundu P, Beura S, Mondal S, Das AK, Ghosh A. Machine learning for the advancement of genome-scale metabolic modeling. Biotechnol Adv 2024; 74:108400. [PMID: 38944218 DOI: 10.1016/j.biotechadv.2024.108400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 05/13/2024] [Accepted: 06/23/2024] [Indexed: 07/01/2024]
Abstract
Constraint-based modeling (CBM) has evolved as the core systems biology tool to map the interrelations between genotype, phenotype, and external environment. The recent advancement of high-throughput experimental approaches and multi-omics strategies has generated a plethora of new and precise information from wide-ranging biological domains. On the other hand, the continuously growing field of machine learning (ML) and its specialized branch of deep learning (DL) provide essential computational architectures for decoding complex and heterogeneous biological data. In recent years, both multi-omics and ML have assisted in the escalation of CBM. Condition-specific omics data, such as transcriptomics and proteomics, helped contextualize the model prediction while analyzing a particular phenotypic signature. At the same time, the advanced ML tools have eased the model reconstruction and analysis to increase the accuracy and prediction power. However, the development of these multi-disciplinary methodological frameworks mainly occurs independently, which limits the concatenation of biological knowledge from different domains. Hence, we have reviewed the potential of integrating multi-disciplinary tools and strategies from various fields, such as synthetic biology, CBM, omics, and ML, to explore the biochemical phenomenon beyond the conventional biological dogma. How the integrative knowledge of these intersected domains has improved bioengineering and biomedical applications has also been highlighted. We categorically explained the conventional genome-scale metabolic model (GEM) reconstruction tools and their improvement strategies through ML paradigms. Further, the crucial role of ML and DL in omics data restructuring for GEM development has also been briefly discussed. Finally, the case-study-based assessment of the state-of-the-art method for improving biomedical and metabolic engineering strategies has been elaborated. Therefore, this review demonstrates how integrating experimental and in silico strategies can help map the ever-expanding knowledge of biological systems driven by condition-specific cellular information. This multiview approach will elevate the application of ML-based CBM in the biomedical and bioengineering fields for the betterment of society and the environment.
Collapse
Affiliation(s)
- Pritam Kundu
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Satyajit Beura
- Department of Bioscience and Biotechnology, Indian Institute of Technology, Kharagpur, West Bengal 721302, India
| | - Suman Mondal
- P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Amit Kumar Das
- Department of Bioscience and Biotechnology, Indian Institute of Technology, Kharagpur, West Bengal 721302, India
| | - Amit Ghosh
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India; P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India.
| |
Collapse
|
14
|
Clarke R, Bharucha T, Arman BY, Gangadharan B, Gomez Fernandez L, Mosca S, Lin Q, Van Assche K, Stokes R, Dunachie S, Deats M, Merchant HA, Caillet C, Walsby-Tickle J, Probert F, Matousek P, Newton PN, Zitzmann N, McCullagh JSO. Using matrix assisted laser desorption ionisation mass spectrometry combined with machine learning for vaccine authenticity screening. NPJ Vaccines 2024; 9:155. [PMID: 39198486 PMCID: PMC11358428 DOI: 10.1038/s41541-024-00946-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Accepted: 08/07/2024] [Indexed: 09/01/2024] Open
Abstract
The global population is increasingly reliant on vaccines to maintain population health with billions of doses used annually in immunisation programmes. Substandard and falsified vaccines are becoming more prevalent, caused by both the degradation of authentic vaccines but also deliberately falsified vaccine products. These threaten public health, and the increase in vaccine falsification is now a major concern. There is currently no coordinated global infrastructure or screening methods to monitor vaccine supply chains. In this study, we developed and validated a matrix-assisted laser desorption/ionisation-mass spectrometry (MALDI-MS) workflow that used open-source machine learning and statistical analysis to distinguish authentic and falsified vaccines. We validated the method on two different MALDI-MS instruments used worldwide for clinical applications. Our results show that multivariate data modelling and diagnostic mass spectra can be used to distinguish authentic and falsified vaccines providing proof-of-concept that MALDI-MS can be used as a screening tool to monitor vaccine supply chains.
Collapse
Affiliation(s)
- Rebecca Clarke
- Department of Chemistry, University of Oxford, Oxford, OX1 3TA, UK
| | - Tehmina Bharucha
- Department of Biochemistry, University of Oxford, Oxford, OX1 3QU, UK
- Kavli Institute for Nanoscience Discovery, University of Oxford, Oxford, OX1 3QU, UK
| | - Benediktus Yohan Arman
- Department of Biochemistry, University of Oxford, Oxford, OX1 3QU, UK
- Kavli Institute for Nanoscience Discovery, University of Oxford, Oxford, OX1 3QU, UK
| | - Bevin Gangadharan
- Department of Biochemistry, University of Oxford, Oxford, OX1 3QU, UK
- Kavli Institute for Nanoscience Discovery, University of Oxford, Oxford, OX1 3QU, UK
| | - Laura Gomez Fernandez
- Department of Biochemistry, University of Oxford, Oxford, OX1 3QU, UK
- Kavli Institute for Nanoscience Discovery, University of Oxford, Oxford, OX1 3QU, UK
| | - Sara Mosca
- Central Laser Facility, Research Complex at Harwell, STFC Rutherford Appleton Laboratory, UK Research and Innovation (UKRI), Harwell Campus, Didcot, OX11 0QX, UK
| | - Qianqi Lin
- Central Laser Facility, Research Complex at Harwell, STFC Rutherford Appleton Laboratory, UK Research and Innovation (UKRI), Harwell Campus, Didcot, OX11 0QX, UK
- Hybrid Materials for Opto-Electronics Group, Department of Molecules and Materials, MESA+ Institute for Nanotechnology, Molecules Center and Center for Brain-Inspired Nano Systems, Faculty of Science and Technology, University of Twente, 7500AE, Enschede, the Netherlands
| | - Kerlijn Van Assche
- Medicine Quality Research Group, NDM Centre for Global Health Research, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7LG, UK
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, 10400, Thailand
- Infectious Diseases Data Observatory, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7LG, UK
| | | | - Susanna Dunachie
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, 10400, Thailand
- NDM Centre for Global Health Research, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7LG, UK
- NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Michael Deats
- Medicine Quality Research Group, NDM Centre for Global Health Research, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7LG, UK
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, 10400, Thailand
- Infectious Diseases Data Observatory, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7LG, UK
| | - Hamid A Merchant
- Department of Pharmacy, School of Applied Sciences, University of Huddersfield, Huddersfield, HD1 3DH, UK
- Department of Bioscience, School of Health, Sport and Bioscience, University of East London, Water Lane, London, E15 4LZ, UK
| | - Céline Caillet
- Medicine Quality Research Group, NDM Centre for Global Health Research, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7LG, UK
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, 10400, Thailand
- Infectious Diseases Data Observatory, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7LG, UK
| | | | - Fay Probert
- Department of Chemistry, University of Oxford, Oxford, OX1 3TA, UK
| | - Pavel Matousek
- Central Laser Facility, Research Complex at Harwell, STFC Rutherford Appleton Laboratory, UK Research and Innovation (UKRI), Harwell Campus, Didcot, OX11 0QX, UK
- Medicine Quality Research Group, NDM Centre for Global Health Research, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7LG, UK
| | - Paul N Newton
- Medicine Quality Research Group, NDM Centre for Global Health Research, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7LG, UK
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, 10400, Thailand
- Infectious Diseases Data Observatory, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7LG, UK
| | - Nicole Zitzmann
- Department of Biochemistry, University of Oxford, Oxford, OX1 3QU, UK
- Kavli Institute for Nanoscience Discovery, University of Oxford, Oxford, OX1 3QU, UK
| | | |
Collapse
|
15
|
Hupatz H, Rahu I, Wang WC, Peets P, Palm EH, Kruve A. Critical review on in silico methods for structural annotation of chemicals detected with LC/HRMS non-targeted screening. Anal Bioanal Chem 2024:10.1007/s00216-024-05471-x. [PMID: 39138659 DOI: 10.1007/s00216-024-05471-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 07/22/2024] [Accepted: 07/24/2024] [Indexed: 08/15/2024]
Abstract
Non-targeted screening with liquid chromatography coupled to high-resolution mass spectrometry (LC/HRMS) is increasingly leveraging in silico methods, including machine learning, to obtain candidate structures for structural annotation of LC/HRMS features and their further prioritization. Candidate structures are commonly retrieved based on the tandem mass spectral information either from spectral or structural databases; however, the vast majority of the detected LC/HRMS features remain unannotated, constituting what we refer to as a part of the unknown chemical space. Recently, the exploration of this chemical space has become accessible through generative models. Furthermore, the evaluation of the candidate structures benefits from the complementary empirical analytical information such as retention time, collision cross section values, and ionization type. In this critical review, we provide an overview of the current approaches for retrieving and prioritizing candidate structures. These approaches come with their own set of advantages and limitations, as we showcase in the example of structural annotation of ten known and ten unknown LC/HRMS features. We emphasize that these limitations stem from both experimental and computational considerations. Finally, we highlight three key considerations for the future development of in silico methods.
Collapse
Affiliation(s)
- Henrik Hupatz
- Department of Materials and Environmental Chemistry, Stockholm University, Svante Arrhenius Väg 16, 114 18, Stockholm, Sweden
- Stockholm University Center for Circular and Sustainable Systems (SUCCeSS), Stockholm University, 106 91, Stockholm, Sweden
| | - Ida Rahu
- Department of Materials and Environmental Chemistry, Stockholm University, Svante Arrhenius Väg 16, 114 18, Stockholm, Sweden.
| | - Wei-Chieh Wang
- Department of Materials and Environmental Chemistry, Stockholm University, Svante Arrhenius Väg 16, 114 18, Stockholm, Sweden
| | - Pilleriin Peets
- Institute of Biodiversity, Faculty of Biological Science, Cluster of Excellence Balance of the Microverse, Friedrich Schiller University Jena, 07743, Jena, Germany
| | - Emma H Palm
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 Avenue du Swing, 4367, Belvaux, Luxembourg
| | - Anneli Kruve
- Department of Materials and Environmental Chemistry, Stockholm University, Svante Arrhenius Väg 16, 114 18, Stockholm, Sweden.
- Stockholm University Center for Circular and Sustainable Systems (SUCCeSS), Stockholm University, 106 91, Stockholm, Sweden.
- Department of Environmental Science, Stockholm University, Svante Arrhenius Väg 8, 114 18, Stockholm, Sweden.
| |
Collapse
|
16
|
Le K, Radović JR, MacCallum JL, Larter SR, Van Humbeck JF. Machine Learning in Complex Organic Mixtures: Applying Domain Knowledge Allows for Meaningful Performance with Small Data Sets. J Am Chem Soc 2024; 146:22563-22569. [PMID: 39082215 DOI: 10.1021/jacs.4c06595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/15/2024]
Abstract
The ability to quantify individual components of complex mixtures is a challenge found throughout the life and physical sciences. An improved capacity to generate large data sets along with the uptake of machine-learning (ML)-based analysis tools has allowed for various "omics" disciplines to realize exceptional advances. Other areas of chemistry that deal with complex mixtures often do not leverage these advances. Environmental samples, for example, can be more difficult to access, and the resulting small data sets are less appropriate for unconstrained ML approaches. Herein, we present an approach to address this latter issue. Using a very small environmental data set─35 high-resolution mass spectra gathered from various solvent extractions of Canadian petroleum fractions─we show that the application of specific domain knowledge can lead to ML models with notable performance.
Collapse
Affiliation(s)
- Katelyn Le
- Department of Chemistry, University of Calgary, Calgary, Alberta T2N 1N4, Canada
| | - Jagoš R Radović
- Center for Petroleum Geochemistry (UH-CPG), Department of Earth and Atmospheric Sciences, University of Houston, Houston, Texas 77204-5007, United States
| | - Justin L MacCallum
- Department of Chemistry, University of Calgary, Calgary, Alberta T2N 1N4, Canada
| | - Stephen R Larter
- Department of Earth, Energy, and Environment, University of Calgary, Calgary, Alberta T2N 1N4, Canada
| | | |
Collapse
|
17
|
Belete GT, Zhou L, Li KK, So PK, Do CW, Lam TC. Metabolomics studies in common multifactorial eye disorders: a review of biomarker discovery for age-related macular degeneration, glaucoma, diabetic retinopathy and myopia. Front Mol Biosci 2024; 11:1403844. [PMID: 39193222 PMCID: PMC11347317 DOI: 10.3389/fmolb.2024.1403844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Accepted: 07/24/2024] [Indexed: 08/29/2024] Open
Abstract
Introduction Multifactorial Eye disorders are a significant public health concern and have a huge impact on quality of life. The pathophysiological mechanisms underlying these eye disorders were not completely understood since functional and low-throughput biological tests were used. By identifying biomarkers linked to eye disorders, metabolomics enables early identification, tracking of the course of the disease, and personalized treatment. Methods The electronic databases of PubMed, Scopus, PsycINFO, and Web of Science were searched for research related to Age-Related macular degeneration (AMD), glaucoma, myopia, and diabetic retinopathy (DR). The search was conducted in August 2023. The number of cases and controls, the study's design, the analytical methods used, and the results of the metabolomics analysis were all extracted. Using the QUADOMICS tool, the quality of the studies included was evaluated, and metabolic pathways were examined for distinct metabolic profiles. We used MetaboAnalyst 5.0 to undertake pathway analysis of differential metabolites. Results Metabolomics studies included in this review consisted of 36 human studies (5 Age-related macular degeneration, 10 Glaucoma, 13 Diabetic retinopathy, and 8 Myopia). The most networked metabolites in AMD include glycine and adenosine monophosphate, while methionine, lysine, alanine, glyoxylic acid, and cysteine were identified in glaucoma. Furthermore, in myopia, glycerol, glutamic acid, pyruvic acid, glycine, cysteine, and oxoglutaric acid constituted significant metabolites, while glycerol, glutamic acid, lysine, citric acid, alanine, and serotonin are highly networked metabolites in cases of diabetic retinopathy. The common top metabolic pathways significantly enriched and associated with AMD, glaucoma, DR, and myopia were arginine and proline metabolism, methionine metabolism, glycine and serine metabolism, urea cycle metabolism, and purine metabolism. Conclusion This review recapitulates potential metabolic biomarkers, networks and pathways in AMD, glaucoma, DR, and myopia, providing new clues to elucidate disease mechanisms and therapeutic targets. The emergence of advanced metabolomics techniques has significantly enhanced the capability of metabolic profiling and provides novel perspectives on the metabolism and underlying pathogenesis of these multifactorial eye conditions. The advancement of metabolomics is anticipated to foster a deeper comprehension of disease etiology, facilitate the identification of novel therapeutic targets, and usher in an era of personalized medicine in eye research.
Collapse
Affiliation(s)
- Gizachew Tilahun Belete
- Centre for Myopia Research, School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| | - Lei Zhou
- Centre for Myopia Research, School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
- Research Centre for SHARP Vision (RCSV), The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
- Centre for Eye and Vision Research (CEVR), The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
- Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| | - King-Kit Li
- Centre for Myopia Research, School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| | - Pui-Kin So
- University Research Facility in Life Sciences, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| | - Chi-Wai Do
- Centre for Myopia Research, School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
- Research Centre for SHARP Vision (RCSV), The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
- Centre for Eye and Vision Research (CEVR), The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
- Research Centre for Chinese Medicine Innovation (RCMI), The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| | - Thomas Chuen Lam
- Centre for Myopia Research, School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
- Research Centre for SHARP Vision (RCSV), The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
- Centre for Eye and Vision Research (CEVR), The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
- Research Centre for Chinese Medicine Innovation (RCMI), The Hong Kong Polytechnic University, Kowloon, Hong Kong SAR, China
| |
Collapse
|
18
|
Kirschbaum C, Greis K, Torres-Boy A, Riedel J, Gewinner S, Schöllkopf W, Meijer G, Helden GV, Pagel K. Studying the Intrinsic Reactivity of Chromanes by Gas-Phase Infrared Spectroscopy. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:1950-1958. [PMID: 38950388 PMCID: PMC11311547 DOI: 10.1021/jasms.4c00216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Revised: 06/11/2024] [Accepted: 06/19/2024] [Indexed: 07/03/2024]
Abstract
Tandem mass spectrometry is routinely used for the structural analysis of organic molecules, but many fragmentation reactions are not well understood. Because several potential structures can correspond to a measured mass, the assignment of product ions is ambiguous using mass spectrometry alone. Here, we combine mass spectrometry with high-resolution gas-phase infrared spectroscopy and computational chemistry tools to identify product ion structures and derive collision-induced fragmentation mechanisms of the chromane derivatives Trolox and Methyltrolox. We find that protonated Trolox and Methyltrolox fragment identically via dehydration and decarbonylation, while deprotonated ions display substantially diverging reactivities. For deprotonated Methyltrolox, we observe unusual radical fragmentation reactions and suggest a [1,2]-Wittig rearrangement involving aryl migration in the gas phase. Overall, the combined experimental and theoretical approach presented here revealed complex proton dynamics and intramolecular rearrangement reactions, which expand our understanding on structure-reactivity relationships of isolated molecules in different protonation states.
Collapse
Affiliation(s)
- Carla Kirschbaum
- Freie
Universität Berlin, Institute of Chemistry
and Biochemistry, 14195 Berlin, Germany
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| | - Kim Greis
- Freie
Universität Berlin, Institute of Chemistry
and Biochemistry, 14195 Berlin, Germany
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| | | | - Jerome Riedel
- Freie
Universität Berlin, Institute of Chemistry
and Biochemistry, 14195 Berlin, Germany
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| | - Sandy Gewinner
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| | | | - Gerard Meijer
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| | - Gert von Helden
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| | - Kevin Pagel
- Freie
Universität Berlin, Institute of Chemistry
and Biochemistry, 14195 Berlin, Germany
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| |
Collapse
|
19
|
Li X, Zhou Chen Y, Kalia A, Zhu H, Liu LP, Hassoun S. An Ensemble Spectral Prediction (ESP) model for metabolite annotation. Bioinformatics 2024; 40:btae490. [PMID: 39180771 PMCID: PMC11344591 DOI: 10.1093/bioinformatics/btae490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 06/25/2024] [Indexed: 08/26/2024] Open
Abstract
MOTIVATION A key challenge in metabolomics is annotating measured spectra from a biological sample with chemical identities. Currently, only a small fraction of measurements can be assigned identities. Two complementary computational approaches have emerged to address the annotation problem: mapping candidate molecules to spectra, and mapping query spectra to molecular candidates. In essence, the candidate molecule with the spectrum that best explains the query spectrum is recommended as the target molecule. Despite candidate ranking being fundamental in both approaches, limited prior works incorporated rank learning tasks in determining the target molecule. RESULTS We propose a novel machine learning model, Ensemble Spectral Prediction (ESP), for metabolite annotation. ESP takes advantage of prior neural network-based annotation models that utilize multilayer perceptron (MLP) networks and Graph Neural Networks (GNNs). Based on the ranking results of the MLP- and GNN-based models, ESP learns a weighting for the outputs of MLP and GNN spectral predictors to generate a spectral prediction for a query molecule. Importantly, training data is stratified by molecular formula to provide candidate sets during model training. Further, baseline MLP and GNN models are enhanced by considering peak dependencies through label mixing and multi-tasking on spectral topic distributions. When trained on the NIST 2020 dataset and evaluated on the relevant candidate sets from PubChem, ESP improves average rank by 23.7% and 37.2% over the MLP and GNN baselines, respectively, demonstrating performance gain over state-of-the-art neural network approaches. However, MLP approaches remain strong contenders when considering top five ranks. Importantly, we show that annotation performance is dependent on the training dataset, the number of molecules in the candidate set and candidate similarity to the target molecule. AVAILABILITY AND IMPLEMENTATION The ESP code, a trained model, and a Jupyter notebook that guide users on using the ESP tool is available at https://github.com/HassounLab/ESP.
Collapse
Affiliation(s)
- Xinmeng Li
- Department of Computer Science, Tufts University, Medford, MA, 02155, United States
| | - Yan Zhou Chen
- Department of Computer Science, Tufts University, Medford, MA, 02155, United States
| | - Apurva Kalia
- Department of Computer Science, Tufts University, Medford, MA, 02155, United States
| | - Hao Zhu
- Department of Computer Science, Tufts University, Medford, MA, 02155, United States
| | - Li-ping Liu
- Department of Computer Science, Tufts University, Medford, MA, 02155, United States
| | - Soha Hassoun
- Department of Computer Science, Tufts University, Medford, MA, 02155, United States
- Department of Chemical and Biological Engineering, Tufts University, Medford, MA, 02155, United States
| |
Collapse
|
20
|
Isom M, Desaire H. Skin Surface Sebum Analysis by ESI-MS. Biomolecules 2024; 14:790. [PMID: 39062504 PMCID: PMC11274890 DOI: 10.3390/biom14070790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Revised: 06/26/2024] [Accepted: 06/28/2024] [Indexed: 07/28/2024] Open
Abstract
The skin surface is an important sample source that the metabolomics community has only just begun to explore. Alterations in sebum, the lipid-rich mixture coating the skin surface, correlate with age, sex, ethnicity, diet, exercise, and disease state, making the skin surface an ideal sample source for future noninvasive biomarker exploration, disease diagnosis, and forensic investigation. The potential of sebum sampling has been realized primarily via electrospray ionization mass spectrometry (ESI-MS), an ideal approach to assess the skin surface lipidome. However, a better understanding of sebum collection and subsequent ESI-MS analysis is required before skin surface sampling can be implemented in routine analyses. Challenges include ambiguity in definitive lipid identification, inherent biological variability in sebum production, and methodological, technical variability in analyses. To overcome these obstacles, avoid common pitfalls, and achieve reproducible, robust outcomes, every portion of the workflow-from sample collection to data analysis-should be carefully considered with the specific application in mind. This review details current practices in sebum sampling, sample preparation, ESI-MS data acquisition, and data analysis, and it provides important considerations in acquiring meaningful lipidomic datasets from the skin surface. Forensic researchers investigating sebum as a means for suspect elimination in lieu of adequate fingerprint ridge detail or database matches, as well as clinical researchers interested in noninvasive biomarker exploration, disease diagnosis, and treatment monitoring, can use this review as a guide for developing methods of best-practice.
Collapse
Affiliation(s)
| | - Heather Desaire
- Department of Chemistry, University of Kansas, Lawrence, KS 66045, USA;
| |
Collapse
|
21
|
Castro DC, Chan-Andersen P, Romanova EV, Sweedler JV. Probe-based mass spectrometry approaches for single-cell and single-organelle measurements. MASS SPECTROMETRY REVIEWS 2024; 43:888-912. [PMID: 37010120 PMCID: PMC10545815 DOI: 10.1002/mas.21841] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Revised: 02/09/2023] [Accepted: 03/01/2023] [Indexed: 06/19/2023]
Abstract
Exploring the chemical content of individual cells not only reveals underlying cell-to-cell chemical heterogeneity but is also a key component in understanding how cells combine to form emergent properties of cellular networks and tissues. Recent technological advances in many analytical techniques including mass spectrometry (MS) have improved instrumental limits of detection and laser/ion probe dimensions, allowing the analysis of micron and submicron sized areas. In the case of MS, these improvements combined with MS's broad analyte detection capabilities have enabled the rise of single-cell and single-organelle chemical characterization. As the chemical coverage and throughput of single-cell measurements increase, more advanced statistical and data analysis methods have aided in data visualization and interpretation. This review focuses on secondary ion MS and matrix-assisted laser desorption/ionization MS approaches for single-cell and single-organelle characterization, which is followed by advances in mass spectral data visualization and analysis.
Collapse
Affiliation(s)
- Daniel C. Castro
- Department of Molecular and Integrative Physiology, University of Illinois at Urbana-Champaign, Urbana, IL USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL USA
| | - Peter Chan-Andersen
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL USA
| | - Elena V. Romanova
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL USA
- Neuroscience Program, University of Illinois at Urbana-Champaign, Urbana, IL USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL USA
| | - Jonathan V. Sweedler
- Department of Molecular and Integrative Physiology, University of Illinois at Urbana-Champaign, Urbana, IL USA
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL USA
- Neuroscience Program, University of Illinois at Urbana-Champaign, Urbana, IL USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL USA
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL USA
| |
Collapse
|
22
|
Ahmad P, Moussa DG, Siqueira WL. Metabolomics for dental caries diagnosis: Past, present, and future. MASS SPECTROMETRY REVIEWS 2024. [PMID: 38940512 DOI: 10.1002/mas.21896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 05/22/2024] [Accepted: 06/15/2024] [Indexed: 06/29/2024]
Abstract
Dental caries, a prevalent global infectious condition affecting over 95% of adults, remains elusive in its precise etiology. Addressing the complex dynamics of caries demands a thorough exploration of taxonomic, potential, active, and encoded functions within the oral ecosystem. Metabolomic profiling emerges as a crucial tool, offering immediate insights into microecosystem physiology and linking directly to the phenotype. Identified metabolites, indicative of caries status, play a pivotal role in unraveling the metabolic processes underlying the disease. Despite challenges in metabolite variability, the use of metabolomics, particularly via mass spectrometry and nuclear magnetic resonance spectroscopy, holds promise in caries research. This review comprehensively examines metabolomics in caries prevention, diagnosis, and treatment, highlighting distinct metabolite expression patterns and their associations with disease-related bacterial communities. Pioneering in approach, it integrates singular and combinatory metabolomics methodologies, diverse biofluids, and study designs, critically evaluating prior limitations while offering expert insights for future investigations. By synthesizing existing knowledge, this review significantly advances our comprehension of caries, providing a foundation for improved prevention and treatment strategies.
Collapse
Affiliation(s)
- Paras Ahmad
- College of Dentistry, University of Saskatchewan, Saskatoon, SK, Canada
| | - Dina G Moussa
- College of Dentistry, University of Saskatchewan, Saskatoon, SK, Canada
| | - Walter L Siqueira
- College of Dentistry, University of Saskatchewan, Saskatoon, SK, Canada
| |
Collapse
|
23
|
Guo J, Zhao J, Han P, Wu Y, Zheng K, Huang C, Wang Y, Chen C, Guo Q. Finding the best predictive model for hypertensive depression in older adults based on machine learning and metabolomics research. Front Psychiatry 2024; 15:1370602. [PMID: 38993388 PMCID: PMC11236531 DOI: 10.3389/fpsyt.2024.1370602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 06/10/2024] [Indexed: 07/13/2024] Open
Abstract
Objective Depression is a common comorbidity in hypertensive older adults, yet depression is more difficult to diagnose correctly. Our goal is to find predictive models of depression in hypertensive patients using a combination of various machine learning (ML) methods and metabolomics. Methods Methods We recruited 379 elderly people aged ≥65 years from the Chinese community. Plasma samples were collected and assayed by gas chromatography/liquid chromatography-mass spectrometry (GC/LC-MS). Orthogonal partial least squares discriminant analysis (OPLS-DA), volcano diagrams and thermograms were used to distinguish metabolites. The attribute discriminators CfsSubsetEval combined with search method BestFirst in WEKA software was used to find the best predicted metabolite combinations, and then 24 classification methods with 10-fold cross-validation were used for prediction. Results 34 individuals were considered hypertensive combined with depression according to our criteria, and 34 subjects with hypertension only were matched according to age and sex. 19 metabolites by GC-MS and 65 metabolites by LC-MS contributed significantly to the differentiation between the depressed and non-depressed cohorts, with a VIP value of more than 1 and a P value of less than 0.05. There were multiple metabolic pathway alterations. The metabolite combinations screened with WEKA for optimal diagnostic value included 12 metabolites. The machine learning methods with AUC values greater than 0.9 were bayesNet and random forests, and their other evaluation measures are also better. Conclusion Altered metabolites and metabolic pathways are present in older adults with hypertension combined with depression. Methods using metabolomics and machine learning performed quite well in predicting depression in hypertensive older adults, contributing to further clinical research.
Collapse
Affiliation(s)
- Jiangling Guo
- Department of Rehabilitation Medicine, Shanghai University of Medicine and Health Sciences Affiliated Zhoupu Hospital, Shanghai, China
- Graduate School, Shanghai University of Traditional Chinese Medicine, Shanghai, China
| | - Jingwang Zhao
- Shanghai YangZhi Rehabilitation Hospital (Shanghai Sunshine Rehabilitation Center), School of Medicine, Tongji University, Shanghai, China
| | - Peipei Han
- Department of Rehabilitation Medicine, Shanghai University of Medicine and Health Sciences Affiliated Zhoupu Hospital, Shanghai, China
| | - Yahui Wu
- Department of Rehabilitation Medicine, Shanghai University of Medicine and Health Sciences Affiliated Zhoupu Hospital, Shanghai, China
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China
| | - Kai Zheng
- Department of Rehabilitation Medicine, Shanghai University of Medicine and Health Sciences Affiliated Zhoupu Hospital, Shanghai, China
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China
| | - Chuanjun Huang
- Department of Rehabilitation Medicine, Shanghai University of Medicine and Health Sciences Affiliated Zhoupu Hospital, Shanghai, China
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China
| | - Yue Wang
- Department of Rehabilitation Medicine, Shanghai University of Medicine and Health Sciences Affiliated Zhoupu Hospital, Shanghai, China
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China
| | - Cheng Chen
- Department of Rehabilitation Medicine, Shanghai University of Medicine and Health Sciences Affiliated Zhoupu Hospital, Shanghai, China
- School of Health, Fujian Medical University, Fuzhou, Fujian, China
| | - Qi Guo
- Department of Rehabilitation Medicine, Shanghai University of Medicine and Health Sciences Affiliated Zhoupu Hospital, Shanghai, China
| |
Collapse
|
24
|
Vlnieska V, Khanda A, Gilshtein E, Beltrán JL, Heier J, Kunka D. Polypy: A Framework to Interpret Polymer Properties from Mass Spectroscopy Data. Polymers (Basel) 2024; 16:1771. [PMID: 39000627 PMCID: PMC11244493 DOI: 10.3390/polym16131771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 06/13/2024] [Accepted: 06/20/2024] [Indexed: 07/17/2024] Open
Abstract
Mass spectroscopy (MS) is a robust technique for polymer characterization, and it can provide the chemical fingerprint of a complete sample regarding polymer distribution chains. Nevertheless, polymer chemical properties such as polydispersity (Pd), average molecular mass (Mn), weight average molecular mass (Mw) and others are not determined by MS, as they are commonly characterized by gel permeation chromatography (GPC). In order to calculate polymer properties from MS, a Python script was developed to interpret polymer properties from spectroscopic raw data. Polypy script can be considered a peak detection and area distribution method, and represents the result of combining the MS raw data filtered using Root Mean Square (RMS) calculation with molecular classification based on theoretical molar masses. Polypy filters out areas corresponding to repetitive units. This approach facilitates the identification of the polymer chains and calculates their properties. The script also integrates visualization graphic tools for data analysis. In this work, aryl resin (poly(2,2-bis(4-oxy-(2-(methyloxirane)phenyl)propan) was the study case polymer molecule, and is composed of oligomer chains distributed mainly in the range of dimers to tetramers, in some cases presenting traces of pentamers and hexamers in the distribution profile of the oligomeric chains. Epoxy resin has Mn = 607 Da, Mw = 631 Da, and polydispersity (Pd) of 1.015 (data given by GPC). With Polypy script, calculations resulted in Mn = 584.42 Da, Mw = 649.29 Da, and Pd = 1.11, which are consistent results if compared with GPC characterization. Additional information, such as the percentage of oligomer distribution, was also calculated and for this polymer matrix it was not possible to retrieve it from the GPC method. Polypy is an approach to characterizing major polymer chemical properties using only MS raw spectra, and it can be utilized with any MS raw data for any polymer matrix.
Collapse
Affiliation(s)
- Vitor Vlnieska
- Laboratory for Functional Polymers, Swiss Federal Laboratories for Materials Science and Technology (EMPA), Überlandstrasse 129, 8600 Dübendorf, Switzerland
- Laboratory for Thin Films and Photovoltaics, Swiss Federal Laboratories for Materials Science and Technology (EMPA), Überlandstrasse 129, 8600 Dübendorf, Switzerland
| | - Ankita Khanda
- Integrated Quantum Optics, Institute for Photonic Quantum Systems (PhoQS), Paderborn University, Warburger Str. 100, 33098 Paderborn, Germany
| | - Evgeniia Gilshtein
- Department of Electrical and Photonics Engineering, Technical University of Denmark, Anker Engelunds Vej 101, 2800 Kongens Lyngby, Denmark
| | - Jorge Luis Beltrán
- Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
| | - Jakob Heier
- Laboratory for Thin Films and Photovoltaics, Swiss Federal Laboratories for Materials Science and Technology (EMPA), Überlandstrasse 129, 8600 Dübendorf, Switzerland
| | - Danays Kunka
- Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
| |
Collapse
|
25
|
Beck A, Muhoberac M, Randolph CE, Beveridge CH, Wijewardhane PR, Kenttämaa HI, Chopra G. Recent Developments in Machine Learning for Mass Spectrometry. ACS MEASUREMENT SCIENCE AU 2024; 4:233-246. [PMID: 38910862 PMCID: PMC11191731 DOI: 10.1021/acsmeasuresciau.3c00060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 12/27/2023] [Accepted: 01/22/2024] [Indexed: 06/25/2024]
Abstract
Statistical analysis and modeling of mass spectrometry (MS) data have a long and rich history with several modern MS-based applications using statistical and chemometric methods. Recently, machine learning (ML) has experienced a renaissance due to advents in computational hardware and the development of new algorithms for artificial neural networks (ANN) and deep learning architectures. Moreover, recent successes of new ANN and deep learning architectures in several areas of science, engineering, and society have further strengthened the ML field. Importantly, modern ML methods and architectures have enabled new approaches for tasks related to MS that are now widely adopted in several popular MS-based subdisciplines, such as mass spectrometry imaging and proteomics. Herein, we aim to provide an introductory summary of the practical aspects of ML methodology relevant to MS. Additionally, we seek to provide an up-to-date review of the most recent developments in ML integration with MS-based techniques while also providing critical insights into the future direction of the field.
Collapse
Affiliation(s)
- Armen
G. Beck
- Department
of Chemistry, Purdue University, 560 Oval Drive, West Lafayette, Indiana 47907, United States
| | - Matthew Muhoberac
- Department
of Chemistry, Purdue University, 560 Oval Drive, West Lafayette, Indiana 47907, United States
| | - Caitlin E. Randolph
- Department
of Chemistry, Purdue University, 560 Oval Drive, West Lafayette, Indiana 47907, United States
| | - Connor H. Beveridge
- Department
of Chemistry, Purdue University, 560 Oval Drive, West Lafayette, Indiana 47907, United States
| | - Prageeth R. Wijewardhane
- Department
of Chemistry, Purdue University, 560 Oval Drive, West Lafayette, Indiana 47907, United States
| | - Hilkka I. Kenttämaa
- Department
of Chemistry, Purdue University, 560 Oval Drive, West Lafayette, Indiana 47907, United States
| | - Gaurav Chopra
- Department
of Chemistry, Purdue University, 560 Oval Drive, West Lafayette, Indiana 47907, United States
- Department
of Computer Science (by courtesy), Purdue University, West Lafayette, Indiana 47907, United States
- Purdue
Institute for Drug Discovery, Purdue Institute for Cancer Research,
Regenstrief Center for Healthcare Engineering, Purdue Institute for
Inflammation, Immunology and Infectious Disease, Purdue Institute for Integrative Neuroscience, West Lafayette, Indiana 47907 United States
| |
Collapse
|
26
|
Bifarin O, Fernández FM. Automated Machine Learning and Explainable AI (AutoML-XAI) for Metabolomics: Improving Cancer Diagnostics. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:1089-1100. [PMID: 38690775 PMCID: PMC11157651 DOI: 10.1021/jasms.3c00403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 02/08/2024] [Accepted: 04/23/2024] [Indexed: 05/03/2024]
Abstract
Metabolomics generates complex data necessitating advanced computational methods for generating biological insight. While machine learning (ML) is promising, the challenges of selecting the best algorithms and tuning hyperparameters, particularly for nonexperts, remain. Automated machine learning (AutoML) can streamline this process; however, the issue of interpretability could persist. This research introduces a unified pipeline that combines AutoML with explainable AI (XAI) techniques to optimize metabolomics analysis. We tested our approach on two data sets: renal cell carcinoma (RCC) urine metabolomics and ovarian cancer (OC) serum metabolomics. AutoML, using Auto-sklearn, surpassed standalone ML algorithms like SVM and k-Nearest Neighbors in differentiating between RCC and healthy controls, as well as OC patients and those with other gynecological cancers. The effectiveness of Auto-sklearn is highlighted by its AUC scores of 0.97 for RCC and 0.85 for OC, obtained from the unseen test sets. Importantly, on most of the metrics considered, Auto-sklearn demonstrated a better classification performance, leveraging a mix of algorithms and ensemble techniques. Shapley Additive Explanations (SHAP) provided a global ranking of feature importance, identifying dibutylamine and ganglioside GM(d34:1) as the top discriminative metabolites for RCC and OC, respectively. Waterfall plots offered local explanations by illustrating the influence of each metabolite on individual predictions. Dependence plots spotlighted metabolite interactions, such as the connection between hippuric acid and one of its derivatives in RCC, and between GM3(d34:1) and GM3(18:1_16:0) in OC, hinting at potential mechanistic relationships. Through decision plots, a detailed error analysis was conducted, contrasting feature importance for correctly versus incorrectly classified samples. In essence, our pipeline emphasizes the importance of harmonizing AutoML and XAI, facilitating both simplified ML application and improved interpretability in metabolomics data science.
Collapse
Affiliation(s)
- Olatomiwa
O. Bifarin
- School
of Chemistry and Biochemistry, Georgia Institute
of Technology, Atlanta, Georgia 30332, United States
| | - Facundo M. Fernández
- School
of Chemistry and Biochemistry, Georgia Institute
of Technology, Atlanta, Georgia 30332, United States
- Petit
Institute of Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| |
Collapse
|
27
|
Kuhnen G, Class LC, Badekow S, Hanisch KL, Rohn S, Kuballa J. Python workflow for the selection and identification of marker peptides-proof-of-principle study with heated milk. Anal Bioanal Chem 2024; 416:3349-3360. [PMID: 38607384 PMCID: PMC11106092 DOI: 10.1007/s00216-024-05286-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 03/26/2024] [Accepted: 04/02/2024] [Indexed: 04/13/2024]
Abstract
The analysis of almost holistic food profiles has developed considerably over the last years. This has also led to larger amounts of data and the ability to obtain more information about health-beneficial and adverse constituents in food than ever before. Especially in the field of proteomics, software is used for evaluation, and these do not provide specific approaches for unique monitoring questions. An additional and more comprehensive way of evaluation can be done with the programming language Python. It offers broad possibilities by a large ecosystem for mass spectrometric data analysis, but needs to be tailored for specific sets of features, the research questions behind. It also offers the applicability of various machine-learning approaches. The aim of the present study was to develop an algorithm for selecting and identifying potential marker peptides from mass spectrometric data. The workflow is divided into three steps: (I) feature engineering, (II) chemometric data analysis, and (III) feature identification. The first step is the transformation of the mass spectrometric data into a structure, which enables the application of existing data analysis packages in Python. The second step is the data analysis for selecting single features. These features are further processed in the third step, which is the feature identification. The data used exemplarily in this proof-of-principle approach was from a study on the influence of a heat treatment on the milk proteome/peptidome.
Collapse
Affiliation(s)
- Gesine Kuhnen
- GALAB Laboratories GmbH, Am Schleusengraben 7, 21029, Hamburg, Germany
- Department of Food Chemistry and Analysis, Institute of Food Technology and Food Chemistry, Technical University Berlin, Gustav Meyer Allee 25, 13355, Berlin, Germany
| | - Lisa-Carina Class
- GALAB Laboratories GmbH, Am Schleusengraben 7, 21029, Hamburg, Germany
- Hamburg School of Food Science, Institute of Food Chemistry, University of Hamburg, Grindelallee 117, 20146, Hamburg, Germany
| | - Svenja Badekow
- GALAB Laboratories GmbH, Am Schleusengraben 7, 21029, Hamburg, Germany
| | - Kim Lara Hanisch
- GALAB Laboratories GmbH, Am Schleusengraben 7, 21029, Hamburg, Germany
| | - Sascha Rohn
- Department of Food Chemistry and Analysis, Institute of Food Technology and Food Chemistry, Technical University Berlin, Gustav Meyer Allee 25, 13355, Berlin, Germany
| | - Jürgen Kuballa
- GALAB Laboratories GmbH, Am Schleusengraben 7, 21029, Hamburg, Germany.
| |
Collapse
|
28
|
Sirocchi C, Biancucci F, Donati M, Bogliolo A, Magnani M, Menotta M, Montagna S. Exploring machine learning for untargeted metabolomics using molecular fingerprints. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 250:108163. [PMID: 38626559 DOI: 10.1016/j.cmpb.2024.108163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 03/15/2024] [Accepted: 04/03/2024] [Indexed: 04/18/2024]
Abstract
BACKGROUND Metabolomics, the study of substrates and products of cellular metabolism, offers valuable insights into an organism's state under specific conditions and has the potential to revolutionise preventive healthcare and pharmaceutical research. However, analysing large metabolomics datasets remains challenging, with available methods relying on limited and incompletely annotated metabolic pathways. METHODS This study, inspired by well-established methods in drug discovery, employs machine learning on metabolite fingerprints to explore the relationship of their structure with responses in experimental conditions beyond known pathways, shedding light on metabolic processes. It evaluates fingerprinting effectiveness in representing metabolites, addressing challenges like class imbalance, data sparsity, high dimensionality, duplicate structural encoding, and interpretable features. Feature importance analysis is then applied to reveal key chemical configurations affecting classification, identifying related metabolite groups. RESULTS The approach is tested on two datasets: one on Ataxia Telangiectasia and another on endothelial cells under low oxygen. Machine learning on molecular fingerprints predicts metabolite responses effectively, and feature importance analysis aligns with known metabolic pathways, unveiling new affected metabolite groups for further study. CONCLUSION In conclusion, the presented approach leverages the strengths of drug discovery to address critical issues in metabolomics research and aims to bridge the gap between these two disciplines. This work lays the foundation for future research in this direction, possibly exploring alternative structural encodings and machine learning models.
Collapse
Affiliation(s)
- Christel Sirocchi
- Department of Pure and Applied Sciences, University of Urbino, Piazza della Repubblica, 13, Urbino, 61029, Italy.
| | - Federica Biancucci
- Department of Biomolecular Sciences, University of Urbino, Via Saffi 2, Urbino, 61029, Italy
| | - Matteo Donati
- Department of Pure and Applied Sciences, University of Urbino, Piazza della Repubblica, 13, Urbino, 61029, Italy
| | - Alessandro Bogliolo
- Department of Pure and Applied Sciences, University of Urbino, Piazza della Repubblica, 13, Urbino, 61029, Italy
| | - Mauro Magnani
- Department of Biomolecular Sciences, University of Urbino, Via Saffi 2, Urbino, 61029, Italy
| | - Michele Menotta
- Department of Biomolecular Sciences, University of Urbino, Via Saffi 2, Urbino, 61029, Italy
| | - Sara Montagna
- Department of Pure and Applied Sciences, University of Urbino, Piazza della Repubblica, 13, Urbino, 61029, Italy
| |
Collapse
|
29
|
Al Ghadban Y, Du Y, Charnock-Jones DS, Garmire LX, Smith GCS, Sovio U. Prediction of spontaneous preterm birth using supervised machine learning on metabolomic data: A case-cohort study. BJOG 2024; 131:908-916. [PMID: 37984426 DOI: 10.1111/1471-0528.17723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 09/11/2023] [Accepted: 11/02/2023] [Indexed: 11/22/2023]
Abstract
OBJECTIVES To identify and internally validate metabolites predictive of spontaneous preterm birth (sPTB) using multiple machine learning methods and sequential maternal serum samples, and to predict spontaneous early term birth (sETB) using these metabolites. DESIGN Case-cohort design within a prospective cohort study. SETTING Cambridge, UK. POPULATION OR SAMPLE A total of 399 Pregnancy Outcome Prediction study participants, including 98 cases of sPTB. METHODS An untargeted metabolomic analysis of maternal serum samples at 12, 20, 28 and 36 weeks of gestation was performed. We applied six supervised machine learning methods and a weighted Cox model to measurements at 28 weeks of gestation and sPTB, followed by feature selection. We used logistic regression with elastic net penalty, followed by best subset selection, to reduce the number of predictive metabolites further. We applied coefficients from the chosen models to measurements from different gestational ages to predict sPTB and sETB. MAIN OUTCOME MEASURES sPTB and sETB. RESULTS We identified 47 metabolites, mostly lipids, as important predictors of sPTB by two or more methods and 22 were identified by three or more methods. The best 4-predictor model had an optimism-corrected area under the receiver operating characteristics curve (AUC) of 0.703 at 28 weeks of gestation. The model also predicted sPTB in 12-week samples (0.606, 95% CI 0.544-0.667) and 20-week samples (0.657, 95% CI 0.597-0.717) and it predicted sETB in 36-week samples (0.727, 95% CI 0.606-0.849). A lysolipid, 1-palmitoleoyl-GPE (16:1)*, was the strongest predictor of sPTB at 12 weeks of gestation (0.609, 95% CI 0.548-0.670), 20 weeks (0.630, 95% CI 0.569-0.690) and 28 weeks (0.660, 95% CI 0.599-0.722), and of sETB at 36 weeks (0.739, 95% CI 0.618-0.860). CONCLUSIONS We identified and internally validated maternal serum metabolites predictive of sPTB. A lysolipid, 1-palmitoleoyl-GPE (16:1)*, is a novel predictor of sPTB and sETB. Further validation in external populations is required.
Collapse
Affiliation(s)
- Yasmina Al Ghadban
- Nuffield Department of Women's and Reproductive Health, University of Oxford, Oxford, UK
| | - Yuheng Du
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | - D Stephen Charnock-Jones
- Department of Obstetrics and Gynaecology, University of Cambridge, Cambridge, UK
- NIHR Cambridge Biomedical Research Centre, Cambridge, UK
- Centre for Trophoblast Research (CTR), Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
| | - Lana X Garmire
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | - Gordon C S Smith
- Department of Obstetrics and Gynaecology, University of Cambridge, Cambridge, UK
- NIHR Cambridge Biomedical Research Centre, Cambridge, UK
- Centre for Trophoblast Research (CTR), Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
| | - Ulla Sovio
- Department of Obstetrics and Gynaecology, University of Cambridge, Cambridge, UK
- NIHR Cambridge Biomedical Research Centre, Cambridge, UK
- Centre for Trophoblast Research (CTR), Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
| |
Collapse
|
30
|
Keating MF, Wolfe CA, Liebenberg K, Montgomery A, Porcari AM, Fleming ND, Makarov A, Eberlin LS. Data Acquisition and Intraoperative Tissue Analysis on a Mobile, Battery-Operated, Orbitrap Mass Spectrometer. Anal Chem 2024; 96:8234-8242. [PMID: 38739527 DOI: 10.1021/acs.analchem.4c00722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Mass spectrometry has been increasingly explored in intraoperative studies as a potential technology to help guide surgical decision making. Yet, intraoperative experiments using high-performance mass spectrometry instrumentation present a unique set of operational challenges. For example, standard operating rooms are often not equipped with the electrical requirements to power a commercial mass spectrometer and are not designed to accommodate their permanent installation. These obstacles can impact progress and patient enrollment in intraoperative clinical studies because implementation of MS instrumentation becomes limited to specific operating rooms that have the required electrical connections and space. To expand our intraoperative clinical studies using the MasSpec Pen technology, we explored the feasibility of transporting and acquiring data on Orbitrap mass spectrometers operating on battery power in hospital buildings. We evaluated the effect of instrument movement including acceleration and rotational speeds on signal stability and mass accuracy by acquiring data using direct infusion electrospray ionization. Data were acquired while rolling the systems in/out of operating rooms and while descending/ascending a freight elevator. Despite these movements and operating the instrument on battery power, the relative standard deviation of the total ion current was <5% and the magnitude of the mass error relative to the internal calibrant never exceeded 5.06 ppm. We further evaluated the feasibility of performing intraoperative MasSpec Pen analysis while operating the Orbitrap mass spectrometer on battery power during an ovarian cancer surgery. We observed that the rich and tissue-specific molecular profile commonly detected from ovarian tissues was conserved when running on battery power. Together, these results demonstrate that Orbitrap mass spectrometers can be operated and acquire data on battery power while in motion and in rotation without losses in signal stability or mass accuracy. Furthermore, Orbitrap mass spectrometers can be used in conjunction to the MasSpec Pen while on battery power for intraoperative tissue analysis.
Collapse
Affiliation(s)
- Michael F Keating
- Department of Chemistry, University of Texas at Austin, Austin, Texas 78712, United States
- Department of Surgery, Baylor College of Medicine, Houston, Texas 77030, United States
| | - Charles A Wolfe
- Department of Surgery, Baylor College of Medicine, Houston, Texas 77030, United States
| | - Keziah Liebenberg
- Department of Surgery, Baylor College of Medicine, Houston, Texas 77030, United States
| | - Ashley Montgomery
- Department of Surgery, Baylor College of Medicine, Houston, Texas 77030, United States
| | - Andreia M Porcari
- Department of Surgery, Baylor College of Medicine, Houston, Texas 77030, United States
- MS4Life Laboratory of Mass Spectrometry, Health Sciences Postgraduate Program, Universidade São Francisco, Bragança Paulista, SP 12916-900, Brazil
| | - Nicole D Fleming
- Department of Surgery, MD Anderson Cancer Center, Houston, Texas 77030, United States
| | | | - Livia S Eberlin
- Department of Surgery, Baylor College of Medicine, Houston, Texas 77030, United States
| |
Collapse
|
31
|
Stamm J, Kwon S, Sandhu S, Sandhu J, Levine BG, Dantus M. Coherence mapping to identify the intermediates of multi-channel dissociative ionization. Commun Chem 2024; 7:103. [PMID: 38724724 PMCID: PMC11549452 DOI: 10.1038/s42004-024-01176-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 04/11/2024] [Indexed: 11/10/2024] Open
Abstract
Identifying the short-lived intermediates and reaction mechanisms of multi-channel radical cation fragmentation processes remains a current and important challenge to understanding and predicting mass spectra. We find that coherent oscillations in the femtosecond time-dependent yields of several product ions following ultrafast strong-field ionization represent spectroscopic signatures that elucidate their mechanism of formation and identify the intermediate(s) they originate from. Experiments on endo-dicyclopentadiene show that vibrational frequencies from various intermediates are mapped onto their resulting products. Aided by ab initio methods, we identify the vibrational modes of both the cleaved and intact molecular ion intermediates. These results confirm stepwise and concerted fragmentation pathways of the dicyclopentadiene ion. This study highlights the power of tracking the femtosecond dynamics of all product ions simultaneously and sheds further light onto one of the fundamental reaction mechanisms in mass spectrometry, the retro-Diels Alder reaction.
Collapse
Affiliation(s)
- Jacob Stamm
- Department of Chemistry, Michigan State University, S Shaw Ln, East Lansing, MI, 48824, USA
| | - Sung Kwon
- Department of Chemistry, Michigan State University, S Shaw Ln, East Lansing, MI, 48824, USA
| | - Shawn Sandhu
- Department of Chemistry, Michigan State University, S Shaw Ln, East Lansing, MI, 48824, USA
| | - Jesse Sandhu
- Department of Chemistry, Michigan State University, S Shaw Ln, East Lansing, MI, 48824, USA
| | - Benjamin G Levine
- Department of Chemistry, Stony Brook University, John S. Toll Drive, Stony Brook, NY, 11794, USA
- Institute for Advanced Computational Science, Stony Brook University, IACS Building, Stony Brook, NY, 11794, USA
| | - Marcos Dantus
- Department of Chemistry, Michigan State University, S Shaw Ln, East Lansing, MI, 48824, USA.
- Department of Physics and Astronomy, Michigan State University, Wilson Rd, East Lansing, MI, 48824, USA.
| |
Collapse
|
32
|
Zhang Q, Ren T, Cao K, Xu Z. Advances of machine learning-assisted small extracellular vesicles detection strategy. Biosens Bioelectron 2024; 251:116076. [PMID: 38340580 DOI: 10.1016/j.bios.2024.116076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 01/22/2024] [Accepted: 01/23/2024] [Indexed: 02/12/2024]
Abstract
Detection of extracellular vesicles (EVs), particularly small EVs (sEVs), is of great significance in exploring their physiological characteristics and clinical applications. The heterogeneity of sEVs plays a crucial role in distinguishing different types of cells and diseases. Machine learning, with its exceptional data processing capabilities, offers a solution to overcome the limitations of conventional detection methods for accurately classifying sEV subtypes and sources. Principal component analysis, linear discriminant analysis, partial least squares discriminant analysis, XGBoost, support vector machine, k-nearest neighbor, and deep learning, along with some combined methods such as principal component-linear discriminant analysis, have been successfully applied in the detection and identification of sEVs. This review focuses on machine learning-assisted detection strategies for cell identification and disease prediction via sEVs, and summarizes the integration of these strategies with surface-enhanced Raman scattering, electrochemistry, inductively coupled plasma mass spectrometry and fluorescence. The performance of different machine learning-based detection strategies is compared, and the advantages and limitations of various machine learning models are also evaluated. Finally, we discuss the merits and limitations of the current approaches and briefly outline the perspective of potential research directions in the field of sEV analysis based on machine learning.
Collapse
Affiliation(s)
- Qi Zhang
- Research Center for Analytical Sciences, Northeastern University, Shenyang, 110819, PR China
| | - Tingju Ren
- Research Center for Analytical Sciences, Northeastern University, Shenyang, 110819, PR China
| | - Ke Cao
- Research Center for Analytical Sciences, Northeastern University, Shenyang, 110819, PR China
| | - Zhangrun Xu
- Research Center for Analytical Sciences, Northeastern University, Shenyang, 110819, PR China.
| |
Collapse
|
33
|
Lan Y, Zou Z, Yang Z. Single Cell mass spectrometry: Towards quantification of small molecules in individual cells. Trends Analyt Chem 2024; 174:117657. [PMID: 39391010 PMCID: PMC11465888 DOI: 10.1016/j.trac.2024.117657] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/12/2024]
Abstract
Studying cell heterogeneity can provide a deeper understanding of biological activities, but appropriate studies cannot be performed using traditional bulk analysis methods. The development of diverse single cell bioanalysis methods is in urgent need and of great significance. Mass spectrometry (MS) has been recognized as a powerful technique for bioanalysis for its high sensitivity, wide applicability, label-free detection, and capability for quantitative analysis. In this review, the general development of single cell mass spectrometry (SCMS) field is covered. First, multiple existing SCMS techniques are described and compared. Next, the development of SCMS field is discussed in a chronological order. Last, the latest quantification studies on small molecules using SCMS have been described in detail.
Collapse
Affiliation(s)
| | | | - Zhibo Yang
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, OK, 73019, USA
| |
Collapse
|
34
|
Camelo ALM, Zamora Obando HR, Rocha I, Dias AC, Mesquita ADS, Simionato AVC. COVID-19 and Comorbidities: What Has Been Unveiled by Metabolomics? Metabolites 2024; 14:195. [PMID: 38668323 PMCID: PMC11051775 DOI: 10.3390/metabo14040195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 03/14/2024] [Accepted: 03/26/2024] [Indexed: 04/28/2024] Open
Abstract
The COVID-19 pandemic has brought about diverse impacts on the global population. Individuals with comorbidities were more susceptible to the severe symptoms caused by the virus. Within the crisis scenario, metabolomics represents a potential area of science capable of providing relevant information for understanding the metabolic pathways associated with the intricate interaction between the viral disease and previous comorbidities. This work aims to provide a comprehensive description of the scientific production pertaining to metabolomics within the specific context of COVID-19 and comorbidities, while highlighting promising areas for exploration by those interested in the subject. In this review, we highlighted the studies of metabolomics that indicated a variety of metabolites associated with comorbidities and COVID-19. Furthermore, we observed that the understanding of the metabolic processes involved between comorbidities and COVID-19 is limited due to the urgent need to report disease outcomes in individuals with comorbidities. The overlap of two or more comorbidities associated with the severity of COVID-19 hinders the comprehension of the significance of each condition. Most identified studies are observational, with a restricted number of patients, due to challenges in sample collection amidst the emergent situation.
Collapse
Affiliation(s)
- André Luiz Melo Camelo
- Laboratory of Analysis of Biomolecules Tiselius, Department of Analytical Chemistry, Institute of Chemistry, Universidade Estadual de Campinas (UNICAMP), Campinas 13083-970, São Paulo, Brazil; (A.L.M.C.); (H.R.Z.O.); (I.R.); (A.C.D.); (A.d.S.M.)
| | - Hans Rolando Zamora Obando
- Laboratory of Analysis of Biomolecules Tiselius, Department of Analytical Chemistry, Institute of Chemistry, Universidade Estadual de Campinas (UNICAMP), Campinas 13083-970, São Paulo, Brazil; (A.L.M.C.); (H.R.Z.O.); (I.R.); (A.C.D.); (A.d.S.M.)
| | - Isabela Rocha
- Laboratory of Analysis of Biomolecules Tiselius, Department of Analytical Chemistry, Institute of Chemistry, Universidade Estadual de Campinas (UNICAMP), Campinas 13083-970, São Paulo, Brazil; (A.L.M.C.); (H.R.Z.O.); (I.R.); (A.C.D.); (A.d.S.M.)
| | - Aline Cristina Dias
- Laboratory of Analysis of Biomolecules Tiselius, Department of Analytical Chemistry, Institute of Chemistry, Universidade Estadual de Campinas (UNICAMP), Campinas 13083-970, São Paulo, Brazil; (A.L.M.C.); (H.R.Z.O.); (I.R.); (A.C.D.); (A.d.S.M.)
| | - Alessandra de Sousa Mesquita
- Laboratory of Analysis of Biomolecules Tiselius, Department of Analytical Chemistry, Institute of Chemistry, Universidade Estadual de Campinas (UNICAMP), Campinas 13083-970, São Paulo, Brazil; (A.L.M.C.); (H.R.Z.O.); (I.R.); (A.C.D.); (A.d.S.M.)
| | - Ana Valéria Colnaghi Simionato
- Laboratory of Analysis of Biomolecules Tiselius, Department of Analytical Chemistry, Institute of Chemistry, Universidade Estadual de Campinas (UNICAMP), Campinas 13083-970, São Paulo, Brazil; (A.L.M.C.); (H.R.Z.O.); (I.R.); (A.C.D.); (A.d.S.M.)
- National Institute of Science and Technology for Bioanalytics—INCTBio, Institute of Chemistry, Universidade Estadual de (UNICAMP), Campinas 13083-970, São Paulo, Brazil
| |
Collapse
|
35
|
Fiorante A, Ye LA, Tata A, Kiyota T, Woolman M, Talbot F, Farahmand Y, Vlaminck D, Katz L, Massaro A, Ginsberg H, Aman A, Zarrine-Afsar A. A Workflow for Meaningful Interpretation of Classification Results from Handheld Ambient Mass Spectrometry Analysis Probes. Int J Mol Sci 2024; 25:3491. [PMID: 38542461 PMCID: PMC10970785 DOI: 10.3390/ijms25063491] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 03/11/2024] [Accepted: 03/14/2024] [Indexed: 11/11/2024] Open
Abstract
While untargeted analysis of biological tissues with ambient mass spectrometry analysis probes has been widely reported in the literature, there are currently no guidelines to standardize the workflows for the experimental design, creation, and validation of molecular models that are utilized in these methods to perform class predictions. By drawing parallels with hurdles that are faced in the field of food fraud detection with untargeted mass spectrometry, we provide a stepwise workflow for the creation, refinement, evaluation, and assessment of the robustness of molecular models, aimed at meaningful interpretation of mass spectrometry-based tissue classification results. We propose strategies to obtain a sufficient number of samples for the creation of molecular models and discuss the potential overfitting of data, emphasizing both the need for model validation using an independent cohort of test samples, as well as the use of a fully characterized feature-based approach that verifies the biological relevance of the features that are used to avoid false discoveries. We additionally highlight the need to treat molecular models as "dynamic" and "living" entities and to further refine them as new knowledge concerning disease pathways and classifier feature noise becomes apparent in large(r) population studies. Where appropriate, we have provided a discussion of the challenges that we faced in our development of a 10 s cancer classification method using picosecond infrared laser mass spectrometry (PIRL-MS) to facilitate clinical decision-making at the bedside.
Collapse
Affiliation(s)
- Alexa Fiorante
- Princess Margaret Cancer Centre, University Health Network, 101 College Street, Toronto, ON M5G 1L7, Canada; (A.F.); (L.A.Y.); (M.W.); (F.T.); (Y.F.); (D.V.); (L.K.)
- Department of Medical Biophysics, University of Toronto, 101 College Street, Toronto, ON M5G 1L7, Canada
| | - Lan Anna Ye
- Princess Margaret Cancer Centre, University Health Network, 101 College Street, Toronto, ON M5G 1L7, Canada; (A.F.); (L.A.Y.); (M.W.); (F.T.); (Y.F.); (D.V.); (L.K.)
| | - Alessandra Tata
- Istituto Zooprofilattico Sperimentale Delle Venezie, Viale Fiume, 78, 36100 Vicenza, Italy; (A.T.); (A.M.)
| | - Taira Kiyota
- Ontario Institute for Cancer Research (OICR), 661 University Ave Suite 510, Toronto, ON M5G 0A3, Canada; (T.K.); (A.A.)
| | - Michael Woolman
- Princess Margaret Cancer Centre, University Health Network, 101 College Street, Toronto, ON M5G 1L7, Canada; (A.F.); (L.A.Y.); (M.W.); (F.T.); (Y.F.); (D.V.); (L.K.)
- Department of Medical Biophysics, University of Toronto, 101 College Street, Toronto, ON M5G 1L7, Canada
| | - Francis Talbot
- Princess Margaret Cancer Centre, University Health Network, 101 College Street, Toronto, ON M5G 1L7, Canada; (A.F.); (L.A.Y.); (M.W.); (F.T.); (Y.F.); (D.V.); (L.K.)
| | - Yasamine Farahmand
- Princess Margaret Cancer Centre, University Health Network, 101 College Street, Toronto, ON M5G 1L7, Canada; (A.F.); (L.A.Y.); (M.W.); (F.T.); (Y.F.); (D.V.); (L.K.)
| | - Darah Vlaminck
- Princess Margaret Cancer Centre, University Health Network, 101 College Street, Toronto, ON M5G 1L7, Canada; (A.F.); (L.A.Y.); (M.W.); (F.T.); (Y.F.); (D.V.); (L.K.)
- Department of Medical Biophysics, University of Toronto, 101 College Street, Toronto, ON M5G 1L7, Canada
| | - Lauren Katz
- Princess Margaret Cancer Centre, University Health Network, 101 College Street, Toronto, ON M5G 1L7, Canada; (A.F.); (L.A.Y.); (M.W.); (F.T.); (Y.F.); (D.V.); (L.K.)
- Department of Medical Biophysics, University of Toronto, 101 College Street, Toronto, ON M5G 1L7, Canada
| | - Andrea Massaro
- Istituto Zooprofilattico Sperimentale Delle Venezie, Viale Fiume, 78, 36100 Vicenza, Italy; (A.T.); (A.M.)
| | - Howard Ginsberg
- Department of Surgery, University of Toronto, 149 College Street, Toronto, ON M5T 1P5, Canada;
- Keenan Research Center for Biomedical Science & the Li Ka Shing Knowledge Institute, St. Michael’s Hospital, 30 Bond Street, Toronto, ON M5B 1W8, Canada
- Department of Laboratory Medicine and Pathobiology, University of Toronto, 1 King’s College Circle, Toronto, ON M5S 1A8, Canada
| | - Ahmed Aman
- Ontario Institute for Cancer Research (OICR), 661 University Ave Suite 510, Toronto, ON M5G 0A3, Canada; (T.K.); (A.A.)
- Leslie Dan Faculty of Pharmacy, University of Toronto, 144 College St, Toronto, ON M5S 3M2, Canada
| | - Arash Zarrine-Afsar
- Princess Margaret Cancer Centre, University Health Network, 101 College Street, Toronto, ON M5G 1L7, Canada; (A.F.); (L.A.Y.); (M.W.); (F.T.); (Y.F.); (D.V.); (L.K.)
- Department of Medical Biophysics, University of Toronto, 101 College Street, Toronto, ON M5G 1L7, Canada
- Department of Surgery, University of Toronto, 149 College Street, Toronto, ON M5T 1P5, Canada;
- Keenan Research Center for Biomedical Science & the Li Ka Shing Knowledge Institute, St. Michael’s Hospital, 30 Bond Street, Toronto, ON M5B 1W8, Canada
| |
Collapse
|
36
|
Wang B, Li T, Xu L, Cai Y. Protective effect of FKBP12 on dextran sulfate sodium-induced ulcerative colitis in mice as a tacrolimus receptor. NUCLEOSIDES, NUCLEOTIDES & NUCLEIC ACIDS 2024:1-16. [PMID: 38466901 DOI: 10.1080/15257770.2024.2320817] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Accepted: 02/14/2024] [Indexed: 03/13/2024]
Abstract
Ulcerative colitis (UC) is a multifactorial intestinal disease with a high incidence. In recent years, there has been an urgent need for pleiotropic drugs with a clear biosafety profile. Tacrolimus (TAC) is an immunosuppressant with stronger in vivo effects and better gastrointestinal absorption and is considered a potential treatment for UC. FKBP12 is a mediator of TAC immunosuppression; however, it is unclear whether it can participate in the development of UC in combination with TAC. The purpose of this study is to preliminarily validate the function of FKBP12 by establishing dextran sulfate sodium (DSS)-induced UC model and TAC treatment. The results revealed that TAC was effective in alleviating DSS-induced UC symptoms such as body weight and disease activity index (DAI). TAC significantly protects colonic tissue and attenuates DSS-induced histomorphological changes. In addition, FKBP12 is down-regulated in the intestinal tissue of DSS-induced UC mice and in serum samples of UC patients. In conclusion, our study revealed that FKBP12 may act as a TAC receptor to have anti-inflammatory and protective effects on DSS-induced UC in mice, which will provide a new option for the treatment of UC.
Collapse
Affiliation(s)
- Birong Wang
- Department of Gastroenterology, BenQ Medical Center, The Affiliated BenQ Hospital of Nanjing Medical University, Nanjing, Jiangsu, China
| | - Tingzan Li
- Department of Gastroenterology, BenQ Medical Center, The Affiliated BenQ Hospital of Nanjing Medical University, Nanjing, Jiangsu, China
| | - Liqin Xu
- Department of Respiratory and Critical Care Medicine, Affiliated Hospital of Nantong University, Nantong, Jiangsu, China
| | - Yuxi Cai
- Department of Critical Care Medicine, BenQ Medical Center, The Affiliated BenQ Hospital of Nanjing Medical University, Nanjing, Jiangsu, China
| |
Collapse
|
37
|
More TH, Hiller K, Seifert M, Illig T, Schmidt R, Gronauer R, von Hahn T, Weilert H, Stang A. Metabolomics analysis reveals novel serum metabolite alterations in cancer cachexia. Front Oncol 2024; 14:1286896. [PMID: 38450189 PMCID: PMC10915872 DOI: 10.3389/fonc.2024.1286896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Accepted: 02/05/2024] [Indexed: 03/08/2024] Open
Abstract
Background Cachexia is a body wasting syndrome that significantly affects well-being and prognosis of cancer patients, without effective treatment. Serum metabolites take part in pathophysiological processes of cancer cachexia, but apart from altered levels of select serum metabolites, little is known on the global changes of the overall serum metabolome, which represents a functional readout of the whole-body metabolic state. Here, we aimed to comprehensively characterize serum metabolite alterations and analyze associated pathways in cachectic cancer patients to gain new insights that could help instruct strategies for novel interventions of greater clinical benefit. Methods Serum was sampled from 120 metastatic cancer patients (stage UICC IV). Patients were grouped as cachectic or non-cachectic according to the criteria for cancer cachexia agreed upon international consensus (main criterium: weight loss adjusted to body mass index). Samples were pooled by cachexia phenotype and assayed using non-targeted gas chromatography-mass spectrometry (GC-MS). Normalized metabolite levels were compared using t-test (p < 0.05, adjusted for false discovery rate) and partial least squares discriminant analysis (PLS-DA). Machine-learning models were applied to identify metabolite signatures for separating cachexia states. Significant metabolites underwent MetaboAnalyst 5.0 pathway analysis. Results Comparative analyses included 78 cachectic and 42 non-cachectic patients. Cachectic patients exhibited 19 annotable, significantly elevated (including glucose and fructose) or decreased (mostly amino acids) metabolites associating with aminoacyl-tRNA, glutathione and amino acid metabolism pathways. PLS-DA showed distinct clusters (accuracy: 85.6%), and machine-learning models identified metabolic signatures for separating cachectic states (accuracy: 83.2%; area under ROC: 88.0%). We newly identified altered blood levels of erythronic acid and glucuronic acid in human cancer cachexia, potentially linked to pentose-phosphate and detoxification pathways. Conclusion We found both known and yet unknown serum metabolite and metabolic pathway alterations in cachectic cancer patients that collectively support a whole-body metabolic state with impaired detoxification capability, altered glucose and fructose metabolism, and substrate supply for increased and/or distinct metabolic needs of cachexia-associated tumors. These findings together imply vulnerabilities, dependencies and targets for novel interventions that have potential to make a significant impact on future research in an important field of cancer patient care.
Collapse
Affiliation(s)
- Tushar H. More
- Department of Bioinformatics and Biochemistry, Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Karsten Hiller
- Department of Bioinformatics and Biochemistry, Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Martin Seifert
- Asklepios Precision Medicine, Asklepios Hospitals GmbH & Co KgaA, Königstein (Taunus), Germany
- Connexome GmbH, Fischen, Germany
| | - Thomas Illig
- Department of Human Genetics, Hannover Medical School, Hannover, Germany
- Hannover Unified Biobank (HUB), Hannover, Germany
| | - Rudi Schmidt
- Asklepios Precision Medicine, Asklepios Hospitals GmbH & Co KgaA, Königstein (Taunus), Germany
- Immunetrue, Cologne, Germany
| | - Raphael Gronauer
- Asklepios Precision Medicine, Asklepios Hospitals GmbH & Co KgaA, Königstein (Taunus), Germany
- Connexome GmbH, Fischen, Germany
| | - Thomas von Hahn
- Asklepios Hospital Barmbek, Department of Gastroenterology, Hepatology and Endoscopy, Hamburg, Germany
- Asklepios Tumorzentrum Hamburg, Hamburg, Germany
- Semmelweis University, Asklepios Campus Hamburg, Budapest, Hungary
| | - Hauke Weilert
- Asklepios Tumorzentrum Hamburg, Hamburg, Germany
- Semmelweis University, Asklepios Campus Hamburg, Budapest, Hungary
- Asklepios Hospital Barmbek, Department of Hematology, Oncology and Palliative Care Medicine, Hamburg, Germany
| | - Axel Stang
- Asklepios Tumorzentrum Hamburg, Hamburg, Germany
- Semmelweis University, Asklepios Campus Hamburg, Budapest, Hungary
- Asklepios Hospital Barmbek, Department of Hematology, Oncology and Palliative Care Medicine, Hamburg, Germany
| |
Collapse
|
38
|
Tanaka K, Bamba T, Kondo A, Hasunuma T. Metabolomics-based development of bioproduction processes toward industrial-scale production. Curr Opin Biotechnol 2024; 85:103057. [PMID: 38154323 DOI: 10.1016/j.copbio.2023.103057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 12/04/2023] [Accepted: 12/04/2023] [Indexed: 12/30/2023]
Abstract
Microbial biomanufacturing offers a promising, environment-friendly platform for next-generation chemical production. However, its limited industrial implementation is attributed to the slow production rates of target compounds and the time-intensive engineering of high-yield strains. This review highlights how metabolomics expedites bioproduction development, as demonstrated through case studies of its integration into microbial strain engineering, culture optimization, and model construction. The Design-Build-Test-Learn (DBTL) cycle serves as a standard workflow for strain engineering. Process development, including the optimization of culture conditions and scale-up, is crucial for industrial production. In silico models facilitate the development of strains and processes. Metabolomics is a powerful driver of the DBTL framework, process development, and model construction.
Collapse
Affiliation(s)
- Kenya Tanaka
- Engineering Biology Research Center, Kobe University, 1-1 Rokkodai, Nada, Kobe 657-8501, Japan; Graduate School of Science, Technology and Innovation, Kobe University, 1-1 Rokkodai, Nada, Kobe 657-8501, Japan
| | - Takahiro Bamba
- Engineering Biology Research Center, Kobe University, 1-1 Rokkodai, Nada, Kobe 657-8501, Japan
| | - Akihiko Kondo
- Engineering Biology Research Center, Kobe University, 1-1 Rokkodai, Nada, Kobe 657-8501, Japan; Graduate School of Science, Technology and Innovation, Kobe University, 1-1 Rokkodai, Nada, Kobe 657-8501, Japan; RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Tomohisa Hasunuma
- Engineering Biology Research Center, Kobe University, 1-1 Rokkodai, Nada, Kobe 657-8501, Japan; Graduate School of Science, Technology and Innovation, Kobe University, 1-1 Rokkodai, Nada, Kobe 657-8501, Japan; RIKEN Center for Sustainable Resource Science, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan.
| |
Collapse
|
39
|
Sandström H, Rissanen M, Rousu J, Rinke P. Data-Driven Compound Identification in Atmospheric Mass Spectrometry. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2306235. [PMID: 38095508 PMCID: PMC10885664 DOI: 10.1002/advs.202306235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 11/04/2023] [Indexed: 02/24/2024]
Abstract
Aerosol particles found in the atmosphere affect the climate and worsen air quality. To mitigate these adverse impacts, aerosol particle formation and aerosol chemistry in the atmosphere need to be better mapped out and understood. Currently, mass spectrometry is the single most important analytical technique in atmospheric chemistry and is used to track and identify compounds and processes. Large amounts of data are collected in each measurement of current time-of-flight and orbitrap mass spectrometers using modern rapid data acquisition practices. However, compound identification remains a major bottleneck during data analysis due to lacking reference libraries and analysis tools. Data-driven compound identification approaches could alleviate the problem, yet remain rare to non-existent in atmospheric science. In this perspective, the authors review the current state of data-driven compound identification with mass spectrometry in atmospheric science and discuss current challenges and possible future steps toward a digital era for atmospheric mass spectrometry.
Collapse
Affiliation(s)
- Hilda Sandström
- Department of Applied Physics, Aalto University, P.O. Box 11000, FI-00076, Aalto, Espoo, Finland
| | - Matti Rissanen
- Aerosol Physics Laboratory, Tampere University, FI-33720, Tampere, Finland
- Department of Chemistry, University of Helsinki, P.O. Box 55, A.I. Virtasen aukio 1, FI-00560, Helsinki, Finland
| | - Juho Rousu
- Department of Computer Science, Aalto University, P.O. Box 11000, FI-00076, Aalto, Espoo, Finland
| | - Patrick Rinke
- Department of Applied Physics, Aalto University, P.O. Box 11000, FI-00076, Aalto, Espoo, Finland
| |
Collapse
|
40
|
Xu R, Zhang H, Crowder MW, Zhu J. Multiple and Optimal Screening Subset: a method selecting global characteristic congeners for robust foodomics analysis. Brief Bioinform 2024; 25:bbae046. [PMID: 38385875 PMCID: PMC10883140 DOI: 10.1093/bib/bbae046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 01/04/2024] [Accepted: 01/26/2024] [Indexed: 02/23/2024] Open
Abstract
Metabolomics and foodomics shed light on the molecular processes within living organisms and the complex food composition by leveraging sophisticated analytical techniques to systematically analyze the vast array of molecular features. The traditional feature-picking method often results in arbitrary selections of the model, feature ranking, and cut-off, which may lead to suboptimal results. Thus, a Multiple and Optimal Screening Subset (MOSS) approach was developed in this study to achieve a balance between a minimal number of predictors and high predictive accuracy during statistical model setup. The MOSS approach compares five commonly used models in the context of food matrix analysis, specifically bourbons. These models include Student's t-test, receiver operating characteristic curve, partial least squares-discriminant analysis (PLS-DA), random forests, and support vector machines. The approach employs cross-validation to identify promising subset feature candidates that contribute to food characteristic classification. It then determines the optimal subset size by comparing it to the corresponding top-ranked features. Finally, it selects the optimal feature subset by traversing all possible feature candidate combinations. By utilizing MOSS approach to analyze 1406 mass spectral features from a collection of 122 bourbon samples, we were able to generate a subset of features for bourbon age prediction with 88% accuracy. Additionally, MOSS increased the area under the curve performance of sweetness prediction to 0.898 with only four predictors compared with the top-ranked four features at 0.681 based on the PLS-DA model. Overall, we demonstrated that MOSS provides an efficient and effective approach for selecting optimal features compared with other frequently utilized methods.
Collapse
Affiliation(s)
- Rui Xu
- Human Nutrition Program, Department of Human Sciences, The Ohio State University, Columbus, Ohio, USA 43210
- Comprehensive Cancer Center, The Ohio State University, Columbus, Ohio, USA 43210
| | - Huan Zhang
- Human Nutrition Program, Department of Human Sciences, The Ohio State University, Columbus, Ohio, USA 43210
- Comprehensive Cancer Center, The Ohio State University, Columbus, Ohio, USA 43210
| | - Michael W Crowder
- Department of Chemistry and Biochemistry, Miami University, Oxford, Ohio, USA 45056
| | - Jiangjiang Zhu
- Human Nutrition Program, Department of Human Sciences, The Ohio State University, Columbus, Ohio, USA 43210
- Comprehensive Cancer Center, The Ohio State University, Columbus, Ohio, USA 43210
| |
Collapse
|
41
|
Williams OHL, Rusli O, Ezzedinloo L, Dodgen TM, Clegg JK, Rijs NJ. Automated Structural Activity Screening of β-Diketonate Assemblies with High-Throughput Ion Mobility-Mass Spectrometry. Angew Chem Int Ed Engl 2024; 63:e202313892. [PMID: 38012094 DOI: 10.1002/anie.202313892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 11/08/2023] [Accepted: 11/27/2023] [Indexed: 11/29/2023]
Abstract
Embracing complexity in design, metallo-supramolecular self-assembly presents an opportunity for fabricating materials of economic significance. The array of accessible supramolecules is alluring, along with favourable energy requirements. Implementation is hampered by an inability to efficiently characterise complex mixtures. The stoichiometry, size, shape, guest binding properties and reactivity of individual components and combinations thereof are inherently challenging to resolve. A large combinatorial library of four transition metals (Fe, Cu, Ni and Zn), and six β-diketonate ligands at different molar ratios and pH was robotically prepared and directly analysed over multiple timepoints with electrospray ionisation travelling wave ion mobility-mass spectrometry. The dataset was parsed for self-assembling activity without first attempting to structurally assign individual species. Self-assembling systems were readily categorised without manual data-handling, allowing efficient screening of self-assembly activity. This workflow clarifies solution phase supramolecular assembly processes without manual, bottom-up processing. The complex behaviour of the self-assembling systems was reduced to simpler qualities, which could be automatically processed.
Collapse
Affiliation(s)
| | - Olivia Rusli
- School of Chemistry, UNSW Sydney, Sydney, NSW, 2052, Australia
| | - Lida Ezzedinloo
- School of Chemistry, UNSW Sydney, Sydney, NSW, 2052, Australia
| | - Tyren M Dodgen
- Waters Corporation Australia, Rydalmere, NSW, 2116, Australia
| | - Jack K Clegg
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, Queensland, 4072, Australia
| | - Nicole J Rijs
- School of Chemistry, UNSW Sydney, Sydney, NSW, 2052, Australia
| |
Collapse
|
42
|
Thukral M, Allen AE, Petras D. Progress and challenges in exploring aquatic microbial communities using non-targeted metabolomics. THE ISME JOURNAL 2023; 17:2147-2159. [PMID: 37857709 PMCID: PMC10689791 DOI: 10.1038/s41396-023-01532-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 09/27/2023] [Accepted: 10/02/2023] [Indexed: 10/21/2023]
Abstract
Advances in bioanalytical technologies are constantly expanding our insights into complex ecosystems. Here, we highlight strategies and applications that make use of non-targeted metabolomics methods in aquatic chemical ecology research and discuss opportunities and remaining challenges of mass spectrometry-based methods to broaden our understanding of environmental systems.
Collapse
Affiliation(s)
- Monica Thukral
- University of California San Diego, Scripps Institution of Oceanography, La Jolla, CA, USA
- J. Craig Venter Institute, Microbial and Environmental Genomics Group, La Jolla, CA, USA
| | - Andrew E Allen
- University of California San Diego, Scripps Institution of Oceanography, La Jolla, CA, USA
- J. Craig Venter Institute, Microbial and Environmental Genomics Group, La Jolla, CA, USA
| | - Daniel Petras
- University of Tuebingen, CMFI Cluster of Excellence, Tuebingen, Germany.
- University of California Riverside, Department of Biochemistry, Riverside, CA, USA.
| |
Collapse
|
43
|
Wevers D, Ramautar R, Clark C, Hankemeier T, Ali A. Opportunities and challenges for sample preparation and enrichment in mass spectrometry for single-cell metabolomics. Electrophoresis 2023; 44:2000-2024. [PMID: 37667867 DOI: 10.1002/elps.202300105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 08/08/2023] [Accepted: 08/19/2023] [Indexed: 09/06/2023]
Abstract
Single-cell heterogeneity in metabolism, drug resistance and disease type poses the need for analytical techniques for single-cell analysis. As the metabolome provides the closest view of the status quo in the cell, studying the metabolome at single-cell resolution may unravel said heterogeneity. A challenge in single-cell metabolome analysis is that metabolites cannot be amplified, so one needs to deal with picolitre volumes and a wide range of analyte concentrations. Due to high sensitivity and resolution, MS is preferred in single-cell metabolomics. Large numbers of cells need to be analysed for proper statistics; this requires high-throughput analysis, and hence automation of the analytical workflow. Significant advances in (micro)sampling methods, CE and ion mobility spectrometry have been made, some of which have been applied in high-throughput analyses. Microfluidics has enabled an automation of cell picking and metabolite extraction; image recognition has enabled automated cell identification. Many techniques have been used for data analysis, varying from conventional techniques to novel combinations of advanced chemometric approaches. Steps have been set in making data more findable, accessible, interoperable and reusable, but significant opportunities for improvement remain. Herein, advances in single-cell analysis workflows and data analysis are discussed, and recommendations are made based on the experimental goal.
Collapse
Affiliation(s)
- Dirk Wevers
- Wageningen University and Research, Wageningen, The Netherlands
- Metabolomics and Analytics Centre, Leiden Academic Centre for Drug Research, Leiden, The Netherlands
| | - Rawi Ramautar
- Metabolomics and Analytics Centre, Leiden Academic Centre for Drug Research, Leiden, The Netherlands
| | - Charlie Clark
- Metabolomics and Analytics Centre, Leiden Academic Centre for Drug Research, Leiden, The Netherlands
| | - Thomas Hankemeier
- Metabolomics and Analytics Centre, Leiden Academic Centre for Drug Research, Leiden, The Netherlands
| | - Ahmed Ali
- Metabolomics and Analytics Centre, Leiden Academic Centre for Drug Research, Leiden, The Netherlands
| |
Collapse
|
44
|
Gombert M, Reisdorph N, Morton SJ, Wright KP, Depner CM. Insufficient sleep and weekend recovery sleep: classification by a metabolomics-based machine learning ensemble. Sci Rep 2023; 13:21123. [PMID: 38036605 PMCID: PMC10689438 DOI: 10.1038/s41598-023-48208-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 11/23/2023] [Indexed: 12/02/2023] Open
Abstract
Although weekend recovery sleep is common, the physiological responses to weekend recovery sleep are not fully elucidated. Identifying molecular biomarkers that represent adequate versus insufficient sleep could help advance our understanding of weekend recovery sleep. Here, we identified potential molecular biomarkers of insufficient sleep and defined the impact of weekend recovery sleep on these biomarkers using metabolomics in a randomized controlled trial. Healthy adults (n = 34) were randomized into three groups: control (CON: 9-h sleep opportunities); sleep restriction (SR: 5-h sleep opportunities); or weekend recovery (WR: simulated workweek of 5-h sleep opportunities followed by ad libitum weekend recovery sleep and then 2 days with 5-h sleep opportunities). Blood for metabolomics was collected on the simulated Monday immediately following the weekend. Nine machine learning models, including a machine learning ensemble, were built to classify samples from SR versus CON. Notably, SR showed decreased glycerophospholipids and sphingolipids versus CON. The machine learning ensemble showed the highest G-mean performance and classified 50% of the WR samples as insufficient sleep. Our findings show insufficient sleep and recovery sleep influence the plasma metabolome and suggest more than one weekend of recovery sleep may be necessary for the identified biomarkers to return to healthy adequate sleep levels.
Collapse
Affiliation(s)
- Marie Gombert
- Department of Pediatrics, Obstetrics and Gynecology, University of Valencia, 46010, Valencia, Spain
- Center for Health Sciences, SRI International, Menlo Park, CA, USA
| | - Nichole Reisdorph
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Sarah J Morton
- Sleep and Chronobiology Laboratory, Department of Integrative Physiology, University of Colorado Boulder, 1725 Pleasant Street; Clare Small 114, Boulder, CO, 80309-0354, USA
| | - Kenneth P Wright
- Sleep and Chronobiology Laboratory, Department of Integrative Physiology, University of Colorado Boulder, 1725 Pleasant Street; Clare Small 114, Boulder, CO, 80309-0354, USA.
- Division of Endocrinology, Metabolism, and Diabetes, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.
| | - Christopher M Depner
- Sleep and Chronobiology Laboratory, Department of Integrative Physiology, University of Colorado Boulder, 1725 Pleasant Street; Clare Small 114, Boulder, CO, 80309-0354, USA.
- Department of Health and Kinesiology, University of Utah, 250 S 1850 E; HPER North, RM 206, Salt Lake City, UT, 84112, USA.
| |
Collapse
|
45
|
Djoumbou-Feunang Y, Wilmot J, Kinney J, Chanda P, Yu P, Sader A, Sharifi M, Smith S, Ou J, Hu J, Shipp E, Tomandl D, Kumpatla SP. Cheminformatics and artificial intelligence for accelerating agrochemical discovery. Front Chem 2023; 11:1292027. [PMID: 38093816 PMCID: PMC10716421 DOI: 10.3389/fchem.2023.1292027] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Accepted: 11/09/2023] [Indexed: 10/17/2024] Open
Abstract
The global cost-benefit analysis of pesticide use during the last 30 years has been characterized by a significant increase during the period from 1990 to 2007 followed by a decline. This observation can be attributed to several factors including, but not limited to, pest resistance, lack of novelty with respect to modes of action or classes of chemistry, and regulatory action. Due to current and projected increases of the global population, it is evident that the demand for food, and consequently, the usage of pesticides to improve yields will increase. Addressing these challenges and needs while promoting new crop protection agents through an increasingly stringent regulatory landscape requires the development and integration of infrastructures for innovative, cost- and time-effective discovery and development of novel and sustainable molecules. Significant advances in artificial intelligence (AI) and cheminformatics over the last two decades have improved the decision-making power of research scientists in the discovery of bioactive molecules. AI- and cheminformatics-driven molecule discovery offers the opportunity of moving experiments from the greenhouse to a virtual environment where thousands to billions of molecules can be investigated at a rapid pace, providing unbiased hypothesis for lead generation, optimization, and effective suggestions for compound synthesis and testing. To date, this is illustrated to a far lesser extent in the publicly available agrochemical research literature compared to drug discovery. In this review, we provide an overview of the crop protection discovery pipeline and how traditional, cheminformatics, and AI technologies can help to address the needs and challenges of agrochemical discovery towards rapidly developing novel and more sustainable products.
Collapse
Affiliation(s)
| | - Jeremy Wilmot
- Corteva Agriscience, Crop Protection Discovery and Development, Indianapolis, IN, United States
| | - John Kinney
- Corteva Agriscience, Farming Solutions and Digital, Indianapolis, IN, United States
| | - Pritam Chanda
- Corteva Agriscience, Farming Solutions and Digital, Indianapolis, IN, United States
| | - Pulan Yu
- Corteva Agriscience, Crop Protection Discovery and Development, Indianapolis, IN, United States
| | - Avery Sader
- Corteva Agriscience, Crop Protection Discovery and Development, Indianapolis, IN, United States
| | - Max Sharifi
- Corteva Agriscience, Regulatory and Stewardship, Indianapolis, IN, United States
| | - Scott Smith
- Corteva Agriscience, Farming Solutions and Digital, Indianapolis, IN, United States
| | - Junjun Ou
- Corteva Agriscience, Crop Protection Discovery and Development, Indianapolis, IN, United States
| | - Jie Hu
- Corteva Agriscience, Farming Solutions and Digital, Indianapolis, IN, United States
| | - Elizabeth Shipp
- Corteva Agriscience UK Limited, Regulation Innovation Center, Abingdon, United Kingdom
| | | | | |
Collapse
|
46
|
Verhaar BJH, Mosterd CM, Collard D, Galenkamp H, Muller M, Rampanelli E, van Raalte DH, Nieuwdorp M, van den Born BJH. Sex differences in associations of plasma metabolites with blood pressure and heart rate variability: The HELIUS study. Atherosclerosis 2023; 384:117147. [PMID: 37286456 DOI: 10.1016/j.atherosclerosis.2023.05.016] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Revised: 04/27/2023] [Accepted: 05/17/2023] [Indexed: 06/09/2023]
Abstract
BACKGROUND AND AIMS Since plasma metabolites can modulate blood pressure (BP) and vary between men and women, we examined sex differences in plasma metabolite profiles associated with BP and sympathicovagal balance. Our secondary aim was to investigate associations between gut microbiota composition and plasma metabolites predictive of BP and heart rate variability (HRV). METHODS From the HELIUS cohort, we included 196 women and 173 men. Office systolic BP and diastolic BP were recorded, and heart rate variability (HRV) and baroreceptor sensitivity (BRS) were calculated using finger photoplethysmography. Plasma metabolomics was measured using untargeted LC-MS/MS. Gut microbiota composition was determined using 16S sequencing. We used machine learning models to predict BP and HRV from metabolite profiles, and to predict metabolite levels from gut microbiota composition. RESULTS In women, best predicting metabolites for systolic BP included dihomo-lineoylcarnitine, 4-hydroxyphenylacetateglutamine and vanillactate. In men, top predictors included sphingomyelins, N-formylmethionine and conjugated bile acids. Best predictors for HRV in men included phenylacetate and gentisate, which were associated with lower HRV in men but not in women. Several of these metabolites were associated with gut microbiota composition, including phenylacetate, multiple sphingomyelins and gentisate. CONCLUSIONS Plasma metabolite profiles are associated with BP in a sex-specific manner. Catecholamine derivatives were more important predictors for BP in women, while sphingomyelins were more important in men. Several metabolites were associated with gut microbiota composition, providing potential targets for intervention.
Collapse
Affiliation(s)
- Barbara J H Verhaar
- Department of Internal and Vascular Medicine, Amsterdam UMC, Location AMC, Amsterdam, the Netherlands; Department of Internal Medicine - Geriatrics, Amsterdam UMC, Location VUmc, Amsterdam Cardiovascular Sciences, Amsterdam, the Netherlands.
| | - Charlotte M Mosterd
- Department of Internal and Vascular Medicine, Amsterdam UMC, Location AMC, Amsterdam, the Netherlands; Diabetes Center, Department of Internal Medicine, Amsterdam UMC, Location VUmc, Amsterdam, the Netherlands
| | - Didier Collard
- Department of Internal and Vascular Medicine, Amsterdam UMC, Location AMC, Amsterdam, the Netherlands
| | - Henrike Galenkamp
- Department of Public and Occupational Health, Amsterdam UMC, Location AMC, Amsterdam, the Netherlands; Amsterdam Public Health Research Institute, Health Behaviors and Chronic Diseases, Amsterdam, the Netherlands
| | - Majon Muller
- Department of Internal Medicine - Geriatrics, Amsterdam UMC, Location VUmc, Amsterdam Cardiovascular Sciences, Amsterdam, the Netherlands
| | - Elena Rampanelli
- Department of Internal and Vascular Medicine, Amsterdam UMC, Location AMC, Amsterdam, the Netherlands
| | - Daniël H van Raalte
- Department of Internal and Vascular Medicine, Amsterdam UMC, Location AMC, Amsterdam, the Netherlands; Diabetes Center, Department of Internal Medicine, Amsterdam UMC, Location VUmc, Amsterdam, the Netherlands
| | - Max Nieuwdorp
- Department of Internal and Vascular Medicine, Amsterdam UMC, Location AMC, Amsterdam, the Netherlands; Wallenberg Laboratory, Department of Molecular and Clinical Medicine, Sahlgrenska Academy, Goteborgs Universitet, Gothenburg, Sweden
| | - Bert-Jan H van den Born
- Department of Internal and Vascular Medicine, Amsterdam UMC, Location AMC, Amsterdam, the Netherlands; Department of Public and Occupational Health, Amsterdam UMC, Location AMC, Amsterdam, the Netherlands
| |
Collapse
|
47
|
Chen CJ, Lee DY, Yu J, Lin YN, Lin TM. Recent advances in LC-MS-based metabolomics for clinical biomarker discovery. MASS SPECTROMETRY REVIEWS 2023; 42:2349-2378. [PMID: 35645144 DOI: 10.1002/mas.21785] [Citation(s) in RCA: 35] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 10/14/2021] [Accepted: 11/18/2021] [Indexed: 06/15/2023]
Abstract
The employment of liquid chromatography-mass spectrometry (LC-MS) untargeted and targeted metabolomics has led to the discovery of novel biomarkers and improved the understanding of various disease mechanisms. Numerous strategies have been reported to expand the metabolite coverage in LC-MS-untargeted and targeted metabolomics. To improve the sensitivity of low-abundance or poor-ionized metabolites for reducing the amount of clinical sample, chemical derivatization methods are used to target different functional groups. Proper sample preparation is beneficial for reducing the matrix effect, maintaining the stability of the LC-MS system, and increasing the metabolite coverage. Machine learning has recently been integrated into the workflow of LC-MS metabolomics to accelerate metabolite identification and data-processing automation, and increase the accuracy of disease classification and clinical outcome prediction. Due to the rapidly growing utility of LC-MS metabolomics in discovering disease markers, this review will address the recent advances in the field and offer perspectives on various strategies for expanding metabolite coverage, chemical derivatization, sample preparation, clinical disease markers, and machining learning for disease modeling.
Collapse
Affiliation(s)
- Chao-Jung Chen
- Graduate Institute of Integrated Medicine, College of Chinese Medicine, China Medical University, Taichung, Taiwan
- Proteomics Core Laboratory, Department of Medical Research, China Medical University Hospital, Taichung, Taiwan
| | - Der-Yen Lee
- Graduate Institute of Integrated Medicine, College of Chinese Medicine, China Medical University, Taichung, Taiwan
| | - Jiaxin Yu
- AI Innovation Center, China Medical University Hospital, Taichung, Taiwan
| | - Yu-Ning Lin
- Proteomics Core Laboratory, Department of Medical Research, China Medical University Hospital, Taichung, Taiwan
| | - Tsung-Min Lin
- Proteomics Core Laboratory, Department of Medical Research, China Medical University Hospital, Taichung, Taiwan
| |
Collapse
|
48
|
Bifarin OO, Fernández FM. Automated machine learning and explainable AI (AutoML-XAI) for metabolomics: improving cancer diagnostics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.26.564244. [PMID: 37961534 PMCID: PMC10634896 DOI: 10.1101/2023.10.26.564244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Motivation Metabolomics generates complex data necessitating advanced computational methods for generating biological insight. While machine learning (ML) is promising, the challenges of selecting the best algorithms and tuning hyperparameters, particularly for non-experts, remain. Automated machine learning (AutoML) can streamline this process; however, the issue of interpretability could persist. This research introduces a unified pipeline that combines AutoML with explainable AI (XAI) techniques to optimize metabolomics analysis. Results We tested our approach on two datasets: renal cell carcinoma (RCC) urine metabolomics and ovarian cancer (OC) serum metabolomics. AutoML, using auto-sklearn, surpassed standalone ML algorithms such as SVM and random forest in differentiating between RCC and healthy controls, as well as OC patients and those with other gynecological cancers (Non-OC). Auto-sklearn employed a mix of algorithms and ensemble techniques, yielding a superior performance (AUC of 0.97 for RCC and 0.85 for OC). Shapley Additive Explanations (SHAP) provided a global ranking of feature importance, identifying dibutylamine and ganglioside GM(d34:1) as the top discriminative metabolites for RCC and OC, respectively. Waterfall plots offered local explanations by illustrating the influence of each metabolite on individual predictions. Dependence plots spotlighted metabolite interactions, such as the connection between hippuric acid and one of its derivatives in RCC, and between GM3(d34:1) and GM3(18:1_16:0) in OC, hinting at potential mechanistic relationships. Through decision plots, a detailed error analysis was conducted, contrasting feature importance for correctly versus incorrectly classified samples. In essence, our pipeline emphasizes the importance of harmonizing AutoML and XAI, facilitating both simplified ML application and improved interpretability in metabolomics data science. Availability https://github.com/obifarin/automl-xai-metabolomics.
Collapse
Affiliation(s)
- Olatomiwa O. Bifarin
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Facundo M. Fernández
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA 30332, USA
- Petit Institute of Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA 30332, USA
| |
Collapse
|
49
|
Safari Yazd H, Bazargani SF, Fitzpatrick G, Yost RA, Kresak J, Garrett TJ. Metabolomic and Lipidomic Characterization of Meningioma Grades Using LC-HRMS and Machine Learning. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2023; 34:2187-2198. [PMID: 37708056 DOI: 10.1021/jasms.3c00158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/16/2023]
Abstract
Meningiomas are among the most common brain tumors that arise from the leptomeningeal cover of the brain and spinal cord and account for around 37% of all central nervous system tumors. According to the World Health Organization, meningiomas are classified into three histological subtypes: benign, atypical, and anaplastic. Sometimes, meningiomas with a histological diagnosis of benign tumors show clinical characteristics and behavior of aggressive tumors. In this study, we examined the metabolomic and lipidomic profiles of meningioma tumors, focusing on comparing low-grade and high-grade tumors and identifying potential markers that can discriminate between benign and malignant tumors. High-resolution mass spectrometry coupled to liquid chromatography was used for untargeted metabolomics and lipidomics analyses of 85 tumor biopsy samples with different meningioma grades. We then applied feature selection and machine learning techniques to find the features with the highest information to aid in the diagnosis of meningioma grades. Three biomarkers were identified to differentiate low- and high-grade meningioma brain tumors. The use of mass-spectrometry-based metabolomics and lipidomics combined with machine learning analyses to prospect and characterize biomarkers associated with meningioma grades may pave the way for elucidating potential therapeutic and prognostic targets.
Collapse
Affiliation(s)
- Hoda Safari Yazd
- Department of Chemistry, University of Florida, Gainesville, Florida 32610, United States
| | | | - Garrett Fitzpatrick
- Department of Pathology, Immunology and Laboratory Medicine, University of Florida, Gainesville, Florida 32610, United States
| | - Richard A Yost
- Department of Chemistry, University of Florida, Gainesville, Florida 32610, United States
| | - Jesse Kresak
- Department of Pathology, Immunology and Laboratory Medicine, University of Florida, Gainesville, Florida 32610, United States
| | - Timothy J Garrett
- Department of Pathology, Immunology and Laboratory Medicine, University of Florida, Gainesville, Florida 32610, United States
| |
Collapse
|
50
|
Hu X, Mar D, Suzuki N, Zhang B, Peter KT, Beck DAC, Kolodziej EP. Mass-Suite: a novel open-source python package for high-resolution mass spectrometry data analysis. J Cheminform 2023; 15:87. [PMID: 37741995 PMCID: PMC10517472 DOI: 10.1186/s13321-023-00741-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 07/30/2023] [Indexed: 09/25/2023] Open
Abstract
Mass-Suite (MSS) is a Python-based, open-source software package designed to analyze high-resolution mass spectrometry (HRMS)-based non-targeted analysis (NTA) data, particularly for water quality assessment and other environmental applications. MSS provides flexible, user-defined workflows for HRMS data processing and analysis, including both basic functions (e.g., feature extraction, data reduction, feature annotation, data visualization, and statistical analyses) and advanced exploratory data mining and predictive modeling capabilities that are not provided by currently available open-source software (e.g., unsupervised clustering analyses, a machine learning-based source tracking and apportionment tool). As a key advance, most core MSS functions are supported by machine learning algorithms (e.g., clustering algorithms and predictive modeling algorithms) to facilitate function accuracy and/or efficiency. MSS reliability was validated with mixed chemical standards of known composition, with 99.5% feature extraction accuracy and ~ 52% overlap of extracted features relative to other open-source software tools. Example user cases of laboratory data evaluation are provided to illustrate MSS functionalities and demonstrate reliability. MSS expands available HRMS data analysis workflows for water quality evaluation and environmental forensics, and is readily integrated with existing capabilities. As an open-source package, we anticipate further development of improved data analysis capabilities in collaboration with interested users.
Collapse
Affiliation(s)
- Ximin Hu
- Center for Urban Waters, University of Washington Tacoma, Tacoma, WA, 98421, USA
- Department of Civil and Environmental Engineering, University of Washington, Seattle, WA, 98195, USA
| | - Derek Mar
- Department of Material Science and Engineering, University of Washington, Seattle, WA, 98195, USA
| | - Nozomi Suzuki
- Department of Material Science and Engineering, University of Washington, Seattle, WA, 98195, USA
| | - Bowei Zhang
- Department of Material Science and Engineering, University of Washington, Seattle, WA, 98195, USA
| | - Katherine T Peter
- Center for Urban Waters, University of Washington Tacoma, Tacoma, WA, 98421, USA
- Interdisciplinary Arts and Sciences, University of Washington Tacoma, Tacoma, WA, 98421, USA
| | - David A C Beck
- Department of Chemical Engineering, University of Washington, Seattle, WA, 98195, USA.
- eScience Institute, University of Washington, Seattle, WA, 98195, USA.
| | - Edward P Kolodziej
- Center for Urban Waters, University of Washington Tacoma, Tacoma, WA, 98421, USA.
- Department of Civil and Environmental Engineering, University of Washington, Seattle, WA, 98195, USA.
- Interdisciplinary Arts and Sciences, University of Washington Tacoma, Tacoma, WA, 98421, USA.
| |
Collapse
|