1
|
Fok WYR, Grashei M, Skinner JG, Menze BH, Schilling F. Prediction of multiple pH compartments by deep learning in magnetic resonance spectroscopy with hyperpolarized 13C-labelled zymonic acid. EJNMMI Res 2022; 12:24. [PMID: 35460436 PMCID: PMC9035201 DOI: 10.1186/s13550-022-00894-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Accepted: 04/05/2022] [Indexed: 11/11/2022] Open
Abstract
BACKGROUND Hyperpolarization enhances the sensitivity of nuclear magnetic resonance experiments by between four and five orders of magnitude. Several hyperpolarized sensor molecules have been introduced that enable high sensitivity detection of metabolism and physiological parameters. However, hyperpolarized magnetic resonance spectroscopy imaging (MRSI) often suffers from poor signal-to-noise ratio and spectral analysis is complicated by peak overlap. Here, we study measurements of extracellular pH (pHe) by hyperpolarized zymonic acid, where multiple pHe compartments, such as those observed in healthy kidney or other heterogeneous tissue, result in a cluster of spectrally overlapping peaks, which is hard to resolve with conventional spectroscopy analysis routines. METHODS We investigate whether deep learning methods can yield improved pHe prediction in hyperpolarized zymonic acid spectra of multiple pHe compartments compared to conventional line fitting. As hyperpolarized 13C-MRSI data sets are often small, a convolutional neural network (CNN) and a multilayer perceptron (MLP) were trained with either a synthetic or a mixed (synthetic and augmented) data set of acquisitions from the kidneys of healthy mice. RESULTS Comparing the networks' performances compartment-wise on a synthetic test data set and eight real kidney data shows superior performance of CNN compared to MLP and equal or superior performance compared to conventional line fitting. For correct prediction of real kidney pHe values, training with a mixed data set containing only 0.5% real data shows a large improvement compared to training with synthetic data only. Using a manual segmentation approach, pH maps of kidney compartments can be improved by neural network predictions for voxels including three pH compartments. CONCLUSION The results of this study indicate that CNNs offer a reliable, accurate, fast and non-interactive method for analysis of hyperpolarized 13C MRS and MRSI data, where low amounts of acquired data can be complemented to achieve suitable network training.
Collapse
Affiliation(s)
- Wai-Yan Ryana Fok
- Department of Informatics, Technical University of Munich, 85748, Garching, Germany
| | - Martin Grashei
- Department of Nuclear Medicine, TUM School of Medicine, Klinikum Rechts der Isar, Technical University of Munich, 81675, Munich, Germany
| | - Jason G Skinner
- Department of Nuclear Medicine, TUM School of Medicine, Klinikum Rechts der Isar, Technical University of Munich, 81675, Munich, Germany
| | - Bjoern H Menze
- Department of Informatics, Technical University of Munich, 85748, Garching, Germany
| | - Franz Schilling
- Department of Nuclear Medicine, TUM School of Medicine, Klinikum Rechts der Isar, Technical University of Munich, 81675, Munich, Germany.
- Munich Institute of Biomedical Engineering, Technical University of Munich, 85748, Garching, Germany.
| |
Collapse
|
2
|
NMR in Metabolomics: From Conventional Statistics to Machine Learning and Neural Network Approaches. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12062824] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
NMR measurements combined with chemometrics allow achieving a great amount of information for the identification of potential biomarkers responsible for a precise metabolic pathway. These kinds of data are useful in different fields, ranging from food to biomedical fields, including health science. The investigation of the whole set of metabolites in a sample, representing its fingerprint in the considered condition, is known as metabolomics and may take advantage of different statistical tools. The new frontier is to adopt self-learning techniques to enhance clustering or classification actions that can improve the predictive power over large amounts of data. Although machine learning is already employed in metabolomics, deep learning and artificial neural networks approaches were only recently successfully applied. In this work, we give an overview of the statistical approaches underlying the wide range of opportunities that machine learning and neural networks allow to perform with accurate metabolites assignment and quantification.Various actual challenges are discussed, such as proper metabolomics, deep learning architectures and model accuracy.
Collapse
|
3
|
Liu L, Wu Q, Miao X, Fan T, Meng Z, Chen X, Zhu W. Study on toxicity effects of environmental pollutants based on metabolomics: A review. CHEMOSPHERE 2022; 286:131815. [PMID: 34375834 DOI: 10.1016/j.chemosphere.2021.131815] [Citation(s) in RCA: 52] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 07/23/2021] [Accepted: 08/04/2021] [Indexed: 06/13/2023]
Abstract
In the past few decades, the toxic effects of environmental pollutants on non-target organisms have received more and more attention. As a new omics technology, metabolomics can clarify the metabolic homeostasis of the organism at the overall level by studying the changes in the relative contents of endogenous metabolites in the organism. Recently, a large number of studies have used metabolomics technology to study the toxic effects of environmental pollutants on organisms. In this review, we reviewed the analysis processes and data processes of metabolomics and its application in the study of the toxic effects of environmental pollutants including heavy metals, pesticides, polychlorinated biphenyls, polycyclic aromatic hydrocarbons, polybrominated diphenyl ethers and microplastics. In addition, we emphasized that the combination of metabolomics and other omics technologies will help to explore the toxic mechanism of environmental pollutants and provide new research ideas for the toxicological evaluation of environmental pollutants.
Collapse
Affiliation(s)
- Li Liu
- School of Food Science and Engineering, Yangzhou University, Yangzhou, Jiangsu, 225009, China
| | - Qinchao Wu
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou, Jiangsu, 225009, China
| | - Xinyi Miao
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou, Jiangsu, 225009, China
| | - Tianle Fan
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou, Jiangsu, 225009, China
| | - Zhiyuan Meng
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou, Jiangsu, 225009, China.
| | - Xiaojun Chen
- School of Horticulture and Plant Protection, Yangzhou University, Yangzhou, Jiangsu, 225009, China
| | - Wentao Zhu
- Innovation Center of Pesticide Research, Department of Applied Chemistry, College of Science, China Agricultural University, Beijing, 100193, China
| |
Collapse
|
4
|
Lo-Thong O, Charton P, Cadet XF, Grondin-Perez B, Saavedra E, Damour C, Cadet F. Identification of flux checkpoints in a metabolic pathway through white-box, grey-box and black-box modeling approaches. Sci Rep 2020; 10:13446. [PMID: 32778715 PMCID: PMC7417601 DOI: 10.1038/s41598-020-70295-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Accepted: 07/27/2020] [Indexed: 11/29/2022] Open
Abstract
Metabolic pathway modeling plays an increasing role in drug design by allowing better understanding of the underlying regulation and controlling networks in the metabolism of living organisms. However, despite rapid progress in this area, pathway modeling can become a real nightmare for researchers, notably when few experimental data are available or when the pathway is highly complex. Here, three different approaches were developed to model the second part of glycolysis of E. histolytica as an application example, and have succeeded in predicting the final pathway flux: one including detailed kinetic information (white-box), another with an added adjustment term (grey-box) and the last one using an artificial neural network method (black-box). Afterwards, each model was used for metabolic control analysis and flux control coefficient determination. The first two enzymes of this pathway are identified as the key enzymes playing a role in flux control. This study revealed the significance of the three methods for building suitable models adjusted to the available data in the field of metabolic pathway modeling, and could be useful to biologists and modelers.
Collapse
Affiliation(s)
- Ophélie Lo-Thong
- University of Paris, UMR_S1134, BIGR, Inserm, 75015, Paris, France.,DSIMB, UMR_S1134, BIGR, Inserm, Laboratory of Excellence GR-Ex, Faculty of Sciences and Technology, University of La Reunion, 97715, Saint-Denis, France
| | - Philippe Charton
- University of Paris, UMR_S1134, BIGR, Inserm, 75015, Paris, France.,DSIMB, UMR_S1134, BIGR, Inserm, Laboratory of Excellence GR-Ex, Faculty of Sciences and Technology, University of La Reunion, 97715, Saint-Denis, France
| | - Xavier F Cadet
- PEACCEL, Artificial Intelligence Department, 6 square Albin Cachot, box 42, 75013, Paris, France
| | - Brigitte Grondin-Perez
- LE2P, Laboratory of Energy, Electronics and Processes EA 4079, Faculty of Sciences and Technology, University of La Reunion, 97444, St Denis cedex, France
| | - Emma Saavedra
- Departamento de Bioquímica, Instituto Nacional de Cardiología Ignacio Chávez, 14080, Mexico City, Mexico
| | - Cédric Damour
- LE2P, Laboratory of Energy, Electronics and Processes EA 4079, Faculty of Sciences and Technology, University of La Reunion, 97444, St Denis cedex, France
| | - Frédéric Cadet
- University of Paris, UMR_S1134, BIGR, Inserm, 75015, Paris, France. .,DSIMB, UMR_S1134, BIGR, Inserm, Laboratory of Excellence GR-Ex, Faculty of Sciences and Technology, University of La Reunion, 97715, Saint-Denis, France.
| |
Collapse
|
5
|
Computational Methods for the Discovery of Metabolic Markers of Complex Traits. Metabolites 2019; 9:metabo9040066. [PMID: 30987289 PMCID: PMC6523328 DOI: 10.3390/metabo9040066] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2019] [Revised: 03/19/2019] [Accepted: 04/01/2019] [Indexed: 12/21/2022] Open
Abstract
Metabolomics uses quantitative analyses of metabolites from tissues or bodily fluids to acquire a functional readout of the physiological state. Complex diseases arise from the influence of multiple factors, such as genetics, environment and lifestyle. Since genes, RNAs and proteins converge onto the terminal downstream metabolome, metabolomics datasets offer a rich source of information in a complex and convoluted presentation. Thus, powerful computational methods capable of deciphering the effects of many upstream influences have become increasingly necessary. In this review, the workflow of metabolic marker discovery is outlined from metabolite extraction to model interpretation and validation. Additionally, current metabolomics research in various complex disease areas is examined to identify gaps and trends in the use of several statistical and computational algorithms. Then, we highlight and discuss three advanced machine-learning algorithms, specifically ensemble learning, artificial neural networks, and genetic programming, that are currently less visible, but are budding with high potential for utility in metabolomics research. With an upward trend in the use of highly-accurate, multivariate models in the metabolomics literature, diagnostic biomarker panels of complex diseases are more recently achieving accuracies approaching or exceeding traditional diagnostic procedures. This review aims to provide an overview of computational methods in metabolomics and promote the use of up-to-date machine-learning and computational methods by metabolomics researchers.
Collapse
|
6
|
Handelman GS, Kok HK, Chandra RV, Razavi AH, Lee MJ, Asadi H. eDoctor: machine learning and the future of medicine. J Intern Med 2018; 284:603-619. [PMID: 30102808 DOI: 10.1111/joim.12822] [Citation(s) in RCA: 401] [Impact Index Per Article: 57.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Machine learning (ML) is a burgeoning field of medicine with huge resources being applied to fuse computer science and statistics to medical problems. Proponents of ML extol its ability to deal with large, complex and disparate data, often found within medicine and feel that ML is the future for biomedical research, personalized medicine, computer-aided diagnosis to significantly advance global health care. However, the concepts of ML are unfamiliar to many medical professionals and there is untapped potential in the use of ML as a research tool. In this article, we provide an overview of the theory behind ML, explore the common ML algorithms used in medicine including their pitfalls and discuss the potential future of ML in medicine.
Collapse
Affiliation(s)
| | - H K Kok
- Interventional Radiology Service, Northern Hospital Radiology, Epping, Vic, Australia
| | - R V Chandra
- Interventional Neuroradiology Service, Monash Imaging, Monash Health, Clayton, Vic, Australia.,Faculty of Medicine, Nursing and Health Sciences, Monash University, Clayton, Vic, Australia
| | - A H Razavi
- School of Information Technology and Engineering, University of Ottawa, Ottawa, ON, Canada.,BCE Corporate Security, Ottawa, ON, Canada
| | - M J Lee
- Department of Radiology, Beaumont Hospital and Royal College of Surgeons in Ireland, Dublin, Ireland
| | - H Asadi
- Interventional Neuroradiology Service, Monash Imaging, Monash Health, Clayton, Vic, Australia.,Department of Radiology, Interventional Neuroradiology Service, Austin Health, Heidelberg, Vic, Australia.,School of Medicine, Faculty of Health, Deakin University, Waurn Ponds, Vic, Australia
| |
Collapse
|
7
|
Chihanga T, Hausmann SM, Ni S, Kennedy MA. Influence of media selection on NMR based metabolic profiling of human cell lines. Metabolomics 2018; 14:28. [PMID: 30830358 DOI: 10.1007/s11306-018-1323-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Accepted: 01/12/2018] [Indexed: 01/20/2023]
Abstract
INTRODUCTION Comparative metabolic profiling of different human cancer cell lines can reveal metabolic pathways up-regulated or down-regulated in each cell line, potentially providing insight into distinct metabolism taking place in different types of cancer cells. It is noteworthy, however, that human cell lines available from public repositories are deposited with recommended media for optimal growth, and if cell lines to be compared are cultured on different growth media, this introduces a potentially serious confounding variable in metabolic profiling studies designed to identify intrinsic metabolic pathways active in each cell line. OBJECTIVES The goal of this study was to determine if the culture media used to grow human cell lines had a significant impact on the measured metabolic profiles. METHODS NMR-based metabolic profiles of hydrophilic extracts of three human pancreatic cancer cell lines, AsPC-1, MiaPaCa-2 and Panc-1, were compared after culture on Dulbecco's Modified Eagle Medium (DMEM) or Roswell Park Memorial Institute (RPMI-1640) medium. RESULTS Comparisons of the same cell lines cultured on different media revealed that the concentrations of many metabolites depended strongly on the choice of culture media. Analyses of different cell lines grown on the same media revealed insight into their metabolic differences. CONCLUSION The choice of culture media can significantly impact metabolic profiles of human cell lines and should be considered an important variable when designing metabolic profiling studies. Also, the metabolic differences of cells cultured on media recommended for optimal growth in comparison to a second growth medium can reveal critical insight into metabolic pathways active in each cell line.
Collapse
Affiliation(s)
- Tafadzwa Chihanga
- Department of Chemistry and Biochemistry, Miami University, Oxford, OH, 45056, USA
| | - Sarah M Hausmann
- Department of Chemistry and Biochemistry, Miami University, Oxford, OH, 45056, USA
| | - Shuisong Ni
- Department of Chemistry and Biochemistry, Miami University, Oxford, OH, 45056, USA
| | - Michael A Kennedy
- Department of Chemistry and Biochemistry, Miami University, Oxford, OH, 45056, USA.
| |
Collapse
|
8
|
Cuperlovic-Culf M. Machine Learning Methods for Analysis of Metabolic Data and Metabolic Pathway Modeling. Metabolites 2018; 8:E4. [PMID: 29324649 PMCID: PMC5875994 DOI: 10.3390/metabo8010004] [Citation(s) in RCA: 80] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2017] [Revised: 01/08/2018] [Accepted: 01/09/2018] [Indexed: 01/15/2023] Open
Abstract
Machine learning uses experimental data to optimize clustering or classification of samples or features, or to develop, augment or verify models that can be used to predict behavior or properties of systems. It is expected that machine learning will help provide actionable knowledge from a variety of big data including metabolomics data, as well as results of metabolism models. A variety of machine learning methods has been applied in bioinformatics and metabolism analyses including self-organizing maps, support vector machines, the kernel machine, Bayesian networks or fuzzy logic. To a lesser extent, machine learning has also been utilized to take advantage of the increasing availability of genomics and metabolomics data for the optimization of metabolic network models and their analysis. In this context, machine learning has aided the development of metabolic networks, the calculation of parameters for stoichiometric and kinetic models, as well as the analysis of major features in the model for the optimal application of bioreactors. Examples of this very interesting, albeit highly complex, application of machine learning for metabolism modeling will be the primary focus of this review presenting several different types of applications for model optimization, parameter determination or system analysis using models, as well as the utilization of several different types of machine learning technologies.
Collapse
Affiliation(s)
- Miroslava Cuperlovic-Culf
- Digital Technologies Research Center, National Research Council of Canada, 1200 Montreal Road, Ottawa, ON K1A 0R6, Canada.
| |
Collapse
|
9
|
Euceda LR, Haukaas TH, Bathen TF, Giskeødegård GF. Prediction of Clinical Endpoints in Breast Cancer Using NMR Metabolic Profiles. Methods Mol Biol 2018; 1711:167-189. [PMID: 29344890 DOI: 10.1007/978-1-4939-7493-1_9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Metabolic profiles reflect biological conditions as a result of biochemical changes within a living system. It is therefore possible to associate metabolic signatures with clinical endpoints of diseases, such as breast cancer. Nuclear magnetic resonance (NMR) spectroscopy is one of the most common techniques used for metabolic profiling, and produces high dimensional datasets from which meaningful biological information can be extracted. Here, we present an overview of data analysis techniques used to achieve this, describing key steps in the procedure. Moreover, examples of clinical endpoints of interest are provided. Although these are specific for breast cancer, the procedures for the analysis of NMR spectra as described here are applicable to any type of cancer and to other diseases.
Collapse
Affiliation(s)
- Leslie R Euceda
- Department of Circulation and Medical Imaging - MR Center, Faculty of Medicine and Health Sciences, NTNU - Norwegian University of Science and Technology, Postboks 8905, MTFS, 7489, Trondheim, Norway
| | - Tonje H Haukaas
- Department of Circulation and Medical Imaging - MR Center, Faculty of Medicine and Health Sciences, NTNU - Norwegian University of Science and Technology, Postboks 8905, MTFS, 7489, Trondheim, Norway
| | - Tone F Bathen
- Department of Circulation and Medical Imaging - MR Center, Faculty of Medicine and Health Sciences, NTNU - Norwegian University of Science and Technology, Postboks 8905, MTFS, 7489, Trondheim, Norway
| | - Guro F Giskeødegård
- Department of Circulation and Medical Imaging - MR Center, Faculty of Medicine and Health Sciences, NTNU - Norwegian University of Science and Technology, Postboks 8905, MTFS, 7489, Trondheim, Norway.
- St. Olavs Hospital, Trondheim University Hospital, Trondheim, Norway.
| |
Collapse
|
10
|
Stryeck S, Birner-Gruenberger R, Madl T. Integrative metabolomics as emerging tool to study autophagy regulation. MICROBIAL CELL (GRAZ, AUSTRIA) 2017; 4:240-258. [PMID: 28845422 PMCID: PMC5568430 DOI: 10.15698/mic2017.08.584] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 07/01/2017] [Indexed: 12/15/2022]
Abstract
Recent technological developments in metabolomics research have enabled in-depth characterization of complex metabolite mixtures in a wide range of biological, biomedical, environmental, agricultural, and nutritional research fields. Nuclear magnetic resonance spectroscopy and mass spectrometry are the two main platforms for performing metabolomics studies. Given their broad applicability and the systemic insight into metabolism that can be obtained it is not surprising that metabolomics becomes increasingly popular in basic biological research. In this review, we provide an overview on key metabolites, recent studies, and future opportunities for metabolomics in studying autophagy regulation. Metabolites play a pivotal role in autophagy regulation and are therefore key targets for autophagy research. Given the recent success of metabolomics, it can be expected that metabolomics approaches will contribute significantly to deciphering the complex regulatory mechanisms involved in autophagy in the near future and promote understanding of autophagy and autophagy-related diseases in living cells and organisms.
Collapse
Affiliation(s)
- Sarah Stryeck
- Institute of Molecular Biology and Biochemistry, Center of Molecular Medicine, Medical University of Graz, 8010 Graz, Austria
| | - Ruth Birner-Gruenberger
- Research Unit for Functional Proteomics and Metabolic Pathways, Institute of Pathology, Medical University of Graz, 8010 Graz, Austria
| | - Tobias Madl
- Institute of Molecular Biology and Biochemistry, Center of Molecular Medicine, Medical University of Graz, 8010 Graz, Austria
| |
Collapse
|
11
|
Trainor PJ, DeFilippis AP, Rai SN. Evaluation of Classifier Performance for Multiclass Phenotype Discrimination in Untargeted Metabolomics. Metabolites 2017. [PMID: 28635678 PMCID: PMC5488001 DOI: 10.3390/metabo7020030] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Statistical classification is a critical component of utilizing metabolomics data for examining the molecular determinants of phenotypes. Despite this, a comprehensive and rigorous evaluation of the accuracy of classification techniques for phenotype discrimination given metabolomics data has not been conducted. We conducted such an evaluation using both simulated and real metabolomics datasets, comparing Partial Least Squares-Discriminant Analysis (PLS-DA), Sparse PLS-DA, Random Forests, Support Vector Machines (SVM), Artificial Neural Network, k-Nearest Neighbors (k-NN), and Naïve Bayes classification techniques for discrimination. We evaluated the techniques on simulated data generated to mimic global untargeted metabolomics data by incorporating realistic block-wise correlation and partial correlation structures for mimicking the correlations and metabolite clustering generated by biological processes. Over the simulation studies, covariance structures, means, and effect sizes were stochastically varied to provide consistent estimates of classifier performance over a wide range of possible scenarios. The effects of the presence of non-normal error distributions, the introduction of biological and technical outliers, unbalanced phenotype allocation, missing values due to abundances below a limit of detection, and the effect of prior-significance filtering (dimension reduction) were evaluated via simulation. In each simulation, classifier parameters, such as the number of hidden nodes in a Neural Network, were optimized by cross-validation to minimize the probability of detecting spurious results due to poorly tuned classifiers. Classifier performance was then evaluated using real metabolomics datasets of varying sample medium, sample size, and experimental design. We report that in the most realistic simulation studies that incorporated non-normal error distributions, unbalanced phenotype allocation, outliers, missing values, and dimension reduction, classifier performance (least to greatest error) was ranked as follows: SVM, Random Forest, Naïve Bayes, sPLS-DA, Neural Networks, PLS-DA and k-NN classifiers. When non-normal error distributions were introduced, the performance of PLS-DA and k-NN classifiers deteriorated further relative to the remaining techniques. Over the real datasets, a trend of better performance of SVM and Random Forest classifier performance was observed.
Collapse
Affiliation(s)
- Patrick J Trainor
- Division of Cardiovascular Medicine, Department of Medicine, University of Louisville, 580 S. Preston St., Louisville, KY 40202, USA.
| | - Andrew P DeFilippis
- Division of Cardiovascular Medicine, Department of Medicine, University of Louisville, 580 S. Preston St., Louisville, KY 40202, USA.
| | - Shesh N Rai
- Department of Bioinformatics and Biostatistics, University of Louisville, 505 S. Hancock St., Louisville, KY 40202, USA.
| |
Collapse
|
12
|
Multivariate Calibration Approach for Quantitative Determination of Cell-Line Cross Contamination by Intact Cell Mass Spectrometry and Artificial Neural Networks. PLoS One 2016; 11:e0147414. [PMID: 26821236 PMCID: PMC4731057 DOI: 10.1371/journal.pone.0147414] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2015] [Accepted: 01/04/2016] [Indexed: 12/30/2022] Open
Abstract
Cross-contamination of eukaryotic cell lines used in biomedical research represents a highly relevant problem. Analysis of repetitive DNA sequences, such as Short Tandem Repeats (STR), or Simple Sequence Repeats (SSR), is a widely accepted, simple, and commercially available technique to authenticate cell lines. However, it provides only qualitative information that depends on the extent of reference databases for interpretation. In this work, we developed and validated a rapid and routinely applicable method for evaluation of cell culture cross-contamination levels based on mass spectrometric fingerprints of intact mammalian cells coupled with artificial neural networks (ANNs). We used human embryonic stem cells (hESCs) contaminated by either mouse embryonic stem cells (mESCs) or mouse embryonic fibroblasts (MEFs) as a model. We determined the contamination level using a mass spectra database of known calibration mixtures that served as training input for an ANN. The ANN was then capable of correct quantification of the level of contamination of hESCs by mESCs or MEFs. We demonstrate that MS analysis, when linked to proper mathematical instruments, is a tangible tool for unraveling and quantifying heterogeneity in cell cultures. The analysis is applicable in routine scenarios for cell authentication and/or cell phenotyping in general.
Collapse
|
13
|
Latino DARS, Aires-de-Sousa J. Automatic NMR-based identification of chemical reaction types in mixtures of co-occurring reactions. PLoS One 2014; 9:e88499. [PMID: 24551112 PMCID: PMC3923800 DOI: 10.1371/journal.pone.0088499] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2012] [Accepted: 01/08/2014] [Indexed: 11/18/2022] Open
Abstract
The combination of chemoinformatics approaches with NMR techniques and the increasing availability of data allow the resolution of problems far beyond the original application of NMR in structure elucidation/verification. The diversity of applications can range from process monitoring, metabolic profiling, authentication of products, to quality control. An application related to the automatic analysis of complex mixtures concerns mixtures of chemical reactions. We encoded mixtures of chemical reactions with the difference between the 1H NMR spectra of the products and the reactants. All the signals arising from all the reactants of the co-occurring reactions were taken together (a simulated spectrum of the mixture of reactants) and the same was done for products. The difference spectrum is taken as the representation of the mixture of chemical reactions. A data set of 181 chemical reactions was used, each reaction manually assigned to one of 6 types. From this dataset, we simulated mixtures where two reactions of different types would occur simultaneously. Automatic learning methods were trained to classify the reactions occurring in a mixture from the 1H NMR-based descriptor of the mixture. Unsupervised learning methods (self-organizing maps) produced a reasonable clustering of the mixtures by reaction type, and allowed the correct classification of 80% and 63% of the mixtures in two independent test sets of different similarity to the training set. With random forests (RF), the percentage of correct classifications was increased to 99% and 80% for the same test sets. The RF probability associated to the predictions yielded a robust indication of their reliability. This study demonstrates the possibility of applying machine learning methods to automatically identify types of co-occurring chemical reactions from NMR data. Using no explicit structural information about the reactions participants, reaction elucidation is performed without structure elucidation of the molecules in the mixtures.
Collapse
Affiliation(s)
- Diogo A R S Latino
- CQFB, REQUIMTE, Departamento de Química, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Caparica, Portugal ; CCMM, Departamento de Química e Bioquímica, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal
| | - João Aires-de-Sousa
- CQFB, REQUIMTE, Departamento de Química, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Caparica, Portugal
| |
Collapse
|
14
|
|
15
|
Blanco F, López-Mesas M, Serranti S, Bonifazi G, Havel J, Valiente M. Hyperspectral imaging based method for fast characterization of kidney stone types. JOURNAL OF BIOMEDICAL OPTICS 2012; 17:076027. [PMID: 22894510 DOI: 10.1117/1.jbo.17.7.076027] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The formation of kidney stones is a common and highly studied disease, which causes intense pain and presents a high recidivism. In order to find the causes of this problem, the characterization of the main compounds is of great importance. In this sense, the analysis of the composition and structure of the stone can give key information about the urine parameters during the crystal growth. But the usual methods employed are slow, analyst dependent and the information obtained is poor. In the present work, the near infrared (NIR)-hyperspectral imaging technique was used for the analysis of 215 samples of kidney stones, including the main types usually found and their mixtures. The NIR reflectance spectra of the analyzed stones showed significant differences that were used for their classification. To do so, a method was created by the use of artificial neural networks, which showed a probability higher than 90% for right classification of the stones. The promising results, robust methodology, and the fast analytical process, without the need of an expert assistance, lead to an easy implementation at the clinical laboratories, offering the urologist a rapid diagnosis that shall contribute to minimize urolithiasis recidivism.
Collapse
Affiliation(s)
- Francisco Blanco
- Universitat Autònoma de Barcelona, Centre Grup de Tècniques de Separació en Química (GTS), Unitat de Química Analítica, Departament de Química, 08193 Bellaterra, Spain
| | | | | | | | | | | |
Collapse
|
16
|
Sugimoto M, Kawakami M, Robert M, Soga T, Tomita M. Bioinformatics Tools for Mass Spectroscopy-Based Metabolomic Data Processing and Analysis. Curr Bioinform 2012; 7:96-108. [PMID: 22438836 PMCID: PMC3299976 DOI: 10.2174/157489312799304431] [Citation(s) in RCA: 189] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2011] [Revised: 10/25/2011] [Accepted: 12/07/2011] [Indexed: 01/04/2023]
Abstract
Biological systems are increasingly being studied in a holistic manner, using omics approaches, to provide quantitative and qualitative descriptions of the diverse collection of cellular components. Among the omics approaches, metabolomics, which deals with the quantitative global profiling of small molecules or metabolites, is being used extensively to explore the dynamic response of living systems, such as organelles, cells, tissues, organs and whole organisms, under diverse physiological and pathological conditions. This technology is now used routinely in a number of applications, including basic and clinical research, agriculture, microbiology, food science, nutrition, pharmaceutical research, environmental science and the development of biofuels. Of the multiple analytical platforms available to perform such analyses, nuclear magnetic resonance and mass spectrometry have come to dominate, owing to the high resolution and large datasets that can be generated with these techniques. The large multidimensional datasets that result from such studies must be processed and analyzed to render this data meaningful. Thus, bioinformatics tools are essential for the efficient processing of huge datasets, the characterization of the detected signals, and to align multiple datasets and their features. This paper provides a state-of-the-art overview of the data processing tools available, and reviews a collection of recent reports on the topic. Data conversion, pre-processing, alignment, normalization and statistical analysis are introduced, with their advantages and disadvantages, and comparisons are made to guide the reader.
Collapse
Affiliation(s)
- Masahiro Sugimoto
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, Kanagawa 252-8520, Japan
- Graduate School of Medicine and Faculty of Medicine Kyoto University, Yoshida-Konoe-cho, Sakyo-ku, Kyoto 606-8501, Japan
| | - Masato Kawakami
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan
- Department of Environment and Information Studies, Keio University, Fujisawa, Kanagawa 252-8520, Japan
| | - Martin Robert
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, Kanagawa 252-8520, Japan
| | - Tomoyoshi Soga
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan
- Department of Environment and Information Studies, Keio University, Fujisawa, Kanagawa 252-8520, Japan
| | - Masaru Tomita
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan
- Department of Environment and Information Studies, Keio University, Fujisawa, Kanagawa 252-8520, Japan
| |
Collapse
|
17
|
Abstract
Metabolomics--the nontargeted measurement of all metabolites produced by the body--is beginning to show promise in both biomarker discovery and, in the form of pharmacometabolomics, in aiding the choice of therapy for patients with specific diseases. In its two basic forms (pattern recognition and metabolite identification), this developing field has been used to discover potential biomarkers in several renal diseases, including acute kidney injury (attributable to a variety of causes), autosomal dominant polycystic kidney disease and kidney cancer. NMR and gas chromatography or liquid chromatography, together with mass spectrometry, are generally used to separate and identify metabolites. Many hurdles need to be overcome in this field, such as achieving consistency in collection of biofluid samples, controlling for batch effects during the analysis and applying the most appropriate statistical analysis to extract the maximum amount of biological information from the data obtained. Pathway and network analyses have both been applied to metabolomic analysis, which vastly extends its clinical relevance and effects. In addition, pharmacometabolomics analyses, in which a metabolomic signature can be associated with a given therapeutic effect, are beginning to appear in the literature, which will lead to personalized therapies. Thus, metabolomics holds promise for early diagnosis, increased choice of therapy and the identification of new metabolic pathways that could potentially be targeted in kidney disease.
Collapse
|
18
|
Abstract
Metabolomics--the nontargeted measurement of all metabolites produced by the body--is beginning to show promise in both biomarker discovery and, in the form of pharmacometabolomics, in aiding the choice of therapy for patients with specific diseases. In its two basic forms (pattern recognition and metabolite identification), this developing field has been used to discover potential biomarkers in several renal diseases, including acute kidney injury (attributable to a variety of causes), autosomal dominant polycystic kidney disease and kidney cancer. NMR and gas chromatography or liquid chromatography, together with mass spectrometry, are generally used to separate and identify metabolites. Many hurdles need to be overcome in this field, such as achieving consistency in collection of biofluid samples, controlling for batch effects during the analysis and applying the most appropriate statistical analysis to extract the maximum amount of biological information from the data obtained. Pathway and network analyses have both been applied to metabolomic analysis, which vastly extends its clinical relevance and effects. In addition, pharmacometabolomics analyses, in which a metabolomic signature can be associated with a given therapeutic effect, are beginning to appear in the literature, which will lead to personalized therapies. Thus, metabolomics holds promise for early diagnosis, increased choice of therapy and the identification of new metabolic pathways that could potentially be targeted in kidney disease.
Collapse
|