1
|
Glace M, Moazeni-Pourasil RS, Cook DW, Roper TD. Iterative Regression of Corrective Baselines (IRCB): A New Model for Quantitative Spectroscopy. J Chem Inf Model 2024; 64:5006-5015. [PMID: 38897609 PMCID: PMC11234360 DOI: 10.1021/acs.jcim.4c00359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 06/05/2024] [Accepted: 06/10/2024] [Indexed: 06/21/2024]
Abstract
In this work, a new model with broad utility for quantitative spectroscopy development is reported. A primary objective of this work is to create a novel modeling procedure that may allow for higher automation of the model development process. The fundamental concept is simple yet powerful even for complex spectra and is employed with no additional preprocessing. This approach is applicable for several types of spectroscopic data to develop regression models that have similar or greater quality than the current methods. The key modeling steps are a matrix transformation and subsequent feature selection process that are collectively referred to as iterative regression of corrective baselines (IRCB). The transformed matrix (Xtransform) is a linearized form of the original X data set. Features from Xtransform that are predictive of Y can be ranked and selected by ordinary least-squares regression. The best features (rows of Xtransform) are linear depictions of Y that can be utilized to develop regression models with several machine learning models. The IRCB workflow is first detailed by using a case study of Fourier transform infrared (FTIR) spectroscopy for prepared solutions of a three-component mixture. Next, IRCB is applied and compared to benchmark results for the 2006 "Chimiométrie" near-infrared spectroscopy (NIR) soil composition challenge and Raman measurements of a simulated nuclear waste slurry.
Collapse
Affiliation(s)
- Matthew Glace
- Department
of Chemical and Life Sciences Engineering, Virginia Commonwealth University, Richmond, Virginia 23284, United States
| | | | - Daniel W. Cook
- Medicines
for All Institute, Virginia Commonwealth
University, Richmond, Virginia 23284, United States
| | - Thomas D. Roper
- Department
of Chemical and Life Sciences Engineering, Virginia Commonwealth University, Richmond, Virginia 23284, United States
| |
Collapse
|
2
|
Menżyk A, Martyna A, Damin A, Vincenti M, Zadora G. Breaking with trends in forensic dating: A likelihood ratio-based comparison approach. Forensic Sci Int 2023; 349:111763. [PMID: 37356322 DOI: 10.1016/j.forsciint.2023.111763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 06/04/2023] [Accepted: 06/08/2023] [Indexed: 06/27/2023]
Abstract
Further steps toward understanding the time-related information contained within bloodstains found at the crime scene are rightly considered a top priority in forensic science. Contrary to widely held assumptions, the reason for the delayed exploitation of bloodstains dating methods in practice is not the lack of suitable analytical techniques for monitoring degradation processes. The problem lies in the variability of the environmental and circumstantial conditions, playing a vital role in the degradation kinetics of blood deposits. The present article demonstrates the possibility of breaking with current approaches based on absolute age estimations to finally answer time-centered questions in real forensic scenarios. The proposed novel framework for situating forensic traces in time is based on the likelihood ratio assessment of the (dis)similarity between the evidence decomposition and sets of reference materials obtained through supervised aging. In such a strategy, every dating procedure is constructed on a case-by-case basis to fit examined blood traces, thereby limiting the adverse influence of external factors on the validity of age estimations and providing a way for future crime scene implementation.
Collapse
Affiliation(s)
- Alicja Menżyk
- Forensic Chemistry Research Group, Institute of Chemistry, University of Silesia in Katowice, Szkolna 9, 40-006 Katowice, Poland; Institute of Forensic Research in Krakow, Westerplatte 9, 31-003, Krakow, Poland.
| | - Agnieszka Martyna
- Forensic Chemistry Research Group, Institute of Chemistry, University of Silesia in Katowice, Szkolna 9, 40-006 Katowice, Poland
| | - Alessandro Damin
- Dipartimento di Chimica, Universita degli Studi di Torino, Via P. Giuria 7, 10125 Torino, Italy
| | - Marco Vincenti
- Dipartimento di Chimica, Universita degli Studi di Torino, Via P. Giuria 7, 10125 Torino, Italy; Centro Regionale Antidoping e di Tossicologia "A. Bertinaria", Regione Gonzole 10/1, Orbassano, 10043 Torino, Italy
| | - Grzegorz Zadora
- Forensic Chemistry Research Group, Institute of Chemistry, University of Silesia in Katowice, Szkolna 9, 40-006 Katowice, Poland; Institute of Forensic Research in Krakow, Westerplatte 9, 31-003, Krakow, Poland
| |
Collapse
|
3
|
Xu Q, Chen H, Ye S, Zeng Y, Lu H, Zhang Z. Standardization of Raman spectra using variable penalty dynamic time warping. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2021; 13:3414-3423. [PMID: 34254087 DOI: 10.1039/d1ay00541c] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Raman spectroscopy can provide structural fingerprints to identify molecules by means of spectral library searching. However, it is difficult to share the spectral library between different Raman spectrometers because of the nonlinear displacement in Raman shift. In this study, we propose a Raman spectra Standardization method using Variable Penalty dynamic time warping (RS-VPdtw), which can synchronize the nonlinear displacement between spectra acquired with different spectrometers. We have compared the standardization performance of RS-VPdtw and MWFFT on the spectra of 13 real samples acquired with 6 different spectrometers. The mean spectral similarity of RS-VPdtw and MWFFT increased from 0.79 to 0.97 and 0.91 respectively. Results show that RS-VPdtw is significantly better than MWFFT in Raman spectra standardization. The Raman spectra acquired with different spectrometers can be standardized by RS-VPdtw to search the same spectral library, which can avoid the time-consuming and labor-intensive reestablishment of spectral libraries for different spectrometers. This means that RS-VPdtw is a promising and valuable method to solve the spectra standardization problem in large-scale applications of Raman spectroscopy.
Collapse
Affiliation(s)
- Qingyu Xu
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China.
| | | | | | | | | | | |
Collapse
|
4
|
Zhao Z, Jin Y, Shi P, Xue Y, Zhao B, Zhang Y, Huang F, Bi P, Wang Q. An Improved High-Throughput Data Processing Based on Combinatorial Materials Chip Approach for Rapid Construction of Fe-Cr-Ni Composition-Phase Map. ACS COMBINATORIAL SCIENCE 2019; 21:833-842. [PMID: 31663716 DOI: 10.1021/acscombsci.9b00149] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The combinatorial materials chip approach is vastly superior to the conventional one that characterizes one sample at a time in the efficiency of composition-phase map construction. However, the resolution of its high-throughput characterization and the correct rate of automated composition-phase mapping are often affected by inherent experimental limitations and imperfect automated analyses, respectively. Therefore, effective data preprocessing and refined automated analysis methods are required to automatically process huge amounts of experiment data to score a higher correct rate. In this work, the pixel-by-pixel structural and compositional characterization of the Fe-Cr-Ni combinatorial materials chip annealed at 750 °C was performed by microbeam X-ray at a synchrotron light source and by electron probe microanalysis, respectively. The severe baseline drift and system noise in the X-ray diffraction patterns were successfully eliminated by the three-step automated preprocessing (baseline drift removal, noise elimination, and baseline correction) proposed, which was beneficial to the subsequent quantitative analysis of the patterns. Through the injection of human experience, hierarchy clustering analyses, based on three dissimilarity measures (the cosine, Pearson correlation coefficient, and Jenson-Shannon divergence), were further accelerated by the simplified vectorization of the preprocessed X-ray diffraction patterns. As a result, a correct rate of 91.15% was reached for the whole map built automatically in comparison with the one constructed manually, which confirmed that the present data processing could assist humans to improve and expedite the processing of X-ray diffraction patterns and was feasible for composition-phase mapping. The constructed maps were generally consistent with the corresponding isothermal section of the Fe-Cr-Ni ternary alloy system in the ASM Alloy Phase Diagram Database except the inexistence of the σ phase under insufficient annealing.
Collapse
Affiliation(s)
- Zhaoyang Zhao
- National Center for Materials Service Safety, University of Science and Technology Beijing, Beijing 100083, China
| | - Ying Jin
- National Center for Materials Service Safety, University of Science and Technology Beijing, Beijing 100083, China
| | - Peng Shi
- National Center for Materials Service Safety, University of Science and Technology Beijing, Beijing 100083, China
| | - Yanpeng Xue
- National Center for Materials Service Safety, University of Science and Technology Beijing, Beijing 100083, China
| | - Bingbing Zhao
- Materials Genome Initiative Center, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yanpeng Zhang
- China Building Materials Academy, Beijing 100024, China
| | - Feifei Huang
- National Center for Materials Service Safety, University of Science and Technology Beijing, Beijing 100083, China
| | - Peng Bi
- National Center for Materials Service Safety, University of Science and Technology Beijing, Beijing 100083, China
| | - Qingrui Wang
- National Center for Materials Service Safety, University of Science and Technology Beijing, Beijing 100083, China
| |
Collapse
|
5
|
León-Bejarano F, Méndez MO, Ramírez-Elías MG, Alba A. Improved Vancouver Raman Algorithm Based on Empirical Mode Decomposition for Denoising Biological Samples. APPLIED SPECTROSCOPY 2019; 73:1436-1450. [PMID: 31411494 DOI: 10.1177/0003702819860121] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
A novel method based on the Vancouver Raman algorithm (VRA) and empirical mode decomposition (EMD) for denoising Raman spectra of biological samples is presented. The VRA is one of the most used methods for denoising Raman spectroscopy and is composed of two main steps: signal filtering and polynomial fitting. However, the signal filtering step consists in a simple mean filter that could eliminate spectrum peaks with small intensities or merge relatively close spectrum peaks into one single peak. Thus, the result is often sensitive to the order of the mean filter, so the user must choose it carefully to obtain the expected result; this introduces subjectivity in the process. To overcome these disadvantages, we propose a new algorithm, namely the modified-VRA (mVRA) with the following improvements: (1) to replace the mean filter step by EMD as an adaptive parameter-free signal processing method; and (2) to automate the selection of polynomial degree. The denoising capabilities of VRA, EMD, and mVRA were compared in Raman spectra of artificial data based on Teflon material, synthetic material obtained from vitamin E and paracetamol, and biological material of human nails and mouse brain. The correlation coefficient (ρ) was used to compare the performance of the methods. For the artificial Raman spectra, the denoised signal obtained by mVRA (ρ>0.91) outperforms VRA (ρ>0.86) for moderate to high noise levels whereas mVRA outperformed EMD (ρ>0.90) for high noise levels. On the other hand, when it comes to modeling the underlying fluorescence signal of the samples (i.e., the baseline trend), the proposed method mVRA showed consistent results (ρ>0.94). For Raman spectra of synthetic material, good performance of the three methods (ρ=0.99 for VRA, ρ=0.93 for EMD, and ρ=0.99 for mVRA) was obtained. Finally, in the biological material, mVRA and VRA showed similar results (ρ=0.96 for VRA, ρ=0.85 for EMD, and ρ=0.91 for mVRA); however, mVRA retains valuable information corresponding to relevant Raman peaks with small amplitude. Thus, the application of EMD as a filter in the VRA method provides a good alternative for denoising biological Raman spectra, since the information of the Raman peaks is conserved and parameter tuning is not required. Simultaneously, EMD allows the baseline correction to be automated.
Collapse
Affiliation(s)
- Fabiola León-Bejarano
- Facultad de Ciencias, Universidad Autónoma de San Luis Potosí, San Luis Potosí, SLP, México
| | - Martin O Méndez
- Laboratorio Nacional CI3M, Facultad de Ciencias & CICSaB, Universidad Autónoma de San Luis Potosí, San Luis Potosí, SLP, México
| | - Miguel G Ramírez-Elías
- Facultad de Ciencias, Universidad Autónoma de San Luis Potosí, San Luis Potosí, SLP, México
| | - Alfonso Alba
- Laboratorio Nacional CI3M, Facultad de Ciencias & CICSaB, Universidad Autónoma de San Luis Potosí, San Luis Potosí, SLP, México
| |
Collapse
|
6
|
Dünder E, Gümüştekin S, Murat N, Cengiz MA. Subset selection in quantile regression analysis via alternative Bayesian information criteria and heuristic optimization. COMMUN STAT-THEOR M 2017. [DOI: 10.1080/03610926.2016.1257718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Emre Dünder
- Ondokuz Mayıs University, Faculty of Science, Department of Statistics, Samsun, Turkey
| | - Serpil Gümüştekin
- Ondokuz Mayıs University, Faculty of Science, Department of Statistics, Samsun, Turkey
| | - Naci Murat
- Ondokuz Mayıs University, Faculty of Engineering, Department of Endustrial Engineering, Samsun, Turkey
| | - Mehmet Ali Cengiz
- Ondokuz Mayıs University, Faculty of Science, Department of Statistics, Samsun, Turkey
| |
Collapse
|
7
|
Giguere S, Boucher T, Carey CJ, Mahadevan S, Dyar MD. A Fully Customized Baseline Removal Framework for Spectroscopic Applications. APPLIED SPECTROSCOPY 2017; 71:1457-1470. [PMID: 28664778 DOI: 10.1177/0003702817695624] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
The task of proper baseline or continuum removal is common to nearly all types of spectroscopy. Its goal is to remove any portion of a signal that is irrelevant to features of interest while preserving any predictive information. Despite the importance of baseline removal, median or guessed default parameters are commonly employed, often using commercially available software supplied with instruments. Several published baseline removal algorithms have been shown to be useful for particular spectroscopic applications but their generalizability is ambiguous. The new Custom Baseline Removal (Custom BLR) method presented here generalizes the problem of baseline removal by combining operations from previously proposed methods to synthesize new correction algorithms. It creates novel methods for each technique, application, and training set, discovering new algorithms that maximize the predictive accuracy of the resulting spectroscopic models. In most cases, these learned methods either match or improve on the performance of the best alternative. Examples of these advantages are shown for three different scenarios: quantification of components in near-infrared spectra of corn and laser-induced breakdown spectroscopy data of rocks, and classification/matching of minerals using Raman spectroscopy. Software to implement this optimization is available from the authors. By removing subjectivity from this commonly encountered task, Custom BLR is a significant step toward completely automatic and general baseline removal in spectroscopic and other applications.
Collapse
Affiliation(s)
- Stephen Giguere
- 1 College of Information and Computer Science, University of Massachusetts, Amherst, MA
| | - Thomas Boucher
- 1 College of Information and Computer Science, University of Massachusetts, Amherst, MA
| | - C J Carey
- 1 College of Information and Computer Science, University of Massachusetts, Amherst, MA
| | | | - M Darby Dyar
- 3 Department of Astronomy, Mount Holyoke College, South Hadley, MA
| |
Collapse
|
8
|
Erny GL, Acunha T, Simó C, Cifuentes A, Alves A. Background correction in separation techniques hyphenated to high-resolution mass spectrometry - Thorough correction with mass spectrometry scans recorded as profile spectra. J Chromatogr A 2017; 1492:98-105. [PMID: 28267998 DOI: 10.1016/j.chroma.2017.02.052] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2016] [Revised: 01/31/2017] [Accepted: 02/23/2017] [Indexed: 01/19/2023]
Abstract
Separation techniques hyphenated with high-resolution mass spectrometry have been a true revolution in analytical separation techniques. Such instruments not only provide unmatched resolution, but they also allow measuring the peaks accurate masses that permit identifying monoisotopic formulae. However, data files can be large, with a major contribution from background noise and background ions. Such unnecessary contribution to the overall signal can hide important features as well as decrease the accuracy of the centroid determination, especially with minor features. Thus, noise and baseline correction can be a valuable pre-processing step. The methodology that is described here, unlike any other approach, is used to correct the original dataset with the MS scans recorded as profiles spectrum. Using urine metabolic studies as examples, we demonstrate that this thorough correction reduces the data complexity by more than 90%. Such correction not only permits an improved visualisation of secondary peaks in the chromatographic domain, but it also facilitates the complete assignment of each MS scan which is invaluable to detect possible comigration/coeluting species.
Collapse
Affiliation(s)
- Guillaume L Erny
- LEPABE - Laboratory for Process Engineering, Environment, Biotechnology and Energy, Faculdade de Engenharia da Universidade do Porto, Rua Dr. Roberto Frias, 4200-465 Porto, Portugal.
| | - Tanize Acunha
- Laboratory of Foodomics, CIAL, CSIC, Nicolas Cabrera 9, 28049 Madrid, Spain; CAPES Foundation, Ministry of Education of Brazil, 70040-020 Brasília, DF, Brazil
| | - Carolina Simó
- Laboratory of Foodomics, CIAL, CSIC, Nicolas Cabrera 9, 28049 Madrid, Spain
| | | | - Arminda Alves
- LEPABE - Laboratory for Process Engineering, Environment, Biotechnology and Energy, Faculdade de Engenharia da Universidade do Porto, Rua Dr. Roberto Frias, 4200-465 Porto, Portugal
| |
Collapse
|
9
|
Mani-Varnosfaderani A, Kanginejad A, Gilany K, Valadkhani A. Estimating complicated baselines in analytical signals using the iterative training of Bayesian regularized artificial neural networks. Anal Chim Acta 2016; 940:56-64. [PMID: 27662759 DOI: 10.1016/j.aca.2016.08.046] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2016] [Revised: 08/21/2016] [Accepted: 08/30/2016] [Indexed: 11/18/2022]
Abstract
The present work deals with the development of a new baseline correction method based on the comparative learning capabilities of artificial neural networks. The developed method uses the Bayes probability theorem for prevention of the occurrence of the over-fitting and finding a generalized baseline. The developed method has been applied on simulated and real metabolomic gas-chromatography (GC) and Raman data sets. The results revealed that the proposed method can be used to handle different types of baselines with cave, convex, curvelinear, triangular and sinusoidal patterns. For further evaluation of the performances of this method, it has been compared with benchmarking baseline correction methods such as corner-cutting (CC), morphological weighted penalized least squares (MPLS), adaptive iteratively-reweighted penalized least squares (airPLS) and iterative polynomial fitting (iPF). In order to compare the methods, the projected difference resolution (PDR) criterion has been calculated for the data before and after the baseline correction procedure. The calculated values of PDR after the baseline correction using iBRANN, airPLS, MPLS, iPF and CC algorithms for the GC metabolomic data were 4.18, 3.64, 3.88, 1.88 and 3.08, respectively. The obtained results in this work demonstrated that the developed iterative Bayesian regularized neural network (iBRANN) method in this work thoroughly detects the baselines and is superior over the CC, MPLS, airPLS and iPF techniques. A graphical user interface has been developed for the suggested algorithm and can be used for easy implementation of the iBRANN algorithm for the correction of different chromatography, NMR and Raman data sets.
Collapse
Affiliation(s)
- Ahmad Mani-Varnosfaderani
- Chemometrics and Chemoinformatics Laboratory, Department of Chemistry, Faculty of Sciences, Tarbiat Modares University, P.O. Box 14115-175, Tehran, Iran.
| | - Atefeh Kanginejad
- Chemometrics and Chemoinformatics Laboratory, Department of Chemistry, Faculty of Sciences, Tarbiat Modares University, P.O. Box 14115-175, Tehran, Iran
| | - Kambiz Gilany
- Reproductive Biotechnology Research Center, Avicenna Research Institute, ACECR, Tehran, Iran
| | - Abolfazl Valadkhani
- Department of Analytical Chemistry, Chemistry and Chemical Engineering Research Center of Iran, P. O. Box 14335-186, Tehran, Iran
| |
Collapse
|
10
|
Narath SH, Mautner SI, Svehlikova E, Schultes B, Pieber TR, Sinner FM, Gander E, Libiseller G, Schimek MG, Sourij H, Magnes C. An Untargeted Metabolomics Approach to Characterize Short-Term and Long-Term Metabolic Changes after Bariatric Surgery. PLoS One 2016; 11:e0161425. [PMID: 27584017 PMCID: PMC5008721 DOI: 10.1371/journal.pone.0161425] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2015] [Accepted: 07/20/2016] [Indexed: 12/28/2022] Open
Abstract
Bariatric surgery is currently one of the most effective treatments for obesity and leads to significant weight reduction, improved cardiovascular risk factors and overall survival in treated patients. To date, most studies focused on short-term effects of bariatric surgery on the metabolic profile and found high variation in the individual responses to surgery. The aim of this study was to identify relevant metabolic changes not only shortly after bariatric surgery (Roux-en-Y gastric bypass) but also up to one year after the intervention by using untargeted metabolomics. 132 serum samples taken from 44 patients before surgery, after hospital discharge (1-3 weeks after surgery) and at a 1-year follow-up during a prospective study (NCT01271062) performed at two study centers (Austria and Switzerland). The samples included 24 patients with type 2 diabetes at baseline, thereof 9 with diabetes remission after one year. The samples were analyzed by using liquid chromatography coupled to high resolution mass spectrometry (LC-HRMS, HILIC-QExactive). Raw data was processed with XCMS and drift-corrected through quantile regression based on quality controls. 177 relevant metabolic features were selected through Random Forests and univariate testing and 36 metabolites were identified. Identified metabolites included trimethylamine-N-oxide, alanine, phenylalanine and indoxyl-sulfate which are known markers for cardiovascular risk. In addition we found a significant decrease in alanine after one year in the group of patients with diabetes remission relative to non-remission. Our analysis highlights the importance of assessing multiple points in time in subjects undergoing bariatric surgery to enable the identification of biomarkers for treatment response, cardiovascular benefit and diabetes remission. Key-findings include different trend pattern over time for various metabolites and demonstrated that short term changes should not necessarily be used to identify important long term effects of bariatric surgery.
Collapse
Affiliation(s)
- Sophie H. Narath
- JOANNEUM RESEARCH Forschungsgesellschaft mbH HEALTH Institute for Biomedicine and Health Sciences, Graz, Austria
| | - Selma I. Mautner
- JOANNEUM RESEARCH Forschungsgesellschaft mbH HEALTH Institute for Biomedicine and Health Sciences, Graz, Austria
- Medical University of Graz, Department of Internal Medicine, Division of Endocrinology and Diabetology, Graz, Austria
- CBmed – Center of Biomarker Research in Medicine, Stiftingtalstrasse 5, 8010 Graz, Austria
| | - Eva Svehlikova
- Medical University of Graz, Department of Internal Medicine, Division of Endocrinology and Diabetology, Graz, Austria
| | - Bernd Schultes
- eSwiss Medical & Surgical Center, St. Gallen, Switzerland
| | - Thomas R. Pieber
- JOANNEUM RESEARCH Forschungsgesellschaft mbH HEALTH Institute for Biomedicine and Health Sciences, Graz, Austria
- Medical University of Graz, Department of Internal Medicine, Division of Endocrinology and Diabetology, Graz, Austria
- CBmed – Center of Biomarker Research in Medicine, Stiftingtalstrasse 5, 8010 Graz, Austria
| | - Frank M. Sinner
- JOANNEUM RESEARCH Forschungsgesellschaft mbH HEALTH Institute for Biomedicine and Health Sciences, Graz, Austria
- Medical University of Graz, Department of Internal Medicine, Division of Endocrinology and Diabetology, Graz, Austria
| | - Edgar Gander
- JOANNEUM RESEARCH Forschungsgesellschaft mbH HEALTH Institute for Biomedicine and Health Sciences, Graz, Austria
| | - Gunnar Libiseller
- JOANNEUM RESEARCH Forschungsgesellschaft mbH HEALTH Institute for Biomedicine and Health Sciences, Graz, Austria
| | - Michael G. Schimek
- Institute for Medical Informatics, Statistics and Documentation Medical University of Graz, Graz, Austria
| | - Harald Sourij
- Medical University of Graz, Department of Internal Medicine, Division of Endocrinology and Diabetology, Graz, Austria
- CBmed – Center of Biomarker Research in Medicine, Stiftingtalstrasse 5, 8010 Graz, Austria
- * E-mail:
| | - Christoph Magnes
- JOANNEUM RESEARCH Forschungsgesellschaft mbH HEALTH Institute for Biomedicine and Health Sciences, Graz, Austria
| |
Collapse
|
11
|
|
12
|
Guo S, Bocklitz T, Popp J. Optimization of Raman-spectrum baseline correction in biological application. Analyst 2016; 141:2396-404. [PMID: 26907832 DOI: 10.1039/c6an00041j] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
In the last decade Raman-spectroscopy has become an invaluable tool for biomedical diagnostics. However, a manual rating of the subtle spectral differences between normal and abnormal disease states is not possible or practical. Thus it is necessary to combine Raman-spectroscopy with chemometrics in order to build statistical models predicting the disease states directly without manual intervention. Within chemometrical analysis a number of corrections have to be applied to receive robust models. Baseline correction is an important step of the pre-processing, which should remove spectral contributions of fluorescence effects and improve the performance and robustness of statistical models. However, it is demanding, time-consuming, and depends on expert knowledge to select an optimal baseline correction method and its parameters every time working with a new dataset. To circumvent this issue we proposed a genetic algorithm based method to automatically optimize the baseline correction. The investigation was carried out in three main steps. Firstly, a numerical quantitative marker was defined to evaluate the baseline estimation quality. Secondly, a genetic algorithm based methodology was established to search the optimal baseline estimation with the defined quantitative marker as evaluation function. Finally, classification models were utilized to benchmark the performance of the optimized baseline. For comparison, model based baseline optimization was carried out applying the same classifiers. It was proven that our method could provide a semi-optimal and stable baseline estimation without any chemical knowledge required or any additional spectral information used.
Collapse
Affiliation(s)
- Shuxia Guo
- Institute of Physical Chemistry and Abbe School of Photonics, Friedrich-Schiller-University, Jena, Helmholtzweg 4, D-07743 Jena, Germany
| | | | | |
Collapse
|
13
|
Chemometrics applied to quality control and metabolomics for traditional Chinese medicines. J Chromatogr B Analyt Technol Biomed Life Sci 2016; 1015-1016:82-91. [PMID: 26901849 DOI: 10.1016/j.jchromb.2016.02.011] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2015] [Revised: 02/03/2016] [Accepted: 02/06/2016] [Indexed: 02/08/2023]
Abstract
Traditional Chinese medicines (TCMs) bring a great challenge in quality control and evaluating the efficacy because of their complexity of chemical composition. Chemometric techniques provide a good opportunity for mining more useful chemical information from TCMs. Then, the application of chemometrics in the field of TCMs is spontaneous and necessary. This review focuses on the recent various important chemometrics tools for chromatographic fingerprinting, including peak alignment information features, baseline correction and applications of chemometrics in metabolomics and modernization of TCMs, including authentication and evaluation of the quality of TCMs, evaluating the efficacy of TCMs and essence of TCM syndrome. In the conclusions, the general trends and some recommendations for improving chromatographic metabolomics data analysis are provided.
Collapse
|
14
|
Mohamed A, Nguyen CH, Mamitsuka H. Current status and prospects of computational resources for natural product dereplication: a review. Brief Bioinform 2015; 17:309-21. [PMID: 26153512 DOI: 10.1093/bib/bbv042] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2015] [Indexed: 01/08/2023] Open
Abstract
Research in natural products has always enhanced drug discovery by providing new and unique chemical compounds. However, recently, drug discovery from natural products is slowed down by the increasing chance of re-isolating known compounds. Rapid identification of previously isolated compounds in an automated manner, called dereplication, steers researchers toward novel findings, thereby reducing the time and effort for identifying new drug leads. Dereplication identifies compounds by comparing processed experimental data with those of known compounds, and so, diverse computational resources such as databases and tools to process and compare compound data are necessary. Automating the dereplication process through the integration of computational resources has always been an aspired goal of natural product researchers. To increase the utilization of current computational resources for natural products, we first provide an overview of the dereplication process, and then list useful resources, categorizing into databases, methods and software tools and further explaining them from a dereplication perspective. Finally, we discuss the current challenges to automating dereplication and proposed solutions.
Collapse
|
15
|
Zhang M, Wen M, Zhang ZM, Lu H, Liang Y, Zhan D. Robust alignment of chromatograms by statistically analyzing the shifts matrix generated by moving window fast Fourier transform cross-correlation. J Sep Sci 2015; 38:965-74. [PMID: 25645318 DOI: 10.1002/jssc.201401235] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2014] [Revised: 12/28/2014] [Accepted: 12/28/2014] [Indexed: 11/09/2022]
Abstract
Retention time shift is one of the most challenging problems during the preprocessing of massive chromatographic datasets. Here, an improved version of the moving window fast Fourier transform cross-correlation algorithm is presented to perform nonlinear and robust alignment of chromatograms by analyzing the shifts matrix generated by moving window procedure. The shifts matrix in retention time can be estimated by fast Fourier transform cross-correlation with a moving window procedure. The refined shift of each scan point can be obtained by calculating the mode of corresponding column of the shifts matrix. This version is simple, but more effective and robust than the previously published moving window fast Fourier transform cross-correlation method. It can handle nonlinear retention time shift robustly if proper window size has been selected. The window size is the only one parameter needed to adjust and optimize. The properties of the proposed method are investigated by comparison with the previous moving window fast Fourier transform cross-correlation and recursive alignment by fast Fourier transform using chromatographic datasets. The pattern recognition results of a gas chromatography mass spectrometry dataset of metabolic syndrome can be improved significantly after preprocessing by this method. Furthermore, the proposed method is available as an open source package at https://github.com/zmzhang/MWFFT2.
Collapse
Affiliation(s)
- Mingjing Zhang
- College of Chemistry and Chemical Engineering, Central South University, Changsha, P. R. China
| | | | | | | | | | | |
Collapse
|
16
|
Zhang ZM, Tong X, Peng Y, Ma P, Zhang MJ, Lu HM, Chen XQ, Liang YZ. Multiscale peak detection in wavelet space. Analyst 2015; 140:7955-64. [DOI: 10.1039/c5an01816a] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Multi-scale peak detection (MSPD) for analytical instruments is presented by taking full advantage of additional information in wavelet space including ridges, valleys, and zero-crossings.
Collapse
Affiliation(s)
- Zhi-Min Zhang
- College of Chemistry and Chemical Engineering
- Central South University
- Changsha 410083
- China
- Institute of Chemometrics and Intelligent Instruments
| | - Xia Tong
- College of Chemistry and Chemical Engineering
- Central South University
- Changsha 410083
- China
| | - Ying Peng
- College of Chemistry and Chemical Engineering
- Central South University
- Changsha 410083
- China
| | - Pan Ma
- College of Chemistry and Chemical Engineering
- Central South University
- Changsha 410083
- China
| | - Ming-Jin Zhang
- College of Chemistry and Chemical Engineering
- Central South University
- Changsha 410083
- China
| | - Hong-Mei Lu
- College of Chemistry and Chemical Engineering
- Central South University
- Changsha 410083
- China
- Institute of Chemometrics and Intelligent Instruments
| | - Xiao-Qing Chen
- College of Chemistry and Chemical Engineering
- Central South University
- Changsha 410083
- China
| | - Yi-Zeng Liang
- Institute of Chemometrics and Intelligent Instruments
- Central South University
- Changsha 410083
- P.R. China
| |
Collapse
|