1
|
Sitorus A, Lapcharoensuk R. Development of automatic tuning for combined preprocessing and hyperparameters of machine learning and its application to NIR spectral data of coconut milk adulteration. Food Chem 2024; 457:140108. [PMID: 38905832 DOI: 10.1016/j.foodchem.2024.140108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 05/29/2024] [Accepted: 06/12/2024] [Indexed: 06/23/2024]
Abstract
This study proposed a novel approach to automatically select the preprocessing methods and hyperparameters of machine learning (ML) algorithms based on their best performance in cross-validation for near-infrared (NIR) spectroscopy data. The proposed method simultaneously incorporates single or multiple-preprocessing steps and tunes hyperparameters to determine the best model performance for FT-NIR and Micro-NIR spectral data of coconut milk adulteration with distilled water and mature coconut water in the range of 0%-50%. Computational experiments were conducted using nine single preprocessing types, three types of ML classifier (linear discriminant analysis (LDA), k-nearest neighbour (KNN), multilayer perceptron (MLP)) and three types of ML regressor (partial least squares (PLS), KNN, MLP). The proposed performance strategy effectively addressed and produced satisfactory outcomes for classification and regression challenges in coconut milk adulteration. Finally, the results demonstrated that the proposed approach can more accurately determine the best model, particularly for NIR spectroscopy of coconut milk adulteration.
Collapse
Affiliation(s)
- Agustami Sitorus
- Department of Agricultural Engineering, School of Engineering, King Mongkut's Institute of Technology Ladkrabang, Bangkok 10520, Thailand; National Research and Innovation Agency (BRIN), Jakarta Pusat 10340, Indonesia
| | - Ravipat Lapcharoensuk
- Department of Agricultural Engineering, School of Engineering, King Mongkut's Institute of Technology Ladkrabang, Bangkok 10520, Thailand.
| |
Collapse
|
2
|
Yang SW, Xie Y, Liu JZ, Zhang D, Huang J, Liang P. A novel method for quantitative determination of multiple substances using Raman spectroscopy combined with CWT. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2024; 317:124427. [PMID: 38754205 DOI: 10.1016/j.saa.2024.124427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 04/21/2024] [Accepted: 05/07/2024] [Indexed: 05/18/2024]
Abstract
The identification of mixed solutions is a challenging and important subject in chemical analysis. In this paper, we propose a novel workflow that enables rapid qualitative and quantitative detection of mixed solutions. We use a methanol-ethanol mixed solution as an example to demonstrate the superiority of this workflow. The workflow includes the following steps: (1) converting Raman spectra into Raman images through CWT; (2) using MobileNetV3 as the backbone network, improved multi-label and multi-channel synchronization enables simultaneous prediction of multiple mixture concentrations; and (3) using transfer learning and multi-stage training strategies for training to achieve accurate quantitative analysis. We compare six traditional machine learning algorithms and two deep learning models to evaluate the performance of our new method. The experimental results show that our model has achieved good prediction results when predicting the concentration of methanol and ethanol, and the coefficient of determination R2 is greater than 0.999. At different concentrations, both MAPE and RSD outperform other models, which demonstrates that our workflow has outstanding analytical capabilities. Importantly, we have solved the problem that current quantitative analysis algorithms for Raman spectroscopy are almost unable to accurately predict the concentration of multiple substances simultaneously. In conclusion, it is foreseeable that this non-destructive, automated, and highly accurate workflow can further advance Raman spectroscopy.
Collapse
Affiliation(s)
- Si-Wei Yang
- College of Optical and Electronic Technology, China Jiliang University, Hangzhou 310018, China
| | - Yuhao Xie
- College of Optical and Electronic Technology, China Jiliang University, Hangzhou 310018, China
| | - Jia-Zhen Liu
- College of Optical and Electronic Technology, China Jiliang University, Hangzhou 310018, China
| | - De Zhang
- College of Horticulture & Forestry Sciences, Key Laboratory of Horticultural Plant Biology, Ministry of Education, Huazhong Agricultural University, Wuhan 430070, China
| | - Jie Huang
- College of Optical and Electronic Technology, China Jiliang University, Hangzhou 310018, China
| | - Pei Liang
- College of Optical and Electronic Technology, China Jiliang University, Hangzhou 310018, China.
| |
Collapse
|
3
|
Huang H, Fang Z, Xu Y, Lu G, Feng C, Zeng M, Tian J, Ping Y, Han Z, Zhao Z. Stacking and ridge regression-based spectral ensemble preprocessing method and its application in near-infrared spectral analysis. Talanta 2024; 276:126242. [PMID: 38761656 DOI: 10.1016/j.talanta.2024.126242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Revised: 05/08/2024] [Accepted: 05/09/2024] [Indexed: 05/20/2024]
Abstract
Spectral preprocessing techniques can, to a certain extent, eliminate irrelevant information, such as current noise and stray light from spectral data, thereby enhancing the performance of prediction models. However, current preprocessing techniques mostly attempt to find the best single preprocessing method or their combination, overlooking the complementary information among different preprocessing methods. These preprocessing techniques fail to maximize the utilization of useful information in spectral data and restrict the performance of prediction models. This study proposed a spectral ensemble preprocessing method based on the rapidly developing ensemble learning methods in recent years and the ridge regression (RR) model, named stacking preprocessing ridge regression (SPRR), to address the aforementioned issues. Different from conventional ensemble learning methods, the proposed SPRR method applied multiple different preprocessing techniques to the original spectral data, generating multiple preprocessed datasets. These datasets were then individually inputted into RR base models for training. Ultimately, RR still served as the meta-model, integrating the output results of each RR base model through stacking. This approach not only produced diversity in base models but also achieved higher accuracy and lower computational complexity by using a single type of base model. On the apple spectral dataset collected by our team, correlation analysis showed significant complementary information among the data produced by different preprocessing techniques. This provided robust theoretical support for the proposed SPRR method. By introducing the currently popular averaging ensemble preprocessing method in a comparative experiment, the results of applying the proposed SPRR method to six datasets (apple, meat, wheat, olive oil, tablet, and corn) demonstrated that compared to the single preprocessing method and averaging ensemble preprocessing method, SPRR yielded the best accuracy and reliability for all six datasets. Furthermore, under the same conditions of the training and test datasets, the proposed SPRR method demonstrated better performance than the four commonly used ensemble preprocessing methods.
Collapse
Affiliation(s)
- Haowen Huang
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Zile Fang
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Yuelong Xu
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Guosheng Lu
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Can Feng
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Min Zeng
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Jiaju Tian
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Yongfu Ping
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Zhuolin Han
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China
| | - Zhigang Zhao
- College of New Materials and New Energies, Shenzhen Technology University, Shenzhen, 518118, PR China.
| |
Collapse
|
4
|
Awotunde O, Cai J, Azar CGE, Medina D, Eyolfson SI, Hayes K, Waffo C, Djang'eing'a RM, Ziemons EM, Sacré PY, Lieberman M. Field assessment of active ingredient quantity in pharmaceutical tablets with limited calibration of near infrared spectra: An application to ciprofloxacin tablets. J Pharm Biomed Anal 2024; 246:116189. [PMID: 38733763 DOI: 10.1016/j.jpba.2024.116189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 04/12/2024] [Accepted: 04/26/2024] [Indexed: 05/13/2024]
Abstract
Portable near-infrared (NIR) spectrophotometers have emerged as valuable tools for identifying substandard and falsified pharmaceuticals (SFPs). Integration of these devices with chemometric and machine learning models enhances their ability to provide quantitative chemical insights. However, different NIR spectrophotometer models vary in resolution, sensitivity, and responses to environmental factors such as temperature and humidity, necessitating instrument-specific libraries that hinder the wider adoption of NIR technology. This study addresses these challenges and seeks to establish a robust approach to promote the use of NIR technology in post-market pharmaceutical analysis. We developed support vector machine and partial least squares regression models based on binary mixtures of lab-made ciprofloxacin and microcrystalline cellulose, then applied the models to ciprofloxacin dosage forms that were assayed with high performance liquid chromatography (HPLC). A receiver operating characteristic (ROC) analysis was performed to set spectrophotometer independent NIR metrics to evaluate ciprofloxacin dosage forms as "meets standard," "needs HPLC assay," or "fails standard." Over 200 ciprofloxacin tablets representing 50 different brands were evaluated using spectra acquired from three types of NIR spectrophotometer with 85% of the prediction agreeing with HPLC testing. This study shows that non-brand-specific predictive models can be applied across multiple spectrophotometers for rapid screening of the conformity of pharmaceutical active ingredients to regulatory standard.
Collapse
Affiliation(s)
- Olatunde Awotunde
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Jin Cai
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN 46556, USA
| | | | - Diane Medina
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Samantha I Eyolfson
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Kathleen Hayes
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Christelle Waffo
- University of Liege (ULiege), CIRM, Laboratory of Pharmaceutical Analytical Chemistry, Department of Pharmacy, Liege, Belgium
| | - Roland Marini Djang'eing'a
- University of Liege (ULiege), CIRM, Laboratory of Pharmaceutical Analytical Chemistry, Department of Pharmacy, Liege, Belgium
| | - Eric M Ziemons
- University of Liege (ULiege), CIRM, Laboratory of Pharmaceutical Analytical Chemistry, Department of Pharmacy, Liege, Belgium
| | - Pierre-Yves Sacré
- University of Liege (ULiege), CIRM, Research Support Unit in Chemometrics, Department of Pharmacy, Liege, Belgium
| | - Marya Lieberman
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN 46556, USA.
| |
Collapse
|
5
|
Yang S, White B, de Santana FB, Hall RL, Daly K. Comparing the potential of benchtop and handheld mid-infrared spectrometers for predicting soil phosphorus (P) sorption capacity and evaluating the influence of sample preparation. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2024; 322:124856. [PMID: 39047667 DOI: 10.1016/j.saa.2024.124856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Revised: 07/09/2024] [Accepted: 07/19/2024] [Indexed: 07/27/2024]
Abstract
Traditional soil phosphorus (P) sorption capacity is examined from a Langmuir isotherm batch technique, which is time-consuming, labour intensive and generates chemical waste. In this work, we provide an efficient and convenient technique with MIR spectroscopy to predict the Langmuir parameter of soil P sorption maximum capacity (Smax, mg·kg-1). Four spectral libraries from benchtop (Bruker) and handheld (Agilent) MIR spectrometers were built with samples in two particle size ranges, <0.100 mm (ball-milled) and <2 mm. respectively. Using an archive of samples with a database of sorption parameters, soils were classified into 'low' and 'high' sorption capacities. Chemometric regression models of partial least squares (PLS), Cubist, support vector machine (SVM) regression and random forest (RF) were evaluated for Smax prediction. Bruker spectral libraries with both soil particle sizes yielded 'excellent models', with SVM predicting Smax values with high accuracy (RPIQV = 4.50 and 4.25 for the spectral libraries of the ball-milled and <2 mm samples, respectively). In comparison, the Agilent handheld spectral libraries contained more noise and less resolution. For Agilent MIR spectroscopy, more homogeneous samples after ball milling resulted in a higher accurate Smax prediction. For Agilent libraries of ball-milled samples, an 'approximate quantitative model' (RPIQV = 2.74) was obtained from the raw spectra using the Cubist algorithm. However, for Agilent spectroscopy of <2 mm samples, the best performing Cubist algorithm can only achieve a 'fair model' (RPIQV=2.23) with the potential to discriminate between 'low' and 'high' Smax values. The results suggest that the benchtop spectrometer can predict the Langmuir Smax value with high accuracy without the need to ball mill samples. However, the handheld spectrometer can only make approximate quantitative predictions of Smax for ball-milled samples. For <2 mm samples, Agilent can only be used to classify 'low' and 'high' sorption capacity soils.
Collapse
Affiliation(s)
- Sifan Yang
- Environment, Soils and Land Use Department, Teagasc, Johnstown Castle Research Centre, Wexford Y35 TC97, Ireland; DCU Water Institute, School of Chemical Sciences, Dublin City University, Glasnevin, Dublin 9 D09 E432, Ireland
| | - Blánaid White
- DCU Water Institute, School of Chemical Sciences, Dublin City University, Glasnevin, Dublin 9 D09 E432, Ireland
| | - Felipe B de Santana
- Environment, Soils and Land Use Department, Teagasc, Johnstown Castle Research Centre, Wexford Y35 TC97, Ireland
| | - Rebecca L Hall
- Environment, Soils and Land Use Department, Teagasc, Johnstown Castle Research Centre, Wexford Y35 TC97, Ireland
| | - Karen Daly
- Environment, Soils and Land Use Department, Teagasc, Johnstown Castle Research Centre, Wexford Y35 TC97, Ireland.
| |
Collapse
|
6
|
Milani NBL, García-Cicourel AR, Blomberg J, Edam R, Samanipour S, Bos TS, Pirok BWJ. Generating realistic data through modeling and parametric probability for the numerical evaluation of data processing algorithms in two-dimensional chromatography. Anal Chim Acta 2024; 1312:342724. [PMID: 38834259 DOI: 10.1016/j.aca.2024.342724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 04/22/2024] [Accepted: 05/14/2024] [Indexed: 06/06/2024]
Abstract
BACKGROUND Comprehensive two-dimensional chromatography generates complex data sets, and numerous baseline correction and noise removal algorithms have been proposed in the past decade to address this challenge. However, evaluating their performance objectively is currently not possible due to a lack of objective data. RESULT To tackle this issue, we introduce a versatile platform that models and reconstructs single-trace two-dimensional chromatography data, preserving peak parameters. This approach balances real experimental data with synthetic data for precise comparisons. We achieve this by employing a Skewed Lorentz-Normal model to represent each peak and creating probability distributions for relevant parameter sampling. The model's performance has been showcased through its application to two-dimensional gas chromatography data where it has created a data set with 458 peaks with an RMSE of 0.0048 or lower and minimal residuals compared to the original data. Additionally, the same process has been shown in liquid chromatography data. SIGNIFICANCE Data analysis is an integral component of any analytical method. The development of new data processing strategies is of paramount importance to tackle the complex signals generated by state-of-the-art separation technology. Through the use of probability distributions, quantitative assessment of algorithm performance of new algorithms is now possible. Therefore, creating new opportunities for faster, more accurate, and simpler data analysis development.
Collapse
Affiliation(s)
- Nino B L Milani
- Van't Hoff Institute for Molecular Science (HIMS), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands; Centre for Analytical Sciences Amsterdam (CASA), the Netherlands.
| | | | - Jan Blomberg
- Shell Global Solutions International B.V., Grasweg 31, 1031 HW, Amsterdam, the Netherlands
| | - Rob Edam
- Shell Global Solutions International B.V., Grasweg 31, 1031 HW, Amsterdam, the Netherlands
| | - Saer Samanipour
- Van't Hoff Institute for Molecular Science (HIMS), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands; Centre for Analytical Sciences Amsterdam (CASA), the Netherlands
| | - Tijmen S Bos
- Van't Hoff Institute for Molecular Science (HIMS), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands; Centre for Analytical Sciences Amsterdam (CASA), the Netherlands
| | - Bob W J Pirok
- Van't Hoff Institute for Molecular Science (HIMS), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands; Centre for Analytical Sciences Amsterdam (CASA), the Netherlands.
| |
Collapse
|
7
|
Kok YE, Crisford A, Parkes A, Venkateswaran S, Oreffo R, Mahajan S, Pound M. Classification of osteoarthritic and healthy cartilage using deep learning with Raman spectra. Sci Rep 2024; 14:15902. [PMID: 38987563 PMCID: PMC11237049 DOI: 10.1038/s41598-024-66857-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2024] [Accepted: 07/04/2024] [Indexed: 07/12/2024] Open
Abstract
Raman spectroscopy is a rapid method for analysing the molecular composition of biological material. However, noise contamination in the spectral data necessitates careful pre-processing prior to analysis. Here we propose an end-to-end Convolutional Neural Network to automatically learn an optimal combination of pre-processing strategies, for the classification of Raman spectra of superficial and deep layers of cartilage harvested from 45 Osteoarthritis and 19 Osteoporosis (Healthy controls) patients. Using 6-fold cross-validation, the Multi-Convolutional Neural Network achieves comparable or improved classification accuracy against the best-performing Convolutional Neural Network applied to either the raw or pre-processed spectra. We utilised Integrated Gradients to identify the contributing features (Raman signatures) in the network decision process, showing they are biologically relevant. Using these features, we compared Artificial Neural Networks, Decision Trees and Support Vector Machines for the feature selection task. Results show that training on fewer than 3 and 300 features, respectively, for the disease classification and layer assignment task provide performance comparable to the best-performing CNN-based network applied to the full dataset. Our approach, incorporating multi-channel input and Integrated Gradients, can potentially facilitate the clinical translation of Raman spectroscopy-based diagnosis without the need for laborious manual pre-processing and feature selection.
Collapse
Affiliation(s)
- Yong En Kok
- School of Computer Science, University of Nottingham, Nottingham, NG8 1BB, UK.
| | - Anna Crisford
- Institute of Life Sciences and Department of Chemistry, University of Southampton, Southampton, SO17 1BJ, UK
| | - Andrew Parkes
- School of Computer Science, University of Nottingham, Nottingham, NG8 1BB, UK
| | - Seshasailam Venkateswaran
- Precision Healthcare University Research Institute, Queen Mary University of London, London, E1 1HH, UK
| | - Richard Oreffo
- Bone and Joint Research Group, Centre for Human Development, Stem Cells and Regeneration, Institute of Developmental Sciences, University of Southampton, Southampton, SO16 6YD, UK
| | - Sumeet Mahajan
- Institute of Life Sciences and Department of Chemistry, University of Southampton, Southampton, SO17 1BJ, UK
| | - Michael Pound
- School of Computer Science, University of Nottingham, Nottingham, NG8 1BB, UK
| |
Collapse
|
8
|
Chen X, Lin K, Chen K, Wang L, Liu H, Ma P, Zeng L, Zhang X, Sui M, Chen H. Novel non-invasive method for urine mapping: Deep-learning-enabled SERS spectroscopy for the rapid differential detection of kidney allograft injury. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2024; 315:124255. [PMID: 38608562 DOI: 10.1016/j.saa.2024.124255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 03/16/2024] [Accepted: 04/03/2024] [Indexed: 04/14/2024]
Abstract
The kidney allograft has been under continuous attack from diverse injuries since the very beginning of organ procurement, leading to a gradual decline in function, chronic fibrosis, and allograft loss. It is vital to routinely and precisely monitor the risk of injuries after renal transplantation, which is difficult to achieve because the traditional laboratory tests lack sensitivity and specificity, and graft biopsies are invasive with the risk of many complications and time-consuming. Herein, a novel method for the diagnosis of graft injury is demonstrated, using deep learning-assisted surface-enhanced Raman spectroscopy (SERS) of the urine analysis. Specifically, we developed a hybrid SERS substrate composed of gold and silver with high sensitivity to the urine composition under test, eliminating the need for labels, which makes measurements easy to perform and meanwhile results in extremely abundant and complex Raman vibrational bands. Deep learning algorithms were then developed to improve the interpretation of the SERS spectral fingerprints. The deep learning model was trained with SERS signals of urine samples of recipients with different injury types including delayed graft function (DGF), calcineurin-inhibitor toxicity (CNIT), T cell-mediated rejection (TCMR), antibody-mediated rejection (AMR), and BK virus nephropathy (BKVN), which explored the features of these types and achieved the injury differentiation with an overall accuracy of 93.03%. The results highlight the potential of combining label-free SERS spectroscopy with deep learning as a method for liquid biopsy of kidney allograft injuries, which can provide great potential to diagnose and evaluate allograft injuries, and thus extend the life of kidney allografts.
Collapse
Affiliation(s)
- Xi Chen
- Key Laboratory of Optical Technology and Instrument for Medicine, Ministry of Education, University of Shanghai for Science and Technology, Shanghai 200093, China
| | - Kailin Lin
- Department of Thoracic Surgery, Fudan University Shanghai Cancer Center, Shanghai 200000, China
| | - Kewen Chen
- Department of Organ Transplantation, Shanghai Changhai Hospital, Navy Medical University, Shanghai 200433, China
| | - Luyao Wang
- Key Laboratory of Optical Technology and Instrument for Medicine, Ministry of Education, University of Shanghai for Science and Technology, Shanghai 200093, China
| | - Hongyi Liu
- Key Laboratory of Optical Technology and Instrument for Medicine, Ministry of Education, University of Shanghai for Science and Technology, Shanghai 200093, China
| | - Pei Ma
- Key Laboratory of Optical Technology and Instrument for Medicine, Ministry of Education, University of Shanghai for Science and Technology, Shanghai 200093, China
| | - Li Zeng
- Department of Organ Transplantation, Shanghai Changhai Hospital, Navy Medical University, Shanghai 200433, China
| | - Xuedian Zhang
- Key Laboratory of Optical Technology and Instrument for Medicine, Ministry of Education, University of Shanghai for Science and Technology, Shanghai 200093, China
| | - Mingxing Sui
- Department of Organ Transplantation, Shanghai Changhai Hospital, Navy Medical University, Shanghai 200433, China.
| | - Hui Chen
- Key Laboratory of Optical Technology and Instrument for Medicine, Ministry of Education, University of Shanghai for Science and Technology, Shanghai 200093, China.
| |
Collapse
|
9
|
Biswas A, Chaudhari SR. Exploring the role of NIR spectroscopy in quantifying and verifying honey authenticity: A review. Food Chem 2024; 445:138712. [PMID: 38364494 DOI: 10.1016/j.foodchem.2024.138712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 01/19/2024] [Accepted: 02/06/2024] [Indexed: 02/18/2024]
Abstract
Honey, recognized for its diverse flavors and nutritional benefits, confronts challenges in maintaining authenticity and quality due to factors like adulteration and mislabelling. This review undertakes a comprehensive exploration of the utility of Near-Infrared (NIR) spectroscopy as a non-destructive analytical method for concurrently evaluating both honey quantity and authenticity. The primary purpose of this investigation is to delve into the various applications of NIR spectroscopy in honey analysis, with a specific focus on its capability to identify and quantify significant quality parameters such as sugar content, moisture levels, 5-HMF, and proline content. Results from the study underscore the effectiveness of NIR spectroscopy, especially when integrated with advanced chemometrics models. This combination not only facilitates quantification of diverse quality parameters but also enhances the classification of honey based on geographical and botanical origin. The technology emerges as a potent tool for detecting adulteration, addressing critical challenges in preserving the authenticity and quality of honey products. The impact of this critical analysis extends to shedding light on the current state, challenges, and future prospects of applying NIR spectroscopy in the honey industry. This analysis outlines the current challenges and future prospects of NIR spectroscopy in the honey industry. Emphasizing its potential to improve consumer confidence and food safety, the research has broader implications for authenticity and quality assurance in honey. Integrating NIR spectroscopy into industry practices could establish stronger quality control measures, benefiting both producers and consumers globally.
Collapse
Affiliation(s)
- Anisha Biswas
- Department of Plantation Products, Spices and Flavour Technology, CSIR-Central Food Technological Research Institute, Mysuru, Karnataka 570020, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Sachin R Chaudhari
- Department of Plantation Products, Spices and Flavour Technology, CSIR-Central Food Technological Research Institute, Mysuru, Karnataka 570020, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India.
| |
Collapse
|
10
|
Mariod AA, Tahir HE. Metabolic and elemental profiling as potential discriminating features among the black mahlab seeds (Monechma ciliatum) grown in three different regions. PHYTOCHEMICAL ANALYSIS : PCA 2024; 35:1063-1071. [PMID: 38431984 DOI: 10.1002/pca.3341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 01/01/2024] [Accepted: 02/09/2024] [Indexed: 03/05/2024]
Abstract
INTRODUCTION Black mahlab (Monechma ciliatum) seed is a rich source of metabolites and minerals and is usually believed to have a similar composition between different areas of cultivation. Until now, no studies have assessed changes in black mahlab seeds (BMSs) to determine those constituents that help to discriminate them according to geographical origin. OBJECTIVES The present study attempted to compare the metabolomics and elemental profiles of BMSs of different geographical origins and identified the potential markers using ultrahigh-performance liquid chromatography quadrupole Orbitrap tandem mass spectrometry (UHPLC-Q-Orbitrap-MS2), and inductively coupled plasma mass spectrometry (ICP-MS) techniques and established the chemometric model to identify the potential markers and discriminate them according to cultivation sites. MATERIAL AND METHODS In this work, data from metabolites analysis by UHPLC-Q-Orbitrap-MS2 and multi-elemental data obtained from ICP-MS were combined with chemometrics for tracing the geographical origin of BMSs. Principal component analysis (PCA) was used to evaluate the overall grouping of samples. In contrast, partial least squares-discriminant analysis (PLS-DA) and orthogonal partial least squares discriminant analysis (OPLS-DA) were employed for authentication. RESULTS PLS-DA and OPLS-DA models were fully validated (R2Y and Q2 values > 0.5). Variable importance of various projections was applied to obtain valuable data about differential elements (seven markers were identified) and metabolites (23 markers were identified) with high discrimination potential. The outcomes presented in this study serve as an appropriate framework for developing novel discrimination approaches in food origin screening.
Collapse
Affiliation(s)
- Abdalbasit Adam Mariod
- College of Sciences and Arts - Alkamil, University of Jeddah, Alkamil, Saudi Arabia
- Indigenous Knowledge and Heritage Center at Ghibaish College of Science and Technology in Ghibaish, Ghibaish, Sudan
| | - Haroon Elrasheid Tahir
- School of Food and Biological Engineering, Jiangsu University, Zhenjiang, Jiangsu, China
| |
Collapse
|
11
|
Cao X, Huang J, Chen J, Niu Y, Wei S, Tong H, Wu M, Yang Y. Rapid Measurement of Antioxidant Properties of Dendrobium officinale Using Near-Infrared Spectroscopy and Chemometrics. Foods 2024; 13:1769. [PMID: 38890997 PMCID: PMC11171845 DOI: 10.3390/foods13111769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 05/30/2024] [Accepted: 05/31/2024] [Indexed: 06/20/2024] Open
Abstract
Dendrobium officinale (D. officinale), often used as a dual-use plant with herbal medicine and food applications, has attracted considerable attention for health-benefiting components and wide economic value. The antioxidant ability of D. officinale is of great significance to ensure its health care value and safeguard consumers' interests. However, the common analytical methods for evaluating the antioxidant ability of D. officinale are time-consuming, laborious, and costly. In this study, near-infrared (NIR) spectroscopy and chemometrics were employed to establish a rapid and accurate method for the determination of 2,2'-azinobis-3-ethylbenzothiazoline-6-sulfonic acid (ABTS) scavenging capacity, 2,2-diphenyl-1-picrylhydrazyl (DPPH) scavenging capacity, and ferric reducing antioxidant power (FRAP) in D. officinale. The quantitative models were developed based on the partial least squares (PLS) algorithm. Two wavelength selection methods, namely the genetic algorithm (GA) and competitive adaptive reweighted sampling (CARS) method, were used for model optimization. The CARS-PLS models exhibited superior predictive performance compared to other PLS models. The root mean square errors of cross-validation (RMSECVs) for ABTS, FRAP, and DPPH were 0.44%, 2.64 μmol/L, and 2.06%, respectively. The results demonstrated the potential application of NIR spectroscopy combined with the CARS-PLS model for the rapid prediction of antioxidant activity in D. officinale. This method can serve as an alternative to conventional analytical methods for efficiently quantifying the antioxidant properties in D. officinale.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Yue Yang
- Zhejiang Provincial Key Laboratory for Water Environment and Marine Biological Resources Protection, College of Life and Environmental Science, Wenzhou University, Wenzhou 325035, China; (X.C.); (J.H.); (J.C.); (Y.N.); (S.W.); (H.T.); (M.W.)
| |
Collapse
|
12
|
Dong X, Yan X, Wan Y, Gao D, Jiao J, Wang H, Qu H. Enhancing real-time cell culture monitoring: Automated Raman model optimization with Taguchi method. Biotechnol Bioeng 2024; 121:1831-1845. [PMID: 38454569 DOI: 10.1002/bit.28688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 11/18/2023] [Accepted: 02/20/2024] [Indexed: 03/09/2024]
Abstract
Raman spectroscopy has found widespread usage in monitoring cell culture processes both in research and practical applications. However, commonly, preprocessing methods, spectral regions, and modeling parameters have been chosen based on experience or trial-and-error strategies. These choices can significantly impact the performance of the models. There is an urgent need for a simple, effective, and automated approach to determine a suitable procedure for constructing accurate models. This paper introduces the adoption of a design of experiment (DoE) method to optimize partial least squares models for measuring the concentration of different components in cell culture bioreactors. The experimental implementation utilized the orthogonal test table L25(56). Within this framework, five factors were identified as control variables for the DoE method: the window width of Savitzky-Golay smoothing, the baseline correction method, the order of preprocessing steps, spectral regions, and the number of latent variables. The evaluation method for the model was considered as a factor subject to noise. The optimal combination of levels was determined through the signal-to-noise ratio response table employing Taguchi analysis. The effectiveness of this approach was validated through two cases, involving different cultivation scales, different Raman spectrometers, and different analytical components. The results consistently demonstrated that the proposed approach closely approximated the global optimum, regardless of data set size, predictive components, or the brand of Raman spectrometer. The performance of models recommended by the DoE strategy consistently surpassed those built using raw data, underscoring the reliability of models generated through this approach. When compared to exhaustive all-combination experiments, the DoE approach significantly reduces calculation times, making it highly practical for the implementation of Raman spectroscopy in bioprocess monitoring.
Collapse
Affiliation(s)
- Xiaoxiao Dong
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| | - Xu Yan
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
- Hisun Biopharmaceutical Co. Ltd., Hangzhou, China
| | - Yuxiang Wan
- Hisun Biopharmaceutical Co. Ltd., Hangzhou, China
| | - Dong Gao
- Hisun Biopharmaceutical Co. Ltd., Hangzhou, China
| | - Jingyu Jiao
- Hisun Biopharmaceutical Co. Ltd., Hangzhou, China
| | - Haibin Wang
- Hisun Biopharmaceutical Co. Ltd., Hangzhou, China
| | - Haibin Qu
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| |
Collapse
|
13
|
Bian X, Liu Y, Zhang R, Sun H, Liu P, Tan X. Rapid quantification of grapeseed oil multiple adulterations using near-infrared spectroscopy coupled with a novel double ensemble modeling method. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2024; 311:124016. [PMID: 38354676 DOI: 10.1016/j.saa.2024.124016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 02/04/2024] [Accepted: 02/07/2024] [Indexed: 02/16/2024]
Abstract
As a high-quality edible oil, grapeseed oil is often adulterated with low-price/quality vegetable oils. A novel ensemble modeling method is proposed for quantitative analysis of grapeseed oil adulterations combined with near-infrared (NIR) spectroscopy. The method combines Monte Carlo (MC) sampling and whale optimization algorithm (WOA) to build numerous partial least squares (PLS) sub-models, named MC-WOA-PLS. A total of 80 adulterated grapeseed oil samples were prepared by mixing grapeseed oil with soybean oil, palm oil, cottonseed oil, and corn oil with the designed mass percentages. NIR spectra of the 80 samples were measured in a transmittance mode in the range of 12,000-4000 cm-1. Parameters in MC-WOA-PLS including the number of latent variables (LVs) in PLS, iteration number of WOA, whale number, number of PLS sub-models, and percentage of training subsets were optimized. To validate the prediction performance of the model, root mean square error of calibration (RMSEC), root mean square error of cross-validation (RMSECV), root mean squared error of prediction (RMSEP), correlation coefficient (R), residual predictive deviation (RPD), and standard deviation (S.D.) were used. Compared with PLS, standard normal variate-PLS (SNV-PLS), uninformative variable elimination-PLS (UVE-PLS), Monte Carlo uninformative variable elimination-PLS (MCUVE-PLS), randomization test-PLS (RT-PLS), variable importance in projection-PLS (VIP-PLS), and WOA-PLS, MC-WOA-PLS achieves the best prediction accuracy and stability for quantification of the five pure oils in adulterated grapeseed oil samples.
Collapse
Affiliation(s)
- Xihui Bian
- State Key Laboratory of Separation Membranes and Membrane Processes, School of Chemical Engineering and Technology, Tiangong University, Tianjin 300387, PR China; NMPA Key Laboratory for Technology Research and Evaluation of Drug Products, Shandong University, Jinan 250012, PR China.
| | - Yuxia Liu
- State Key Laboratory of Separation Membranes and Membrane Processes, School of Chemical Engineering and Technology, Tiangong University, Tianjin 300387, PR China
| | - Rongling Zhang
- State Key Laboratory of Separation Membranes and Membrane Processes, School of Chemical Engineering and Technology, Tiangong University, Tianjin 300387, PR China
| | - Hao Sun
- State Key Laboratory of Separation Membranes and Membrane Processes, School of Chemical Engineering and Technology, Tiangong University, Tianjin 300387, PR China
| | - Peng Liu
- State Key Laboratory of Separation Membranes and Membrane Processes, School of Chemical Engineering and Technology, Tiangong University, Tianjin 300387, PR China
| | - Xiaoyao Tan
- State Key Laboratory of Separation Membranes and Membrane Processes, School of Chemical Engineering and Technology, Tiangong University, Tianjin 300387, PR China
| |
Collapse
|
14
|
Sitorus A, Lapcharoensuk R. Exploring Deep Learning to Predict Coconut Milk Adulteration Using FT-NIR and Micro-NIR Spectroscopy. SENSORS (BASEL, SWITZERLAND) 2024; 24:2362. [PMID: 38610572 PMCID: PMC11014270 DOI: 10.3390/s24072362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 04/04/2024] [Accepted: 04/06/2024] [Indexed: 04/14/2024]
Abstract
Accurately identifying adulterants in agriculture and food products is associated with preventing food safety and commercial fraud activities. However, a rapid, accurate, and robust prediction model for adulteration detection is hard to achieve in practice. Therefore, this study aimed to explore deep-learning algorithms as an approach to accurately identify the level of adulterated coconut milk using two types of NIR spectrophotometer, including benchtop FT-NIR and portable Micro-NIR. Coconut milk adulteration samples came from deliberate adulteration with corn flour and tapioca starch in the 1 to 50% range. A total of four types of deep-learning algorithm architecture that were self-modified to a one-dimensional framework were developed and tested to the NIR dataset, including simple CNN, S-AlexNET, ResNET, and GoogleNET. The results confirmed the feasibility of deep-learning algorithms for predicting the degree of coconut milk adulteration by corn flour and tapioca starch using NIR spectra with reliable performance (R2 of 0.886-0.999, RMSE of 0.370-6.108%, and Bias of -0.176-1.481). Furthermore, the ratio of percent deviation (RPD) of all algorithms with all types of NIR spectrophotometers indicates an excellent capability for quantitative predictions for any application (RPD > 8.1) except for case predicting tapioca starch, using FT-NIR by ResNET (RPD < 3.0). This study demonstrated the feasibility of using deep-learning algorithms and NIR spectral data as a rapid, accurate, robust, and non-destructive way to evaluate coconut milk adulterants. Last but not least, Micro-NIR is more promising than FT-NIR in predicting coconut milk adulteration from solid adulterants, and it is portable for in situ measurements in the future.
Collapse
Affiliation(s)
| | - Ravipat Lapcharoensuk
- Department of Agricultural Engineering, School of Engineering, King Mongkut’s Institute of Technology Ladkrabang, Bangkok 10520, Thailand
| |
Collapse
|
15
|
Francis F, Luz S, Wu H, Stock SJ, Townsend R. Machine learning on cardiotocography data to classify fetal outcomes: A scoping review. Comput Biol Med 2024; 172:108220. [PMID: 38489990 DOI: 10.1016/j.compbiomed.2024.108220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 02/02/2024] [Accepted: 02/25/2024] [Indexed: 03/17/2024]
Abstract
INTRODUCTION Uterine contractions during labour constrict maternal blood flow and oxygen delivery to the developing baby, causing transient hypoxia. While most babies are physiologically adapted to withstand such intrapartum hypoxia, those exposed to severe hypoxia or with poor physiological reserves may experience neurological injury or death during labour. Cardiotocography (CTG) monitoring was developed to identify babies at risk of hypoxia by detecting changes in fetal heart rate (FHR) patterns. CTG monitoring is in widespread use in intrapartum care for the detection of fetal hypoxia, but the clinical utility is limited by a relatively poor positive predictive value (PPV) of an abnormal CTG and significant inter and intra observer variability in CTG interpretation. Clinical risk and human factors may impact the quality of CTG interpretation. Misclassification of CTG traces may lead to both under-treatment (with the risk of fetal injury or death) or over-treatment (which may include unnecessary operative interventions that put both mother and baby at risk of complications). Machine learning (ML) has been applied to this problem since early 2000 and has shown potential to predict fetal hypoxia more accurately than visual interpretation of CTG alone. To consider how these tools might be translated for clinical practice, we conducted a review of ML techniques already applied to CTG classification and identified research gaps requiring investigation in order to progress towards clinical implementation. MATERIALS AND METHOD We used identified keywords to search databases for relevant publications on PubMed, EMBASE and IEEE Xplore. We used Preferred Reporting Items for Systematic Review and Meta-Analysis for Scoping Reviews (PRISMA-ScR). Title, abstract and full text were screened according to the inclusion criteria. RESULTS We included 36 studies that used signal processing and ML techniques to classify CTG. Most studies used an open-access CTG database and predominantly used fetal metabolic acidosis as the benchmark for hypoxia with varying pH levels. Various methods were used to process and extract CTG signals and several ML algorithms were used to classify CTG. We identified significant concerns over the practicality of using varying pH levels as the CTG classification benchmark. Furthermore, studies needed to be more generalised as most used the same database with a low number of subjects for an ML study. CONCLUSION ML studies demonstrate potential in predicting fetal hypoxia from CTG. However, more diverse datasets, standardisation of hypoxia benchmarks and enhancement of algorithms and features are needed for future clinical implementation.
Collapse
Affiliation(s)
| | | | - Honghan Wu
- Institute of Health Informatics, University College London, UK
| | | | | |
Collapse
|
16
|
Makni Y, Diallo T, Guérin T, Parinet J. A proof-of-concept study on the versatility of liquid chromatography coupled to high-resolution mass spectrometry to screen for various contaminants and highlight markers of floral and geographical origin for different honeys. Food Chem 2024; 436:137720. [PMID: 37844510 DOI: 10.1016/j.foodchem.2023.137720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2023] [Revised: 09/29/2023] [Accepted: 10/08/2023] [Indexed: 10/18/2023]
Abstract
The high-resolution mass spectrometry is a powerful analytical tool for improving food safety and authenticity, but still underused in official control laboratories. The present work is a proof-of-concept study overviewing how liquid-chromatography coupled to high-resolution mass spectrometry could be used simultaneously for large-scale screening of contaminants and differentiation of honey samples. Within this study, the samples were extracted using all-in-one QuEChERS-based protocol that allowed for analysis of various anthropogenic contaminants and endogenous compounds. First, targeted-analysis of 52 honey samples led to unequivocal identification of 23 chemicals, including neonicotinoids, triazole fungicides and synergist. Then, suspect-screening using MSDial software allowed for tentative identification of 30 chemicals including plasticizers, flame-retardants and additives. Suspect-screening also made it possible to highlight tentative markers of chestnut honey (deoxyvasicinone, 2-quinolone, indoleacrylic acid and kynurenic acid) and citrus honey (caffeine, 2-oxindole and indole-3-carbinol). Lastly, non-targeted analysis enabled to separate honeys by their type, floral and geographical origins.
Collapse
Affiliation(s)
- Yassine Makni
- University Paris Est Creteil, ANSES, Laboratory for Food Safety, Pesticides and Marine Biotoxins Unit, 14 rue Pierre et Marie Curie, F-94701 Maisons-Alfort, France
| | - Thierno Diallo
- University Paris Est Creteil, ANSES, Laboratory for Food Safety, Pesticides and Marine Biotoxins Unit, 14 rue Pierre et Marie Curie, F-94701 Maisons-Alfort, France; Littoral Environnement et Sociétés (LIENSs), UMR 7266, CNRS-Université de La Rochelle, 2 rue Olympe de Gouges, F-17042 La Rochelle Cedex 01, France
| | - Thierry Guérin
- ANSES, Strategy and Programmes Department, F-94701 Maisons-Alfort, France
| | - Julien Parinet
- University Paris Est Creteil, ANSES, Laboratory for Food Safety, Pesticides and Marine Biotoxins Unit, 14 rue Pierre et Marie Curie, F-94701 Maisons-Alfort, France.
| |
Collapse
|
17
|
Kočiščáková Z, Král M, Jeništová A. Detection of fragrances on the skin and study of their interaction using infrared and Raman spectroscopy. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2024; 308:123698. [PMID: 38043296 DOI: 10.1016/j.saa.2023.123698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 11/23/2023] [Accepted: 11/26/2023] [Indexed: 12/05/2023]
Abstract
Nowadays, fragrances belong to the widely used cosmetics. Their composition is designed in a way that it evolves and changes over time. In this work, the effect of fragrances on the skin was studied - the interactions between pig skin samples and fragrances and the possibility of their detection and mutual differentiation. Non-invasive techniques of vibrational spectroscopy were used to obtain the data, namely FT-IR spectroscopy with attenuated total reflection accessory and Raman microspectroscopy. Vibrational spectra were measured within 8 h with different time intervals and after 22 h from the application of fragrance for FT-IR and Raman measurements, respectively. The obtained spectra were pre‑processed and subsequently evaluated by multivariate statistical methods. The study showed that skin treated by fragrances is well distinguishable from untreated skin, even after 22 h. In addition, it is possible to differentiate individual fragrances from each other; therefore, the use of spectroscopical techniques could be a potential tool for forensic analysis of fragrances.
Collapse
Affiliation(s)
- Zuzana Kočiščáková
- Department of Physical Chemistry, Faculty of Chemical Engineering, University of Chemistry and Technology, Technická 5, 166 28 Prague 6 - Dejvice, Czech Republic
| | - Martin Král
- Department of Physical Chemistry, Faculty of Chemical Engineering, University of Chemistry and Technology, Technická 5, 166 28 Prague 6 - Dejvice, Czech Republic.
| | - Adéla Jeništová
- Department of Physical Chemistry, Faculty of Chemical Engineering, University of Chemistry and Technology, Technická 5, 166 28 Prague 6 - Dejvice, Czech Republic.
| |
Collapse
|
18
|
Kitazoe T, Usui C, Kodaira E, Maruyama T, Kawano N, Fuchino H, Yamamoto K, Kitano Y, Kawahara N, Yoshimatsu K, Shirahata T, Kobayashi Y. Improved quantitative analysis of tenuifolin using hydrolytic continuous-flow system to build prediction models for its content based on near-infrared spectroscopy. J Nat Med 2024; 78:296-311. [PMID: 38172356 DOI: 10.1007/s11418-023-01764-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Accepted: 11/14/2023] [Indexed: 01/05/2024]
Abstract
This study used two types of analyses and statistical calculations on powdered samples of Polygala root (PR) and Senega root (SR): (1) determination of saponin content by an independently developed quantitative analysis of tenuifolin content using a flow reactor, and (2) near-infrared spectroscopy (NIR) using crude drug powders as direct samples for metabolic profiling. Furthermore, a prediction model for tenuifolin content was developed and validated using multivariate analysis based on the results of (1) and (2). The goal of this study was to develop a rapid analytical method utilizing the saponin content and explore the possibility of quality control through a wide-area survey of crude drugs using NIR spectroscopy. Consequently, various parameters and appropriate wavelengths were examined in the regression analysis, and a model with a reasonable contribution rate and prediction accuracy was successfully developed. In this case, the wavenumber contributing to the model was consistent with that of tenuifolin, confirming that this model was based on saponin content. In this series of analyses, we have succeeded in developing a model that can quickly estimate saponin content without post-processing and have demonstrated a brief way to perform quality control of crude drugs in the clinical field and on the market.
Collapse
Affiliation(s)
- Tatsuki Kitazoe
- School of Pharmacy, Kitasato University, 5-9-1 Shirokane, Minato-ku, Tokyo, 108-8641, Japan
| | - Chisato Usui
- School of Pharmacy, Kitasato University, 5-9-1 Shirokane, Minato-ku, Tokyo, 108-8641, Japan
| | - Eiichi Kodaira
- School of Pharmacy, Kitasato University, 5-9-1 Shirokane, Minato-ku, Tokyo, 108-8641, Japan
| | - Takuro Maruyama
- Division of Pharmacognosy, Phytochemistry and Narcotics, National Institute of Health Sciences, 3-25-26, Tonomachi, Kawasaki-ku, Kawasaki, Kanagawa, 210-9501, Japan
| | - Noriaki Kawano
- National Institutes of Biomedical Innovation, Health and Nutrition, 1-2 Hachimandai, Tsukuba, Ibaraki, 305-0843, Japan
| | - Hiroyuki Fuchino
- National Institutes of Biomedical Innovation, Health and Nutrition, 1-2 Hachimandai, Tsukuba, Ibaraki, 305-0843, Japan
| | - Kazuhiko Yamamoto
- National Institutes of Biomedical Innovation, Health and Nutrition, 1-2 Hachimandai, Tsukuba, Ibaraki, 305-0843, Japan
| | - Yasushi Kitano
- Nippon Funmatsu Yakuhin Co., Ltd, 2-5-11, Doshomachi, Chuo-ku, Osaka, 541-0045, Japan
| | - Nobuo Kawahara
- National Institutes of Biomedical Innovation, Health and Nutrition, 1-2 Hachimandai, Tsukuba, Ibaraki, 305-0843, Japan
- The Kochi Prefectural Makino Botanical Garden, Godaisan, Kochi, 781-8125, Japan
| | - Kayo Yoshimatsu
- National Institutes of Biomedical Innovation, Health and Nutrition, 1-2 Hachimandai, Tsukuba, Ibaraki, 305-0843, Japan
| | - Tatsuya Shirahata
- School of Pharmacy, Kitasato University, 5-9-1 Shirokane, Minato-ku, Tokyo, 108-8641, Japan
| | - Yoshinori Kobayashi
- School of Pharmacy, Kitasato University, 5-9-1 Shirokane, Minato-ku, Tokyo, 108-8641, Japan.
| |
Collapse
|
19
|
Ma C, Zhai L, Ding J, Liu Y, Hu S, Zhang T, Tang H, Li H. Raman spectroscopy combined with partial least squares (PLS) based on hybrid spectral preprocessing and backward interval PLS (biPLS) for quantitative analysis of four PAHs in oil sludge. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2024; 310:123953. [PMID: 38290282 DOI: 10.1016/j.saa.2024.123953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 12/19/2023] [Accepted: 01/21/2024] [Indexed: 02/01/2024]
Abstract
Polycyclic aromatic hydrocarbons (PAHs) contained in a large amount of oily sludge produced in petroleum and petrochemical production has become one of the main environmental protection concerns in the industry. The accurate determination of PAHs is of great significance in the field of petroleum geochemistry and environmental protection. In this study, Raman spectroscopy combined with partial least squares (PLS) based on different hybrid spectral preprocessing methods and variable selection strategies was proposed for quantitative analysis of phenanthrene, fluoranthrene, fluorene and naphthalene (Phe, Flt, Flu and Nap) in oil sludge. At first, PAHs in oily sludge was extracted by solid-liquid extraction with methanol as extractant, and Raman spectra of 21 oily sludge samples were collected by portable Raman spectrometer. And then, the influence of first derivative (D1st), wavelet transform (WT) and their hybrid spectral preprocessing on the predictive performance of the PLS calibration model was discussed. Thirdly, biPLS (backward interval partial least squares) was used to optimize the input variables before and after the hybrid spectral preprocessing methods, and the influence of biPLS and the hybrid spectral preprocessing sequence on the predictive performance of the PLS calibration model was discussed. Finally, the predictive performance of the PLS calibration model was optimized according to the results of leave-one-out cross-validation (LOOCV) method. The results show that the biPLS-D1st-WT-PLS calibration model established by using biPLS first to select the characteristic variables, followed by hybrid spectral preprocessing of the characteristic variables, has better prediction performance for Flt (determination coefficient of prediction (R2P) = 0.9987, and the mean relative error of prediction (MREP) = 0.0606). For Phe, Flu and Nap, the WT-biPLS-PLS calibration model has a better predictive effect (R2P are 0.9995, 0.9996 and 0.9983, and MREP are 0.0426, 0.0719 and 0.0497, respectively). In general, portable Raman spectroscopy combined with PLS calibration model based on different hybrid spectral preprocessing and variable selection strategies has achieved good prediction results for quantitative analysis of four PAHs in oily sludge. It is a new strategy to firstly select the characteristic variables of the original spectra, and secondly to preprocess the characteristic variables by the hybrid spectral preprocessing, which will provide a new idea for the establishment of quantitative analysis methods for PAHs in oily sludge.
Collapse
Affiliation(s)
- Changfei Ma
- Key Laboratory of Synthetic and Natural Functional Molecular of the Ministry of Education, College of Chemistry & Material Science, Northwest University, Xi'an 710127, China
| | - Lulu Zhai
- Key Laboratory of Synthetic and Natural Functional Molecular of the Ministry of Education, College of Chemistry & Material Science, Northwest University, Xi'an 710127, China
| | - Jianming Ding
- Key Laboratory of Synthetic and Natural Functional Molecular of the Ministry of Education, College of Chemistry & Material Science, Northwest University, Xi'an 710127, China
| | - Yanli Liu
- HBIS Materials Technology Research Institute, Shijiazhuang, Hebei 050000, China
| | - Shunfan Hu
- Key Laboratory of Synthetic and Natural Functional Molecular of the Ministry of Education, College of Chemistry & Material Science, Northwest University, Xi'an 710127, China
| | - Tianlong Zhang
- Key Laboratory of Synthetic and Natural Functional Molecular of the Ministry of Education, College of Chemistry & Material Science, Northwest University, Xi'an 710127, China
| | - Hongsheng Tang
- Key Laboratory of Synthetic and Natural Functional Molecular of the Ministry of Education, College of Chemistry & Material Science, Northwest University, Xi'an 710127, China.
| | - Hua Li
- Key Laboratory of Synthetic and Natural Functional Molecular of the Ministry of Education, College of Chemistry & Material Science, Northwest University, Xi'an 710127, China; College of Chemistry and Chemical Engineering, Xi'an Shiyou University, Xi'an 710065, China.
| |
Collapse
|
20
|
Jongyingcharoen JS, Howimanporn S, Sitorus A, Phanomsophon T, Posom J, Salubsi T, Kongwaree A, Lim CH, Phetpan K, Sirisomboon P, Tsuchikawa S. Classification of the Crosslink Density Level of Para Rubber Thick Film of Medical Glove by Using Near-Infrared Spectral Data. Polymers (Basel) 2024; 16:184. [PMID: 38256982 PMCID: PMC10818871 DOI: 10.3390/polym16020184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 12/20/2023] [Accepted: 12/20/2023] [Indexed: 01/24/2024] Open
Abstract
Classification of the crosslink density level of para rubber medical gloves by using near-infrared spectral data combined with machine learning is the first time reported in this paper. The spectra of medical glove samples with different crosslink densities acquired by an ultra-compact portable MicroNIR spectrometer were correlated with their crosslink density levels, which were referencely evaluated by the toluene swell index (TSI). The machine learning protocols used to classify the 3 groups of TSI were specified as less than 80% TSI, 80-88% TSI, and more than 88% TSI. The 80-88% TSI group was the group in which the compounded latex was suitable for medical glove production, which made the glove specification comply with the requirements of customers as indicated by the tensile test. The results show that when comparing the algorithms used for modeling, the linear discriminant analysis (LDA) developed by 2nd derivative spectra with 15 k-best selected wavelengths fairly accurately predicted the class but was most reliable among other algorithms, i.e., artificial neural networks (ANN), support vector machines (SVM), and k-nearest neighbors (kNN), due to higher prediction accuracy, precision, recall, and F1-score of the same value of 0.76 and no overfitting or underfitting prediction. This developed model can be implemented in the glove factory for screening purposes in the production line. However, deep learning modeling should be explored with a larger sample number required for better model performance.
Collapse
Affiliation(s)
- Jiraporn Sripinyowanich Jongyingcharoen
- Department of Agricultural Engineering, School of Engineering, King Mongkut’s Institute of Technology Ladkrabang, Bangkok 10520, Thailand; (J.S.J.); (S.H.); (T.P.); (P.S.)
| | - Suppakit Howimanporn
- Department of Agricultural Engineering, School of Engineering, King Mongkut’s Institute of Technology Ladkrabang, Bangkok 10520, Thailand; (J.S.J.); (S.H.); (T.P.); (P.S.)
| | - Agustami Sitorus
- Department of Agricultural Engineering, School of Engineering, King Mongkut’s Institute of Technology Ladkrabang, Bangkok 10520, Thailand; (J.S.J.); (S.H.); (T.P.); (P.S.)
- National Research and Innovation Agency (BRIN), Jakarta Pusat 10340, Indonesia
| | - Thitima Phanomsophon
- Department of Agricultural Engineering, School of Engineering, King Mongkut’s Institute of Technology Ladkrabang, Bangkok 10520, Thailand; (J.S.J.); (S.H.); (T.P.); (P.S.)
| | - Jetsada Posom
- Department of Agricultural Engineering, Faculty of Engineering, Khon Kaen University, Khon Kaen 40002, Thailand
| | - Thanapol Salubsi
- W. A. Rubber Mate Co., Ltd., Bangkok 10240, Thailand; (T.S.); (A.K.)
| | - Adisak Kongwaree
- W. A. Rubber Mate Co., Ltd., Bangkok 10240, Thailand; (T.S.); (A.K.)
| | - Chin Hock Lim
- Thai Rubber Latex Group Public Co., Ltd., Chonburi 20190, Thailand;
| | - Kittisak Phetpan
- Department of Engineering, King Mongkut’s Institute of Technology Ladkrabang, Prince of Chumphon Campus, Chumphon 86160, Thailand;
| | - Panmanas Sirisomboon
- Department of Agricultural Engineering, School of Engineering, King Mongkut’s Institute of Technology Ladkrabang, Bangkok 10520, Thailand; (J.S.J.); (S.H.); (T.P.); (P.S.)
| | - Satoru Tsuchikawa
- Graduate School of Bioagricultural Sciences, Nagoya University, Nagoya 464-8601, Japan;
| |
Collapse
|
21
|
Cozzolino D, Chapman J. Advances, limitations, and considerations on the use of vibrational spectroscopy towards the development of management decision tools in food safety. Anal Bioanal Chem 2024; 416:611-620. [PMID: 37542534 DOI: 10.1007/s00216-023-04849-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Revised: 07/05/2023] [Accepted: 07/06/2023] [Indexed: 08/07/2023]
Abstract
Food safety and food security are two of the main concerns for the modern food manufacturing industry. Disruptions in the food supply and value chains have created the need to develop agile screening tools that will allow the detection of food pathogens, spoilage microorganisms, microbial contaminants, toxins, herbicides, and pesticides in agricultural commodities, natural products, and food ingredients. Most of the current routine analytical methods used to detect and identify microorganisms, herbicides, and pesticides in food ingredients and products are based on the use of reliable and robust immunological, microbiological, and biochemical techniques (e.g. antigen-antibody interactions, extraction and analysis of DNA) and chemical methods (e.g. chromatography). However, the food manufacturing industries are demanding agile and affordable analytical methods. The objective of this review is to highlight the advantages and limitations of the use of vibrational spectroscopy combined with chemometrics as proxy to evaluate and quantify herbicides, pesticides, and toxins in foods.
Collapse
Affiliation(s)
- Daniel Cozzolino
- The University of Queensland, Centre for Nutrition and Food Sciences, Queensland Alliance for Agriculture and Food Innovation, St. Lucia, Brisbane, QLD, 4072, Australia.
| | - James Chapman
- School of Science, RMIT University, GPO Box 2476, Melbourne, VIC, 3001, Australia
| |
Collapse
|
22
|
Wang S, Lin M, Meng Y, Jiang T, Fan F, Wang S. Self-expansion full information optimization strategy: Convenient and efficient method for near infrared spectrum auto-analysis. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2023; 303:123224. [PMID: 37603976 DOI: 10.1016/j.saa.2023.123224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 07/06/2023] [Accepted: 07/31/2023] [Indexed: 08/23/2023]
Abstract
An essential step in the application of near infrared spectroscopy technology is the spectrum preprocessing. A reasonable implementation of it ensures that the effective spectral information is correctly extracted and, also that the model's accuracy is increased. However, some analysts' research still uses the manual approach of trial and error, particularly those less skilled ones. Previous papers have provided preprocessing optimization algorithms for NIR, but there are still some problems that need to be resolved, such as the unwieldy sequence determination of preprocessing method or, the fluctuated optimization outcomes or, lack of sufficient statistical information. This research suggests a spectrum auto-analysis methodology named self-expansion full information optimization strategy, a new powerful open-source technique for concurrently addressing all of these above issues simultaneously. For the first time in the field of chemometrics, this algorithm offers a reliable and effective automatic near infrared auto-modelling method based on the statistical informatics. With the aid of its built-in modules, such as information generators, spectrum processors, etc., it is able to fully search the common preprocessing techniques, which is determined by Monte Carlo cross validation. Then the final ensemble calibration model is built by employing the optimized preprocessing schemes, along with the wavelength variables screening algorithm. The optimization strategy can offer the user objective useful statistics information created throughout the modeling process to further examine the model's effectiveness. The results demonstrate that the suggested method can easily and successfully extract spectrum information and develop calibration models by putting it to the test on two groups of actual near-infrared spectral data. Additionally, this optimization strategy can also be applied to other spectrum analysis areas, such Raman spectroscopy or infrared spectroscopy, by changing a few of its parameters, and has extraordinary application value.
Collapse
Affiliation(s)
- Shenghao Wang
- School of Electronic and Information Engineering, Zhongyuan University of Technology, Zhengzhou, China.
| | - Manman Lin
- School of Electronic and Information Engineering, Zhongyuan University of Technology, Zhengzhou, China
| | - Yanhong Meng
- School of Electronic and Information Engineering, Zhongyuan University of Technology, Zhengzhou, China
| | - Tao Jiang
- School of Electronic and Information Engineering, Zhongyuan University of Technology, Zhengzhou, China
| | - Fuling Fan
- School of Electronic and Information Engineering, Zhongyuan University of Technology, Zhengzhou, China
| | - Shuanghong Wang
- School of Electronic and Information Engineering, Zhongyuan University of Technology, Zhengzhou, China
| |
Collapse
|
23
|
Zhou R, Chen X, Huang M, Chen H, Zhang L, Xu D, Wang D, Gao P, Wang B, Dai X. ATR-FTIR spectroscopy combined with chemometrics to assess the spectral markers of irradiated baijius and their potential application in irradiation dose control. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2023; 302:123162. [PMID: 37478760 DOI: 10.1016/j.saa.2023.123162] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 07/13/2023] [Accepted: 07/14/2023] [Indexed: 07/23/2023]
Abstract
Although some methods have been proposed for the identification of irradiated baijius, they are often costly, time-consuming, and destructive. It is also unclear what instrumentation can be used to fully characterize the quality changes in irradiated baijius. To address this issue, this study pioneers the use of attenuated total reflection Fourier transform infrared (ATR-FTIR) spectroscopy in combination with chemometrics to open up new avenues for characterizing irradiated baijius and their quality control. Principal component analysis, five spectral pre-processing methods (Savitzky-Golay smoothing (S-G), second-order derivative (SD), multiple scattering correction (MSC), S-G + SD and S-G + MSC), five wavelength selection methods (random forest variable importance (RFVI), two-dimensional correlation spectroscopy (2D-COS), variable importance in projection (VIP), ReliefF, and Venn), and three classification models (partial least squares-discriminant analysis (PLS-DA), random forest (RF), and grasshopper optimization algorithm-based support vector machine (GOA-SVM)) were integrated into the analytical framework of ATR-FTIR spectroscopy, aiming to accurately identify baijiu samples according to different irradiation doses and to search for irradiation-induced spectral difference characteristics (spectral markers). The results showed that SD was the best spectral pre-processing method, and RF models constructed using the 20 most competitive and discriminative spectral markers (selected by Venn) could achieve accurate identification of baijiu samples based on irradiation dose (0, 4, 6, and 8 kGy). After Pearson correlation analysis, the five significantly (P<0.05) changed spectral markers (1596, 2025, 2309, 2329, and 2380 cm-1) were attributed to changes in the content of total acids, alcohols, and aromatic compounds. These findings demonstrate for the first time the potential of ATR-FTIR spectroscopy as a fast, low-cost, and non-destructive tool for the characterization and identification of irradiated baijiu samples. This approach may also offer a promising solution for labeling management of irradiated foods, vintage identification of baijius, and brand protection.
Collapse
Affiliation(s)
- Rui Zhou
- College of Life Science and Engineering, Southwest University of Science and Technology, Mianyang 621010, Sichuan, PR China
| | - Xiaoming Chen
- College of Life Science and Engineering, Southwest University of Science and Technology, Mianyang 621010, Sichuan, PR China.
| | - Min Huang
- Sichuan Institute of Atomic Energy, Chengdu 610101, Sichuan, PR China
| | - Hao Chen
- Sichuan Institute of Atomic Energy, Chengdu 610101, Sichuan, PR China
| | - Lili Zhang
- College of Life Science and Engineering, Southwest University of Science and Technology, Mianyang 621010, Sichuan, PR China
| | - Defu Xu
- Luzhou Laojiao Co., Ltd, Luzhou 646699, Sichuan, PR China
| | - Dan Wang
- College of Life Science and Engineering, Southwest University of Science and Technology, Mianyang 621010, Sichuan, PR China
| | - Peng Gao
- Sichuan Institute of Atomic Energy, Chengdu 610101, Sichuan, PR China
| | - Bensheng Wang
- College of Life Science and Engineering, Southwest University of Science and Technology, Mianyang 621010, Sichuan, PR China
| | - Xiaoxue Dai
- Luzhou Laojiao Co., Ltd, Luzhou 646699, Sichuan, PR China
| |
Collapse
|
24
|
Szymoński K, Skirlińska-Nosek K, Lipiec E, Sofińska K, Czaja M, Wilkosz N, Krupa M, Wanat F, Ulatowska-Białas M, Adamek D. Combined analytical approach empowers precise spectroscopic interpretation of subcellular components of pancreatic cancer cells. Anal Bioanal Chem 2023; 415:7281-7295. [PMID: 37906289 PMCID: PMC10684650 DOI: 10.1007/s00216-023-04997-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 09/27/2023] [Accepted: 10/09/2023] [Indexed: 11/02/2023]
Abstract
The lack of specific and sensitive early diagnostic options for pancreatic cancer (PC) results in patients being largely diagnosed with late-stage disease, thus inoperable and burdened with high mortality. Molecular spectroscopic methodologies, such as Raman or infrared spectroscopies, show promise in becoming a leader in screening for early-stage cancer diseases, including PC. However, should such technology be introduced, the identification of differentiating spectral features between various cancer types is required. This would not be possible without the precise extraction of spectra without the contamination by necrosis, inflammation, desmoplasia, or extracellular fluids such as mucous that surround tumor cells. Moreover, an efficient methodology for their interpretation has not been well defined. In this study, we compared different methods of spectral analysis to find the best for investigating the biomolecular composition of PC cells cytoplasm and nuclei separately. Sixteen PC tissue samples of main PC subtypes (ductal adenocarcinoma, intraductal papillary mucinous carcinoma, and ampulla of Vater carcinoma) were collected with Raman hyperspectral mapping, resulting in 191,355 Raman spectra and analyzed with comparative methodologies, specifically, hierarchical cluster analysis, non-negative matrix factorization, T-distributed stochastic neighbor embedding, principal components analysis (PCA), and convolutional neural networks (CNN). As a result, we propose an innovative approach to spectra classification by CNN, combined with PCA for molecular characterization. The CNN-based spectra classification achieved over 98% successful validation rate. Subsequent analyses of spectral features revealed differences among PC subtypes and between the cytoplasm and nuclei of their cells. Our study establishes an optimal methodology for cancer tissue spectral data classification and interpretation that allows precise and cognitive studies of cancer cells and their subcellular components, without mixing the results with cancer-surrounding tissue. As a proof of concept, we describe findings that add to the spectroscopic understanding of PC.
Collapse
Affiliation(s)
- Krzysztof Szymoński
- Department of Pathomorphology, Medical College, Jagiellonian University, Kraków, Poland.
- Department of Pathomorphology, University Hospital, Kraków, Poland.
| | - Katarzyna Skirlińska-Nosek
- Faculty of Physics, Astronomy and Applied Computer Science, M. Smoluchowski Institute of Physics, Jagiellonian University, Kraków, Poland
- Doctoral School of Exact and Natural Sciences, Jagiellonian University, Kraków, Poland
| | - Ewelina Lipiec
- Faculty of Physics, Astronomy and Applied Computer Science, M. Smoluchowski Institute of Physics, Jagiellonian University, Kraków, Poland
| | - Kamila Sofińska
- Faculty of Physics, Astronomy and Applied Computer Science, M. Smoluchowski Institute of Physics, Jagiellonian University, Kraków, Poland
| | - Michał Czaja
- Faculty of Physics, Astronomy and Applied Computer Science, M. Smoluchowski Institute of Physics, Jagiellonian University, Kraków, Poland
- Doctoral School of Exact and Natural Sciences, Jagiellonian University, Kraków, Poland
| | - Natalia Wilkosz
- Faculty of Physics, Astronomy and Applied Computer Science, M. Smoluchowski Institute of Physics, Jagiellonian University, Kraków, Poland
- AGH University of Krakow, Faculty of Physics and Applied Computer Science, Kraków, Poland
| | - Matylda Krupa
- Department of Pathomorphology, Medical College, Jagiellonian University, Kraków, Poland
| | - Filip Wanat
- Department of Pathomorphology, Medical College, Jagiellonian University, Kraków, Poland
| | - Magdalena Ulatowska-Białas
- Department of Pathomorphology, Medical College, Jagiellonian University, Kraków, Poland
- Department of Pathomorphology, University Hospital, Kraków, Poland
| | - Dariusz Adamek
- Department of Pathomorphology, Medical College, Jagiellonian University, Kraków, Poland
| |
Collapse
|
25
|
Gaida M, Stefanuto PH, Focant JF. Theoretical modeling and machine learning-based data processing workflows in comprehensive two-dimensional gas chromatography-A review. J Chromatogr A 2023; 1711:464467. [PMID: 37871505 DOI: 10.1016/j.chroma.2023.464467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Revised: 10/15/2023] [Accepted: 10/17/2023] [Indexed: 10/25/2023]
Abstract
In recent years, comprehensive two-dimensional gas chromatography (GC × GC) has been gradually gaining prominence as a preferred method for the analysis of complex samples due to its higher peak capacity and resolution power compared to conventional gas chromatography (GC). Nonetheless, to fully benefit from the capabilities of GC × GC, a holistic approach to method development and data processing is essential for a successful and informative analysis. Method development enables the fine-tuning of the chromatographic separation, resulting in high-quality data. While generating such data is pivotal, it does not necessarily guarantee that meaningful information will be extracted from it. To this end, the first part of this manuscript reviews the importance of theoretical modeling in achieving good optimization of the separation conditions, ultimately improving the quality of the chromatographic separation. Multiple theoretical modeling approaches are discussed, with a special focus on thermodynamic-based modeling. The second part of this review highlights the importance of establishing robust data processing workflows, with a special emphasis on the use of advanced data processing tools such as, Machine Learning (ML) algorithms. Three widely used ML algorithms are discussed: Random Forest (RF), Support Vector Machine (SVM), and Partial Least Square-Discriminate Analysis (PLS-DA), highlighting their role in discovery-based analysis.
Collapse
Affiliation(s)
- Meriem Gaida
- Organic and Biological Analytical Chemistry Group (OBiAChem), MolSys Research Unit, Liège University, Belgium
| | - Pierre-Hugues Stefanuto
- Organic and Biological Analytical Chemistry Group (OBiAChem), MolSys Research Unit, Liège University, Belgium
| | - Jean-François Focant
- Organic and Biological Analytical Chemistry Group (OBiAChem), MolSys Research Unit, Liège University, Belgium
| |
Collapse
|
26
|
Li R, Gibson JM. Predicting Groundwater PFOA Exposure Risks with Bayesian Networks: Empirical Impact of Data Preprocessing on Model Performance. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:18329-18338. [PMID: 37594027 DOI: 10.1021/acs.est.3c00348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/19/2023]
Abstract
The plethora of data on PFASs in environmental samples collected in response to growing concern about these chemicals could enable the training of machine-learning models for predicting exposure risks. However, differences in sampling and analysis methods across data sets must be reconciled through data preprocessing, and little information is available about how such manipulations affect the resulting models. This study evaluates how data preprocessing influences machine-learned Bayesian network models of PFOA in groundwater. We link 19 years of PFOA measurements from Minnesota, USA, to publicly available information about potential PFOA sources and factors that may influence their environmental fate. Nine different preprocessing methods were tested, and the resulting data sets were used to train models to predict the probability of PFOA ≥ 35 ppt, the 2017 Minnesota health advisory level. Different preprocessing approaches produced varying model structures with significantly different accuracies. Nonetheless, models showed similar relationships between predictor variables and PFOA exposure risks, and all models were relatively accurate, distinguishing wells at high risk from those at low risk for 82.0% to 89.0% of test data samples. There was a trade-off between data quality and model performance since a stricter data screening strategy decreased the sample size for model training.
Collapse
Affiliation(s)
- Runwei Li
- Department of Civil Engineering, New Mexico State University, 3035 S Espina St, Las Cruces, New Mexico 88003, United States
- Department of Civil, Construction, and Environmental Engineering, North Carolina State University, 915 Partners Way, Raleigh, North Carolina 27606, United States
| | - Jacqueline MacDonald Gibson
- Department of Civil, Construction, and Environmental Engineering, North Carolina State University, 915 Partners Way, Raleigh, North Carolina 27606, United States
| |
Collapse
|
27
|
Ferreiro B, Leardi R, Farinini E, Andrade JM. Supervised classification combined with genetic algorithm variable selection for a fast identification of polymeric microdebris using infrared reflectance. MARINE POLLUTION BULLETIN 2023; 195:115540. [PMID: 37722263 DOI: 10.1016/j.marpolbul.2023.115540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 09/06/2023] [Accepted: 09/10/2023] [Indexed: 09/20/2023]
Abstract
Pollution caused by plastics and, in particular, microplastics has become a source of environmental concern for Society. Their ubiquity, with millions of tons of plastic debris spilled in both land and sea, requires efficient technological improvements in the ways residues are collected, handled, characterized and recycled. For reliable decision-making, dependable chemical information is essential to assess both the nature of the plastics found in the environment and their fate. In this work an efficient method to identify the polymeric composition of microplastic fragments is proposed. It combines infrared reflectance spectra and chemometric methods. A breakthrough result is that the models include polymers weathered under both dry (shoreline) and submerged (in sea water) conditions and, hence, they are very promising as a starting point for eventual practical applications. In addition, no spectral processing is required after the initial measurement. SYNOPSIS: This approach to identify microplastics in aquatic environments combines infrared measurements and multivariate data analysis to fight against (micro)plastic pollution.
Collapse
Affiliation(s)
- Borja Ferreiro
- Grupo Química Analítica Aplicada (QANAP), Faculty of Sciences, Universidade da Coruña, Campus da Zapateira, s/n, 15071 A Coruña, Spain
| | - Riccardo Leardi
- Department of Pharmacy, University of Genoa, viale Cembrano 4, 16148 Genoa, Italy
| | - Emanuele Farinini
- Department of Pharmacy, University of Genoa, viale Cembrano 4, 16148 Genoa, Italy
| | - Jose M Andrade
- Grupo Química Analítica Aplicada (QANAP), Faculty of Sciences, Universidade da Coruña, Campus da Zapateira, s/n, 15071 A Coruña, Spain.
| |
Collapse
|
28
|
Liu C, Xu F, Zuo Z, Wang Y. Network pharmacology and fingerprint for the integrated analysis of mechanism, identification and prediction in Panax notoginseng. PHYTOCHEMICAL ANALYSIS : PCA 2023; 34:772-787. [PMID: 36479744 DOI: 10.1002/pca.3195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 10/30/2022] [Accepted: 11/21/2022] [Indexed: 06/17/2023]
Abstract
INTRODUCTION Panax notoginseng (Burkill) F. H. Chen ex C. H. Chow, is a well-known herb with multitudinous efficacy. In this study, a series of overall analyses on the action mechanism, component content, origin identification, and content prediction of P. notoginseng are conducted. OBJECTIVES The purpose was to analyse the mechanism of pharmacological efficacy, differences between contents and groups of P. notoginseng from different origins, and to identify the origin and predict the content. MATERIALS AND METHODS The P. notoginseng samples from four different origins were used for analysis by the database, network pharmacology (Q-marker) and fingerprint analysis [high-performance liquid chromatography (HPLC), attenuated total reflectance Fourier-transform infrared (ATR-FTIR) and near-infrared (NIR)] combined with data fusion strategy (low- and feature-level). RESULTS Four saponins were identified as Q-markers, and exerted pharmacological effects on signalling pathways through 24 core targets. The qualitative and quantitative analysis of HPLC showed that there were differences among groups and different origins. Therefore, considering the need to treat diseases, combined with network database and network pharmacology, the suitable producing areas were determined through the mechanism of action and the required saponin content. The low-level data fusion successfully identified the origin and predicted the content of P. notoginseng from different origins. The accuracy rate of each evaluation index of the partial least squares discriminant analysis (PLS-DA) model was 1, and the t-SNE (t-distributed stochastic neighbor embedding) visualisation results were good. The coefficient of determination (R2 ) of the partial least squares regression (PLSR) model ranged from 0.9235-0.9996, and the root mean square error of cross-validation (RMSECV) and root mean square error of prediction (RMSEP) range is 0.301-1.519. CONCLUSION This study was designed to provide a sufficient theoretical basis for the quality control of P. notoginseng.
Collapse
Affiliation(s)
- Chunlu Liu
- Medicinal Plants Research Institute, Yunnan Academy of Agricultural Sciences, Kunming, Yunnan, P. R. China
- College of Traditional Chinese Medicine, Yunnan University of Chinese Medicine, Kunming, Yunnan, P. R. China
| | - Furong Xu
- College of Traditional Chinese Medicine, Yunnan University of Chinese Medicine, Kunming, Yunnan, P. R. China
| | - Zhitian Zuo
- Medicinal Plants Research Institute, Yunnan Academy of Agricultural Sciences, Kunming, Yunnan, P. R. China
| | - Yuanzhong Wang
- Medicinal Plants Research Institute, Yunnan Academy of Agricultural Sciences, Kunming, Yunnan, P. R. China
| |
Collapse
|
29
|
Li B, Zappalá G, Dumont E, Boisen A, Rindzevicius T, Schmidt MN, Alstrøm TS. Nitroaromatic explosives' detection and quantification using an attention-based transformer on surface-enhanced Raman spectroscopy maps. Analyst 2023; 148:4787-4798. [PMID: 37602485 DOI: 10.1039/d3an00446e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/22/2023]
Abstract
Rapidly and accurately detecting and quantifying the concentrations of nitroaromatic explosives is critical for public health and security. Among existing approaches, explosives' detection with Surface-Enhanced Raman Spectroscopy (SERS) has received considerable attention due to its high sensitivity. Typically, a preprocessed single spectrum that is the average of the entire or a selected subset of a SERS map is used to train various machine learning models for detection and quantification. Designing an appropriate averaging and preprocessing procedure for SERS maps across different concentrations is time-consuming and computationally costly, and the averaging of spectra may lead to the loss of crucial spectral information. We propose an attention-based vision transformer neural network for nitroaromatic explosives' detection and quantification that takes raw SERS maps as the input without any preprocessing. We produce two novel SERS datasets, 2,4-dinitrophenols (DNP) and picric acid (PA), and one benchmark SERS dataset, 4-nitrobenzenethiol (4-NBT), which have repeated measurements down to concentrations of 1 nM to illustrate the detection limit. We experimentally show that our approach outperforms or is on par with the existing methods in terms of detection and concentration prediction accuracy. With the produced attention maps, we can further identify the regions with a higher signal-to-noise ratio in the SERS maps. Based on our findings, the molecule of interest detection and concentration prediction using raw SERS maps is a promising alternative to existing approaches.
Collapse
Affiliation(s)
- Bo Li
- Department of Applied Mathematics and Computer Science, Technical University of Denmark, 2800 Lyngby, Denmark.
| | - Giulia Zappalá
- Center for Intelligent Drug Delivery and Sensing Using Microcontainers and Nanomechanics (IDUN), Department of Health Technology, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Elodie Dumont
- Center for Intelligent Drug Delivery and Sensing Using Microcontainers and Nanomechanics (IDUN), Department of Health Technology, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Anja Boisen
- Center for Intelligent Drug Delivery and Sensing Using Microcontainers and Nanomechanics (IDUN), Department of Health Technology, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Tomas Rindzevicius
- Center for Intelligent Drug Delivery and Sensing Using Microcontainers and Nanomechanics (IDUN), Department of Health Technology, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Mikkel N Schmidt
- Department of Applied Mathematics and Computer Science, Technical University of Denmark, 2800 Lyngby, Denmark.
| | - Tommy S Alstrøm
- Department of Applied Mathematics and Computer Science, Technical University of Denmark, 2800 Lyngby, Denmark.
| |
Collapse
|
30
|
Seddiki K, Precioso F, Sanabria M, Salzet M, Fournier I, Droit A. Early Diagnosis: End-to-End CNN-LSTM Models for Mass Spectrometry Data Classification. Anal Chem 2023; 95:13431-13437. [PMID: 37624777 PMCID: PMC10501374 DOI: 10.1021/acs.analchem.3c00613] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Accepted: 08/09/2023] [Indexed: 08/27/2023]
Abstract
Liquid chromatography-mass spectrometry (LC-MS) is a powerful method for cell profiling. The use of LC-MS technology is a tool of choice for cancer research since it provides molecular fingerprints of analyzed tissues. However, the ubiquitous presence of noise, the peaks shift between acquisitions, and the huge amount of information owing to the high dimensionality of the data make rapid and accurate cancer diagnosis a challenging task. Deep learning (DL) models are not only effective classifiers but are also well suited to jointly learn feature representation and classification tasks. This is particularly relevant when applied to raw LC-MS data and hence avoid the need for costly preprocessing and complicated feature selection. In this study, we propose a new end-to-end DL methodology that addresses all of the above challenges at once, while preserving the high potential of LC-MS data. Our DL model is designed to early discriminate between tumoral and normal tissues. It is a combination of a convolutional neural network (CNN) and a long short-term memory (LSTM) Network. The CNN network allows for significantly reducing the high dimensionality of the data while learning spatially relevant features. The LSTM network enables our model to capture temporal patterns. We show that our model outperforms not only benchmark models but also state-of-the-art models developed on the same data. Our framework is a promising strategy for improving early cancer detection during a diagnostic process.
Collapse
Affiliation(s)
- Khawla Seddiki
- Centre
de Recherche du CHU de Québec-Université Laval, Québec City, Québec G1V 4G2, Canada
- Univ.
Lille, Inserm, CHU Lille,
U1192-Protéomique Réponse Inflammatoire Spectrométrie
de Masse-PRISM, Lille F-59000, France
| | - Fŕed́eric Precioso
- Université
Ĉote d’Azur, CNRS, INRIA, I3S, Sophia Antipolis 06900, France
| | - Melissa Sanabria
- Université
Ĉote d’Azur, CNRS, INRIA, I3S, Sophia Antipolis 06900, France
| | - Michel Salzet
- Univ.
Lille, Inserm, CHU Lille,
U1192-Protéomique Réponse Inflammatoire Spectrométrie
de Masse-PRISM, Lille F-59000, France
| | - Isabelle Fournier
- Univ.
Lille, Inserm, CHU Lille,
U1192-Protéomique Réponse Inflammatoire Spectrométrie
de Masse-PRISM, Lille F-59000, France
| | - Arnaud Droit
- Centre
de Recherche du CHU de Québec-Université Laval, Québec City, Québec G1V 4G2, Canada
| |
Collapse
|
31
|
Yu Z, Chen X, Zhang J, Su Q, Wang K, Liu W. Rapid and Non-Destructive Estimation of Moisture Content in Caragana Korshinskii Pellet Feed Using Hyperspectral Imaging. SENSORS (BASEL, SWITZERLAND) 2023; 23:7592. [PMID: 37688047 PMCID: PMC10490800 DOI: 10.3390/s23177592] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 08/18/2023] [Accepted: 08/30/2023] [Indexed: 09/10/2023]
Abstract
Moisture content is an important parameter for estimating the quality of pellet feed, which is vital in nutrition, storage, and taste. The ranges of moisture content serve as an index for factors such as safe storage and nutrition stability. A rapid and non-destructive model for the measurement of moisture content in pellet feed was developed. To achieve this, 144 samples of Caragana korshinskii pellet feed from various regions in Inner Mongolia Autonomous Region underwent separate moisture content control, measurement using standard methods, and captured their images using a hyperspectral imaging (HSI) system in the spectral range of 935.5-2539 nm. The Monte Carlo cross validation (MCCV) was used to eliminate abnormal sample data from the spectral data for better model accuracy, and a global model of moisture content was built by using partial least squares regression (PLSR) with seven preprocessing techniques and two spectral feature extraction techniques. The results showed that the regression model developed by PLSR based on second derivative (SD) and competitive adaptive reweighted sampling (CARS) resulted in better performance for moisture content. The model showed predictive abilities for moisture content with a coefficient of determination of 0.9075 and a root mean square error (RMSE) of 0.4828 for the training set; and a coefficient of determination of 0.907 and a root mean square error (RMSE) of 0.5267 for the test set; and a relative prediction error of 3.3 and the standard error of 0.307.
Collapse
Affiliation(s)
| | | | - Jianchao Zhang
- College of Mechanical and Electrical Engineering, Inner Mongolia Agricultural University, Hohhot 010018, China; (Z.Y.); (X.C.); (Q.S.); (K.W.); (W.L.)
| | | | | | | |
Collapse
|
32
|
Lösel H, Brockelt J, Gärber F, Teipel J, Kuballa T, Seifert S, Fischer M. Comparative Analysis of LC-ESI-IM-qToF-MS and FT-NIR Spectroscopy Approaches for the Authentication of Organic and Conventional Eggs. Metabolites 2023; 13:882. [PMID: 37623826 PMCID: PMC10456441 DOI: 10.3390/metabo13080882] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 07/16/2023] [Accepted: 07/20/2023] [Indexed: 08/26/2023] Open
Abstract
The importance of animal welfare and the organic production of chicken eggs has increased in the European Union in recent years. Legal regulation for organic husbandry makes the production of organic chicken eggs more expensive compared to conventional husbandry and thus increases the risk of food fraud. Therefore, the aim of this study was to develop a non-targeted lipidomic LC-ESI-IM-qToF-MS method based on 270 egg samples, which achieved a classification accuracy of 96.3%. Subsequently, surrogate minimal depth (SMD) was applied to select important variables identified as carotenoids and lipids based on their MS/MS spectra. The LC-MS results were compared with FT-NIR spectroscopy analysis as a low-resolution screening method and achieved 80.0% accuracy. Here, SMD selected parts of the spectrum which are associated with lipids and proteins. Furthermore, we used SMD for low-level data fusion to analyze relations between the variables of the LC-MS and the FT-NIR spectroscopy datasets. Thereby, lipid-associated bands of the FT-NIR spectrum were related to the identified lipids from the LC-MS analysis, demonstrating that FT-NIR spectroscopy partially provides similar information about the lipidome. In future applications, eggs can therefore be analyzed with FT-NIR spectroscopy to identify conspicuous samples that can subsequently be counter-tested by mass spectrometry.
Collapse
Affiliation(s)
- Henri Lösel
- Hamburg School of Food Science, Institute of Food Chemistry, University of Hamburg, Grindelallee 117, 20146 Hamburg, Germany; (H.L.); (J.B.); (F.G.); (S.S.)
| | - Johannes Brockelt
- Hamburg School of Food Science, Institute of Food Chemistry, University of Hamburg, Grindelallee 117, 20146 Hamburg, Germany; (H.L.); (J.B.); (F.G.); (S.S.)
| | - Florian Gärber
- Hamburg School of Food Science, Institute of Food Chemistry, University of Hamburg, Grindelallee 117, 20146 Hamburg, Germany; (H.L.); (J.B.); (F.G.); (S.S.)
| | - Jan Teipel
- Chemisches und Veterinäruntersuchungsamt (CVUA) Karlsruhe, Weissenburger Strasse 3, 76187 Karlsruhe, Germany (T.K.)
| | - Thomas Kuballa
- Chemisches und Veterinäruntersuchungsamt (CVUA) Karlsruhe, Weissenburger Strasse 3, 76187 Karlsruhe, Germany (T.K.)
| | - Stephan Seifert
- Hamburg School of Food Science, Institute of Food Chemistry, University of Hamburg, Grindelallee 117, 20146 Hamburg, Germany; (H.L.); (J.B.); (F.G.); (S.S.)
| | - Markus Fischer
- Hamburg School of Food Science, Institute of Food Chemistry, University of Hamburg, Grindelallee 117, 20146 Hamburg, Germany; (H.L.); (J.B.); (F.G.); (S.S.)
| |
Collapse
|
33
|
Wang Y, Tian ZP, Xie JJ, Luo Y, Yao J, Shen J. Rapid Determination of Polysaccharides in Cistanche Tubulosa Using Near-Infrared Spectroscopy Combined with Machine Learning. J AOAC Int 2023; 106:1118-1125. [PMID: 36355447 DOI: 10.1093/jaoacint/qsac144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2022] [Revised: 10/30/2022] [Accepted: 11/01/2022] [Indexed: 07/20/2023]
Abstract
BACKGROUND Cistanche tubulosa, as a homology of medicine and food, not only has a unique medicinal value but also is widely used in healthcare products. Polysaccharide is one of its important quality indicators. OBJECTIVE In this study, an analytical model based on near-infrared (NIR) spectroscopy combined with machine learning was established to predict the polysaccharide content of C. tubulosa. METHODS The polysaccharide content in the samples determined by the phenol-sulfuric acid method was used as a reference value, and machine learning was applied to relate the spectral information to the reference value. Dividing the samples into a calibration set and a prediction set using the Kennard-Stone algorithm. The model was optimized by various preprocessing methods, including Savitzky-Golay (SG), standard normal variate (SNV), multiple scattering correction (MSC), first-order derivative (FD), second-order derivative (SD), and combinations of them. Variable selection was performed through the successive projections algorithm (SPA) and stability competitive adaptive reweighted sampling (sCARS). Four machine learning models were used to build quantitative models, including the random forest (RF), partial least-squares (PLS), principal component regression (PCR), and support vector machine (SVM). The evaluation indexes of the model were the coefficient of determination (R2), root-mean-square error (RMSE), and residual prediction deviation (RPD). RESULTS RF performs best among the four machine learning models. R2c (calibration set coefficient of determination) and RMSEC (root mean square error of the calibration set), %, were 0.9763. and 0.3527 for calibration, respectively. R2p (prediction set coefficient of determination), RMSEP (root mean square error of the prediction set), %, and RPD were 0.9230, 0.5130, and 3.33 for prediction, respectively. CONCLUSION The results indicate that NIR combined with the RF is an effective method applied to the quality evaluation of the polysaccharides of C. tubulosa. HIGHLIGHTS Four quantitative models were developed to predict the polysaccharide content in C. tubulosa, and good results were obtained. The characteristic variables were basically determined by the sCARS algorithm, and the corresponding characteristic groups were analyzed.
Collapse
Affiliation(s)
- Yu Wang
- School of Pharmacy, Xinjiang Medical University, Xinyi Road, Urumqi 830011, China
- Key Laboratory of Active Components of Xinjiang Natural Medicine and Drug Release Technology, Xinyi Road, Urumqi 830011, China
| | - Zhan-Ping Tian
- School of Pharmacy, Xinjiang Medical University, Xinyi Road, Urumqi 830011, China
| | - Jia-Jia Xie
- School of Pharmacy, Xinjiang Medical University, Xinyi Road, Urumqi 830011, China
| | - Ying Luo
- School of Pharmacy, Xinjiang Medical University, Xinyi Road, Urumqi 830011, China
| | - Jun Yao
- School of Pharmacy, Xinjiang Medical University, Xinyi Road, Urumqi 830011, China
- Key Laboratory of Active Components of Xinjiang Natural Medicine and Drug Release Technology, Xinyi Road, Urumqi 830011, China
| | - Jing Shen
- Department of Pharmacy, Affiliated Hospital 5 of Xinjiang Medical University, Henan West Road, Urumqi 830011, China
| |
Collapse
|
34
|
Savane P, Belmokhtar N, Delile A, Boizot N, Ridel C, Lelu-Walter MA, Teyssier C. Characterization of hybrid larch somatic embryo maturation by biochemical analyses and by a novel, fast mid-infrared approach. PHYSIOLOGIA PLANTARUM 2023; 175:e13966. [PMID: 37365151 DOI: 10.1111/ppl.13966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 06/06/2023] [Accepted: 06/22/2023] [Indexed: 06/28/2023]
Abstract
The morphology of somatic embryos (SE) is not a sufficient criterion to determine the level of maturation and the optimal stage to transfer embryos for germination, unlike the biochemical components. This composition characterization in the laboratory is too restrictive to be considered at each maturation cycle, as would be necessary. It is, therefore, essential to consider alternative methods. The objectives of this work were to achieve a complete biochemical characterization of the embryos during their development, to serve as a reference and develop a characterization based on infrared spectrometry and chemometrics. During the precotyledonary stage (0-3 weeks of maturation), water content and glucose and fructose levels were high, which is consistent with SE development. After 4 weeks, the cotyledonary SE had a metabolism oriented towards the storage accumulation of lipids, proteins and starch, whereas raffinose only appeared from 8 weeks. Mid-infrared calibration models were developed for water, proteins, lipids, carbohydrates, glucose, fructose, inositols, raffinose, stachyose and starch contents with an r2 average of 0.84. A model was also developed to discriminate the weeks of SE maturation. Different classes of age were discriminated with at least 72% of accuracy. Infrared analysis of the SE based on their full biochemical spectral fingerprint revealed a very slight variation in composition between 7 and 9 weeks, information that is very difficult to obtain by conventional analysis methods. These results provide novel insights into the maturation of conifer SE and indicate that mid-infrared spectrometry could be an easy and effective method for SE characterization.
Collapse
|
35
|
Muñoz EC, Gosetti F, Ballabio D, Andò S, Gómez-Laserna O, Amigo JM, Garzanti E. Characterization of pyrite weathering products by Raman hyperspectral imaging and chemometrics techniques. Microchem J 2023. [DOI: 10.1016/j.microc.2023.108655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
|
36
|
Brown AO, Green PJ, Frankham GJ, Stuart BH, Ueland M. Insights into the Effects of Violating Statistical Assumptions for Dimensionality Reduction for Chemical "-omics" Data with Multiple Explanatory Variables. ACS OMEGA 2023; 8:22042-22054. [PMID: 37360494 PMCID: PMC10286096 DOI: 10.1021/acsomega.3c01613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 05/25/2023] [Indexed: 06/28/2023]
Abstract
Biological volatilome analysis is inherently complex due to the considerable number of compounds (i.e., dimensions) and differences in peak areas by orders of magnitude, between and within compounds found within datasets. Traditional volatilome analysis relies on dimensionality reduction techniques which aid in the selection of compounds that are considered relevant to respective research questions prior to further analysis. Currently, compounds of interest are identified using either supervised or unsupervised statistical methods which assume the data residuals are normally distributed and exhibit linearity. However, biological data often violate the statistical assumptions of these models related to normality and the presence of multiple explanatory variables which are innate to biological samples. In an attempt to address deviations from normality, volatilome data can be log transformed. However, whether the effects of each assessed variable are additive or multiplicative should be considered prior to transformation, as this will impact the effect of each variable on the data. If assumptions of normality and variable effects are not investigated prior to dimensionality reduction, ineffective or erroneous compound dimensionality reduction can impact downstream analyses. It is the aim of this manuscript to assess the impact of single and multivariable statistical models with and without the log transformation to volatilome dimensionality reduction prior to any supervised or unsupervised classification analysis. As a proof of concept, Shingleback lizard (Tiliqua rugosa) volatilomes were collected across their species distribution and from captivity and were assessed. Shingleback volatilomes are suspected to be influenced by multiple explanatory variables related to habitat (Bioregion), sex, parasite presence, total body volume, and captive status. This work determined that the exclusion of relevant multiple explanatory variables from analysis overestimates the effect of Bioregion and the identification of significant compounds. The log transformation increased the number of compounds that were identified as significant, as did analyses that assumed that residuals were normally distributed. Among the methods considered in this work, the most conservative form of dimensionality reduction was achieved through analyzing untransformed data using Monte Carlo tests with multiple explanatory variables.
Collapse
Affiliation(s)
- Amber O. Brown
- Australian
Museum Research Institute, Australian Museum, Sydney 2001, NSW, Australia
- Centre
for Forensic Science, University of Technology
Sydney, Ultimo 2007, NSW, Australia
| | - Peter J. Green
- University
of Bristol, Bristol BS8 1UG, U.K.
- University
of Technology Sydney, Ultimo 2007, NSW, Australia
| | - Greta J. Frankham
- Australian
Museum Research Institute, Australian Museum, Sydney 2001, NSW, Australia
- Centre
for Forensic Science, University of Technology
Sydney, Ultimo 2007, NSW, Australia
| | - Barbara H. Stuart
- Australian
Museum Research Institute, Australian Museum, Sydney 2001, NSW, Australia
| | - Maiken Ueland
- Australian
Museum Research Institute, Australian Museum, Sydney 2001, NSW, Australia
| |
Collapse
|
37
|
Mennickent D, Rodríguez A, Opazo MC, Riedel CA, Castro E, Eriz-Salinas A, Appel-Rubio J, Aguayo C, Damiano AE, Guzmán-Gutiérrez E, Araya J. Machine learning applied in maternal and fetal health: a narrative review focused on pregnancy diseases and complications. Front Endocrinol (Lausanne) 2023; 14:1130139. [PMID: 37274341 PMCID: PMC10235786 DOI: 10.3389/fendo.2023.1130139] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Accepted: 05/04/2023] [Indexed: 06/06/2023] Open
Abstract
Introduction Machine learning (ML) corresponds to a wide variety of methods that use mathematics, statistics and computational science to learn from multiple variables simultaneously. By means of pattern recognition, ML methods are able to find hidden correlations and accomplish accurate predictions regarding different conditions. ML has been successfully used to solve varied problems in different areas of science, such as psychology, economics, biology and chemistry. Therefore, we wondered how far it has penetrated into the field of obstetrics and gynecology. Aim To describe the state of art regarding the use of ML in the context of pregnancy diseases and complications. Methodology Publications were searched in PubMed, Web of Science and Google Scholar. Seven subjects of interest were considered: gestational diabetes mellitus, preeclampsia, perinatal death, spontaneous abortion, preterm birth, cesarean section, and fetal malformations. Current state ML has been widely applied in all the included subjects. Its uses are varied, the most common being the prediction of perinatal disorders. Other ML applications include (but are not restricted to) biomarker discovery, risk estimation, correlation assessment, pharmacological treatment prediction, drug screening, data acquisition and data extraction. Most of the reviewed articles were published in the last five years. The most employed ML methods in the field are non-linear. Except for logistic regression, linear methods are rarely used. Future challenges To improve data recording, storage and update in medical and research settings from different realities. To develop more accurate and understandable ML models using data from cutting-edge instruments. To carry out validation and impact analysis studies of currently existing high-accuracy ML models. Conclusion The use of ML in pregnancy diseases and complications is quite recent, and has increased over the last few years. The applications are varied and point not only to the diagnosis, but also to the management, treatment, and pathophysiological understanding of perinatal alterations. Facing the challenges that come with working with different types of data, the handling of increasingly large amounts of information, the development of emerging technologies, and the need of translational studies, it is expected that the use of ML continue growing in the field of obstetrics and gynecology.
Collapse
Affiliation(s)
- Daniela Mennickent
- Departamento de Bioquímica Clínica e Inmunología, Facultad de Farmacia, Universidad de Concepción, Concepción, Chile
- Departamento de Análisis Instrumental, Facultad de Farmacia, Universidad de Concepción, Concepción, Chile
- Machine Learning Applied in Biomedicine (MLAB), Concepción, Chile
| | - Andrés Rodríguez
- Machine Learning Applied in Biomedicine (MLAB), Concepción, Chile
- Departamento de Ciencias Básicas, Facultad de Ciencias, Universidad del Bío-Bío, Chillán, Chile
| | - Ma. Cecilia Opazo
- Instituto de Ciencias Naturales, Facultad de Medicina Veterinaria y Agronomía, Universidad de Las Américas, Santiago, Chile
- Millennium Institute on Immunology and Immunotherapy, Santiago, Chile
| | - Claudia A. Riedel
- Millennium Institute on Immunology and Immunotherapy, Santiago, Chile
- Departamento de Ciencias Biológicas, Facultad de Ciencias de la Vida, Universidad Andrés Bello, Santiago, Chile
| | - Erica Castro
- Departamento de Obstetricia y Puericultura, Facultad de Ciencias de la Salud, Universidad de Atacama, Copiapó, Chile
| | - Alma Eriz-Salinas
- Departamento de Obstetricia y Puericultura, Facultad de Medicina, Universidad de Concepción, Concepción, Chile
| | - Javiera Appel-Rubio
- Departamento de Bioquímica Clínica e Inmunología, Facultad de Farmacia, Universidad de Concepción, Concepción, Chile
| | - Claudio Aguayo
- Departamento de Bioquímica Clínica e Inmunología, Facultad de Farmacia, Universidad de Concepción, Concepción, Chile
| | - Alicia E. Damiano
- Cátedra de Biología Celular y Molecular, Departamento de Ciencias Biológicas, Facultad de Farmacia y Bioquímica, Universidad de Buenos Aires, Buenos Aires, Argentina
- Laboratorio de Biología de la Reproducción, Instituto de Fisiología y Biofísica Bernardo Houssay (IFIBIO-Houssay)- CONICET, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Enrique Guzmán-Gutiérrez
- Departamento de Bioquímica Clínica e Inmunología, Facultad de Farmacia, Universidad de Concepción, Concepción, Chile
- Machine Learning Applied in Biomedicine (MLAB), Concepción, Chile
| | - Juan Araya
- Departamento de Análisis Instrumental, Facultad de Farmacia, Universidad de Concepción, Concepción, Chile
- Machine Learning Applied in Biomedicine (MLAB), Concepción, Chile
| |
Collapse
|
38
|
Andrews HB, Sadergaski LR. Leveraging visible and near-infrared spectroelectrochemistry to calibrate a robust model for Vanadium(IV/V) in varying nitric acid and temperature levels. Talanta 2023; 259:124554. [PMID: 37080075 DOI: 10.1016/j.talanta.2023.124554] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 04/10/2023] [Accepted: 04/11/2023] [Indexed: 04/22/2023]
Abstract
Spectroelectrochemistry and optimal design of experiments can be used to rapidly build accurate models for species quantification and enable a greater level of process awareness. Optical spectroscopy can provide vital elemental and molecular information, but several hurdles must be overcome before it can become a widely adopted analytical method for remote analysis in the nuclear field. Analytes with varying oxidation state, acid concentration, and fluctuating temperature must be efficiently accounted for to minimize time and resources in restrictive hot cell environments. The classic one-factor-at-a-time approach is not suitable for frequent calibration/maintenance operations in this setting. Therefore, a novel alternative was developed to characterize a system containing vanadium(IV/V) (0.01-0.1 M), nitric acid (0.1-4 M), and varying temperatures (20-45 °C). Spectroelectrochemistry methods were used to acquire a sample set selected by optimal design of experiments. This new approach allows for the accurate analysis of vanadium and HNO3 concentration by leveraging UV-Vis-NIR absorption spectroscopy with robust and accurate chemometric models. The top model's root mean squared error of prediction percent values were 3.47%, 4.06%, 3.40%, and 10.9% for V(IV), V(V), HNO3, and temperature, respectively. These models, efficiently developed using the designed approach, exhibited strong predictive accuracy for vanadium and acid with varying oxidation states and temperature using only spectrophotometry, which advances current technology for real-world hot cell applications. Additionally, Nernstian analysis of the V(IV/V) standard potential was performed using traditional absorbance methods and multivariate curve resolution (MCR). The successful tests demonstrated that MCR Nernst tests may be valuable in highly convoluted spectral systems to better understand the redox processes' behavior.
Collapse
Affiliation(s)
- Hunter B Andrews
- Radioisotope Science and Technology Division, Oak Ridge National Laboratory, 1 Bethel Valley Rd., Oak Ridge, TN, 37980, USA.
| | - Luke R Sadergaski
- Radioisotope Science and Technology Division, Oak Ridge National Laboratory, 1 Bethel Valley Rd., Oak Ridge, TN, 37980, USA
| |
Collapse
|
39
|
Dayananda B, Owen S, Kolobaric A, Chapman J, Cozzolino D. Pre-processing Applied to Instrumental Data in Analytical Chemistry: A Brief Review of the Methods and Examples. Crit Rev Anal Chem 2023:1-9. [PMID: 37053040 DOI: 10.1080/10408347.2023.2199864] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/14/2023]
Abstract
The field of analytical chemistry has been significantly advanced by the availability of state-of-the-art instrumentation, allowing for the development of novel applications in this field. However, in many cases, the direct interpretation of the recorded data is often not straightforward, hence some level of pre-processing is required (e.g., baseline correction, derivatives, normalization, smoothing). These techniques have become a critical first step for the successful analysis of the data recorded, and it is recommended to use them before the application of chemometrics (e.g., classification, calibration development). The aim of this paper is to provide with an overview of the most used pre-processing methods applied to instrumental analytical methods (e.g., spectroscopy, chromatography). Examples of their application in near infrared and UV-VIS spectroscopy as well as in gas chromatography will be also discussed. Overall, this paper provides with a comprehensive understanding of pre-processing techniques in analytical chemistry, highlighting their importance during the analysis and interpretation of data, as well as during the development of accurate and reliable chemometric models.
Collapse
Affiliation(s)
- B Dayananda
- School of Agriculture and Food Sciences, The University of Queensland, Brisbane, Queensland, Australia
| | - S Owen
- School of Science, RMIT University, Melbourne, Victoria, Australia
| | - A Kolobaric
- School of Science, RMIT University, Melbourne, Victoria, Australia
| | - J Chapman
- School of Science, RMIT University, Melbourne, Victoria, Australia
| | - D Cozzolino
- Centre for Nutrition and Food Sciences, Queensland Alliance for Agriculture and Food Innovation (QAAFI), The University of Queensland, Brisbane, Queensland, Australia
| |
Collapse
|
40
|
Kappatou CD, Odgers J, García-Muñoz S, Misener R. An Optimization Approach Coupling Preprocessing with Model Regression for Enhanced Chemometrics. Ind Eng Chem Res 2023; 62:6196-6213. [PMID: 37097815 PMCID: PMC10119938 DOI: 10.1021/acs.iecr.2c04583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 03/02/2023] [Accepted: 03/27/2023] [Indexed: 04/09/2023]
Abstract
Chemometric methods are broadly used in the chemical and biochemical sectors. Typically, derivation of a regression model follows data preprocessing in a sequential manner. Yet, preprocessing can significantly influence the regression model and eventually its predictive ability. In this work, we investigate the coupling of preprocessing and model parameter estimation by incorporating them simultaneously in an optimization step. Common model selection techniques rely almost exclusively on the performance of some accuracy metric, yet having a quantitative metric for model robustness can prolong model up-time. Our approach is applied to optimize for model accuracy and robustness. This requires the introduction of a novel mathematical definition for robustness. We test our method in a simulated set up and with industrial case studies from multivariate calibration. The results highlight the importance of both accuracy and robustness properties and illustrate the potential of the proposed optimization approach toward automating the generation of efficient chemometric models.
Collapse
Affiliation(s)
- Chrysoula D. Kappatou
- Computational Optimisation Group, Department of Computing, Imperial College London, London SW7 2AZ, United Kingdom
| | - James Odgers
- Computational Optimisation Group, Department of Computing, Imperial College London, London SW7 2AZ, United Kingdom
| | - Salvador García-Muñoz
- Synthetic Molecule Design and Development, Lilly Research Laboratories, Eli Lilly & Company, Indianapolis, Indiana 46285, United States
| | - Ruth Misener
- Computational Optimisation Group, Department of Computing, Imperial College London, London SW7 2AZ, United Kingdom
| |
Collapse
|
41
|
Liu Q, Dong P, Fengou LC, Nychas GJ, Fowler SM, Mao Y, Luo X, Zhang Y. Preliminary investigation into the prediction of indicators of beef spoilage using Raman and Fourier transform infrared spectroscopy. Meat Sci 2023; 200:109168. [PMID: 36963260 DOI: 10.1016/j.meatsci.2023.109168] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Revised: 03/13/2023] [Accepted: 03/16/2023] [Indexed: 03/22/2023]
Abstract
The objective of this study was to assess the potential to predict the microbial beef spoilage indicators by Raman and Fourier transform infrared (FT-IR) spectroscopies. Vacuum skin packaged (VSP) beef steaks were stored at 0 °C, 4 °C, 8 °C and under a dynamic temperature condition (0 °C ∼ 4 °C ∼ 8 °C, for 36 d). Total viable count (TVC) and total volatile basic nitrogen (TVB-N) were obtained during the storage period along with spectroscopic data. The Raman and FTIR spectra were baseline corrected, pre-processed using Savitzky-Golay smoothing and normalized. Subsequently partial least squares regression (PLSR) models of TVC and TVB-N were developed and evaluated. The root mean squared error (RMSE) ranged from 0.81 to1.59 (log CFU/g or mg/100 g) and the determination coefficient (R2) from 0.54 to 0.75. The performance of PLSR model based on data fusion (combination of Raman and FT-IR data) is better than that based on Raman spectra and similar to that of FT-IR. Overall, Raman spectroscopy, FT-IR spectroscopy, and a combination of both exhibited a potential for the prediction of the beef spoilage.
Collapse
Affiliation(s)
- Qingsen Liu
- Lab of Beef Processing and Quality Control, College of Food Science and Engineering, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Pengcheng Dong
- Lab of Beef Processing and Quality Control, College of Food Science and Engineering, Shandong Agricultural University, Tai'an, Shandong 271018, PR China.
| | - Lemonia-Christina Fengou
- Laboratory of Microbiology and Biotechnology of Foods, Department of Food Science and Human Nutrition, School of Food and Nutritional Sciences, Agricultural University of Athens, Iera Odos 75, 11855 Athens, Greece.
| | - George-John Nychas
- Laboratory of Microbiology and Biotechnology of Foods, Department of Food Science and Human Nutrition, School of Food and Nutritional Sciences, Agricultural University of Athens, Iera Odos 75, 11855 Athens, Greece.
| | - Stephanie Marie Fowler
- NSW Department of Primary Industries, Centre for Red Meat and Sheep Development, PO Box 129, Cowra, NSW 2794, Australia.
| | - Yanwei Mao
- Lab of Beef Processing and Quality Control, College of Food Science and Engineering, Shandong Agricultural University, Tai'an, Shandong 271018, PR China.
| | - Xin Luo
- Lab of Beef Processing and Quality Control, College of Food Science and Engineering, Shandong Agricultural University, Tai'an, Shandong 271018, PR China.
| | - Yimin Zhang
- Lab of Beef Processing and Quality Control, College of Food Science and Engineering, Shandong Agricultural University, Tai'an, Shandong 271018, PR China.
| |
Collapse
|
42
|
Wang YM, Ostendorf B, Pagay V. Detecting Grapevine Virus Infections in Red and White Winegrape Canopies Using Proximal Hyperspectral Sensing. SENSORS (BASEL, SWITZERLAND) 2023; 23:2851. [PMID: 36905055 PMCID: PMC10007312 DOI: 10.3390/s23052851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 02/21/2023] [Accepted: 03/01/2023] [Indexed: 06/18/2023]
Abstract
Grapevine virus-associated disease such as grapevine leafroll disease (GLD) affects grapevine health worldwide. Current diagnostic methods are either highly costly (laboratory-based diagnostics) or can be unreliable (visual assessments). Hyperspectral sensing technology is capable of measuring leaf reflectance spectra that can be used for the non-destructive and rapid detection of plant diseases. The present study used proximal hyperspectral sensing to detect virus infection in Pinot Noir (red-berried winegrape cultivar) and Chardonnay (white-berried winegrape cultivar) grapevines. Spectral data were collected throughout the grape growing season at six timepoints per cultivar. Partial least squares-discriminant analysis (PLS-DA) was used to build a predictive model of the presence or absence of GLD. The temporal change of canopy spectral reflectance showed that the harvest timepoint had the best prediction result. Prediction accuracies of 96% and 76% were achieved for Pinot Noir and Chardonnay, respectively. Our results provide valuable information on the optimal time for GLD detection. This hyperspectral method can also be deployed on mobile platforms including ground-based vehicles and unmanned aerial vehicles (UAV) for large-scale disease surveillance in vineyards.
Collapse
Affiliation(s)
- Yeniu Mickey Wang
- School of Agriculture, Food & Wine, Waite Research Institute, The University of Adelaide, PMB 1, Glen Osmond, SA 5064, Australia
- CSIRO Manufacturing, 13 Kintore Ave, Adelaide, SA 5000, Australia
| | - Bertram Ostendorf
- School of Biological Sciences, The University of Adelaide, Molecular Life Sciences Building, North Terrace Campus, Adelaide, SA 5005, Australia
| | - Vinay Pagay
- School of Agriculture, Food & Wine, Waite Research Institute, The University of Adelaide, PMB 1, Glen Osmond, SA 5064, Australia
| |
Collapse
|
43
|
Tsagkaris A, Bechynska K, Ntakoulas D, Pasias I, Weller P, Proestos C, Hajslova J. Investigating the impact of spectral data pre-processing to assess honey botanical origin through Fourier transform infrared spectroscopy (FTIR). J Food Compost Anal 2023. [DOI: 10.1016/j.jfca.2023.105276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/14/2023]
|
44
|
Shi S, Zhao D, Pan K, Ma Y, Zhang G, Li L, Cao C, Jiang Y. Combination of near-infrared spectroscopy and key wavelength-based screening algorithm for rapid determination of rice protein content. J Food Compost Anal 2023. [DOI: 10.1016/j.jfca.2023.105216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2023]
|
45
|
Andrews H, Sadergaski LR, Cary SK. Pursuit of the Ultimate Regression Model for Samarium(III), Europium(III), and LiCl Using Laser-Induced Fluorescence, Design of Experiments, and a Genetic Algorithm for Feature Selection. ACS OMEGA 2023; 8:2281-2290. [PMID: 36687031 PMCID: PMC9850777 DOI: 10.1021/acsomega.2c06610] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 12/14/2022] [Indexed: 06/17/2023]
Abstract
Laser-induced fluorescence spectroscopy, Raman scattering, and partial least squares regression models were optimized for the quantification of samarium (0-150 μg mL-1), europium (0-75 μg mL-1), and lithium chloride (0.1-12 M) with a transformational preprocessing strategy. Selecting combinations of preprocessing methods to optimize the prediction performance of regression models is frequently a major bottleneck for chemometric analysis. Here, we propose an optimization tool using an innovative combination of optimal experimental designs for selecting preprocessing transformation and a genetic algorithm (GA) for feature selection. A D-optimal design containing 26 samples (i.e., combinations of preprocessing strategies) and a user-defined design (576 samples) did not statistically lower the root mean square error of the prediction (RMSEP). The greatest improvement in prediction performance was achieved when a GA was used for feature selection. This feature selection greatly lowered RMSEP statistics by an average of 53%, resulting in the top models with percent RMSEP values of 0.91, 3.5, and 2.1% for Sm(III), Eu(III), and LiCl, respectively. These results indicate that preprocessing corrections (e.g., scatter, scaling, noise, and baseline) alone cannot realize the optimal regression model; feature selection is a more crucial aspect to consider. This unique approach provides a powerful tool for approaching the true optimum prediction performance and can be applied to numerous fields of spectroscopy and chemometrics to rapidly construct models.
Collapse
|
46
|
Ordoudi SA, Strani L, Cocchi M. Toward the Non-Targeted Detection of Adulterated Virgin Olive Oil with Edible Oils via FTIR Spectroscopy & Chemometrics: Research Methodology Trends, Gaps and Future Perspectives. MOLECULES (BASEL, SWITZERLAND) 2023; 28:molecules28010337. [PMID: 36615530 PMCID: PMC9822006 DOI: 10.3390/molecules28010337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 12/23/2022] [Accepted: 12/23/2022] [Indexed: 01/04/2023]
Abstract
Fourier-Transform mid-infrared (FTIR) spectroscopy offers a strong candidate screening tool for rapid, non-destructive and early detection of unauthorized virgin olive oil blends with other edible oils. Potential applications to the official anti-fraud control are supported by dozens of research articles with a "proof-of-concept" study approach through different chemometric workflows for comprehensive spectral analysis. It may also assist non-targeted authenticity testing, an emerging goal for modern food fraud inspection systems. Hence, FTIR-based methods need to be standardized and validated to be accepted by the olive industry and official regulators. Thus far, several literature reviews evaluated the competence of FTIR standalone or compared with other vibrational techniques only in view of the chemometric methodology, regardless of the inherent characteristics of the product spectra or the application scope. Regarding authenticity testing, every step of the methodology workflow, and not only the post-acquisition steps, need thorough validation. In this context, the present review investigates the progress in the research methodology on FTIR-based detection of virgin olive oil adulteration over a period of more than 25 years with the aim to capture the trends, identify gaps or misuses in the existing literature and highlight intriguing topics for future studies. An extensive search in Scopus, Web of Science and Google Scholar, combined with bibliometric analysis, helped to extract qualitative and quantitative information from publication sources. Our findings verified that intercomparison of literature results is often impossible; sampling design, FTIR spectral acquisition and performance evaluation are critical methodological issues that need more specific guidance and criteria for application to product authenticity testing.
Collapse
Affiliation(s)
- Stella A. Ordoudi
- Laboratory of Food Chemistry and Technology, School of Chemistry, Aristotle University of Thessaloniki (AUTh), GR-54124 Thessaloniki, Greece
- Correspondence:
| | - Lorenzo Strani
- Department of Chemical and Geological Sciences, University of Modena and Reggio Emilia (UNIMORE), Via Campi 103, 41125 Modena, Italy
| | - Marina Cocchi
- Department of Chemical and Geological Sciences, University of Modena and Reggio Emilia (UNIMORE), Via Campi 103, 41125 Modena, Italy
| |
Collapse
|
47
|
Grassi S, Tarapoulouzi M, D’Alessandro A, Agriopoulou S, Strani L, Varzakas T. How Chemometrics Can Fight Milk Adulteration. Foods 2022; 12:foods12010139. [PMID: 36613355 PMCID: PMC9819000 DOI: 10.3390/foods12010139] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Revised: 12/10/2022] [Accepted: 12/22/2022] [Indexed: 12/29/2022] Open
Abstract
Adulteration and fraud are amongst the wrong practices followed nowadays due to the attitude of some people to gain more money or their tendency to mislead consumers. Obviously, the industry follows stringent controls and methodologies in order to protect consumers as well as the origin of the food products, and investment in these technologies is highly critical. In this context, chemometric techniques proved to be very efficient in detecting and even quantifying the number of substances used as adulterants. The extraction of relevant information from different kinds of data is a crucial feature to achieve this aim. However, these techniques are not always used properly. In fact, training is important along with investment in these technologies in order to cope effectively and not only reduce fraud but also advertise the geographical origin of the various food and drink products. The aim of this paper is to present an overview of the different chemometric techniques (from clustering to classification and regression applied to several analytical data) along with spectroscopy, chromatography, electrochemical sensors, and other on-site detection devices in the battle against milk adulteration. Moreover, the steps which should be followed to develop a chemometric model to face adulteration issues are carefully presented with the required critical discussion.
Collapse
Affiliation(s)
- Silvia Grassi
- Department of Food, Environmental and Nutritional Sciences (DeFENS), Università degli Studi di Milano, Via Celoria, 2, 20133 Milano, Italy
| | - Maria Tarapoulouzi
- Department of Chemistry, Faculty of Pure and Applied Science, University of Cyprus, P.O. Box 20537, Nicosia CY-1678, Cyprus
| | - Alessandro D’Alessandro
- Department of Chemical and Geological Sciences, University of Modena and Reggio Emilia, Via Campi 103, 41125 Modena, Italy
| | - Sofia Agriopoulou
- Department of Food Science and Technology, University of the Peloponnese, Antikalamos, 24100 Kalamata, Greece
| | - Lorenzo Strani
- Department of Chemical and Geological Sciences, University of Modena and Reggio Emilia, Via Campi 103, 41125 Modena, Italy
- Correspondence: (L.S.); (T.V.)
| | - Theodoros Varzakas
- Department of Food Science and Technology, University of the Peloponnese, Antikalamos, 24100 Kalamata, Greece
- Correspondence: (L.S.); (T.V.)
| |
Collapse
|
48
|
Ordoudi SA, Özdikicierler O, Tsimidou MZ. Detection of ternary mixtures of virgin olive oil with canola, hazelnut or safflower oils via non-targeted ATR-FTIR fingerprinting and chemometrics. Food Control 2022. [DOI: 10.1016/j.foodcont.2022.109240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
|
49
|
Gai Z, Sun L, Bai H, Li X, Wang J, Bai S. Convolutional neural network for apple bruise detection based on hyperspectral. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2022; 279:121432. [PMID: 35660156 DOI: 10.1016/j.saa.2022.121432] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Revised: 05/11/2022] [Accepted: 05/24/2022] [Indexed: 06/15/2023]
Abstract
The timely detection of apple bruises caused by collision and squeeze is of great significance to reduce the economic losses of the apple industry. This study proposed a spectral analysis model (SpectralCNN) based on a one-dimensional convolutional neural network to detect apple bruises. The influences of six spectral preprocessing methods on the SpectralCNN model were firstly analyzed in this paper. Compared with traditional chemometric models, the SpectralCNN model had a better accuracy, which was demonstrated not depend on the spectral preprocessing method by experiment results. Then, 20 characteristic wavelengths could be extracted by successive projection algorithm. The SpectralCNN model could achieve an accuracy of 95.79% on the test set of characteristic wavelengths, indicating that the extracted characteristic wavelengths contain most of the features of bruised and healthy pixels.
Collapse
Affiliation(s)
- Zhaodong Gai
- College of Electronics Engineering, Heilongjiang University, Harbin, Heilongjiang Province, China; Jiaxiang Research Academy of Industrial Technology, Jining, Shandong Province, China.
| | - Laijun Sun
- College of Electronics Engineering, Heilongjiang University, Harbin, Heilongjiang Province, China; Jiaxiang Research Academy of Industrial Technology, Jining, Shandong Province, China.
| | - Hongyi Bai
- College of Electronics Engineering, Heilongjiang University, Harbin, Heilongjiang Province, China.
| | - Xiaoxu Li
- College of Electronics Engineering, Heilongjiang University, Harbin, Heilongjiang Province, China; Jiaxiang Research Academy of Industrial Technology, Jining, Shandong Province, China.
| | - Jiaying Wang
- College of Electronics Engineering, Heilongjiang University, Harbin, Heilongjiang Province, China.
| | - Songning Bai
- College of Electronics Engineering, Heilongjiang University, Harbin, Heilongjiang Province, China.
| |
Collapse
|
50
|
Lv R, Wang Z, Ma Y, Li W, Tian J. Machine Learning Enhanced Optical Spectroscopy for Disease Detection. J Phys Chem Lett 2022; 13:9238-9249. [PMID: 36173116 DOI: 10.1021/acs.jpclett.2c02193] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Optical spectroscopy plays an important role in disease detection. Improving the sensitivity and specificity of spectral detection has great importance in the development of accurate diagnosis. The development of artificial intelligence technology provides a great opportunity to improve the detection accuracy through machine learning methods. In this Perspective, we focus on the combination of machine learning methods with the optical spectroscopy methods widely used for disease detection, including absorbance, fluorescence, scattering, FTIR, terahertz, etc. By comparing the spectral analysis with different machine learning methods, we illustrate that the support vector machine and convolutional neural network are most effective, which have potential to further improve the classification accuracy to distinguish disease subtypes if these machine learning methods are used. This Perspective broadens the scope of optical spectroscopy enhanced by machine learning and will be useful for the development of disease detection.
Collapse
Affiliation(s)
- Ruichan Lv
- Interdisciplinary Research Center of Smart Sensor, Engineering Research Center of Molecular and Neuro Imaging, Ministry of Education, School of Life Science and Technology, Xidian University, Xi'an, Shaanxi 710071, China
| | - Zhan Wang
- Interdisciplinary Research Center of Smart Sensor, Engineering Research Center of Molecular and Neuro Imaging, Ministry of Education, School of Life Science and Technology, Xidian University, Xi'an, Shaanxi 710071, China
| | - Yaqun Ma
- Interdisciplinary Research Center of Smart Sensor, Engineering Research Center of Molecular and Neuro Imaging, Ministry of Education, School of Life Science and Technology, Xidian University, Xi'an, Shaanxi 710071, China
| | - Wenjing Li
- Interdisciplinary Research Center of Smart Sensor, Engineering Research Center of Molecular and Neuro Imaging, Ministry of Education, School of Life Science and Technology, Xidian University, Xi'an, Shaanxi 710071, China
| | - Jie Tian
- Key Laboratory of Molecular Imaging of Chinese Academy of Sciences, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
| |
Collapse
|