1
|
Geng Y, Ni H, Shen H, Wang H, Wu J, Pan K, Wu Y, Chen Y, Luo Y, Xu T, Liu X. Feasibility of an NIR spectral calibration transfer algorithm based on optimized feature variables to predict tobacco samples in different states. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2023; 15:719-728. [PMID: 36722963 DOI: 10.1039/d2ay01805e] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The prediction accuracy of calibration models for near-infrared (NIR) spectroscopy typically relies on the morphology and homogeneity of the samples. To achieve non-homogeneous tobacco samples for non-destructive and rapid analysis, a method that can predict tobacco filament samples using reliable models based on the corresponding tobacco powder is proposed here. First, as it is necessary to establish a simple and robust calibrated model with excellent performance, based on full-wavelength PLSR (Full-PLSR), the key feature variables were screened by three methods, namely competitive adaptive reweighted sampling (CARS), variable combination population analysis-iteratively retaining informative variables (VCPA-IRIV), and variable combination population analysis-genetic algorithm (VCPA-GA). The partial least squares regression (PLSR) models for predicting the total sugar content in tobacco were established based on three optimal wavelength sets and named CARS-PLSR, VCPA-IRIV-PLSR and VCPA-GA-PLSR, respectively. Subsequently, they were combined with different calibration transfer algorithms, including calibration transfer based on canonical correlation analysis (CTCCA), slope/bias correction (S/B) and non-supervised parameter-free framework for calibration enhancement (NS-PFCE), to evaluate the best prediction model for the tobacco filament samples. Compared with the previous two transfer algorithms, NS-PFCE performed the best under various wavelength conditions. The prediction results indicated that the most successful approach for predicting the tobacco filament samples was achieved by VCPA-IRIV-PLSR when coupled with the NS-PFCE method, which obtained the highest determination coefficient (Rp2 = 0.9340) and the lowest root mean square error of the prediction set (RMSEP = 0.8425). VCPA-IRIV simplifies the calibration model and improves the efficiency of model transfer (31 variables). Furthermore, it pledges the prediction accuracy of the tobacco filament samples when combined with NS-PFCE. In summary, calibration transfer based on optimized feature variables can eliminate prediction errors caused by sample morphological differences and proves to be a more beneficial method for online application in the tobacco industry.
Collapse
Affiliation(s)
- Yingrui Geng
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Hongfei Ni
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Hangzhou 310018, China
| | - Huanchao Shen
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Hangzhou 310018, China
| | - Hui Wang
- Technology Center, China Tobacco Zhejiang Industrial Co., Ltd, Hangzhou 310008, China
| | - Jizhong Wu
- Technology Center, China Tobacco Zhejiang Industrial Co., Ltd, Hangzhou 310008, China
| | - Keyu Pan
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Yongjiang Wu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Yong Chen
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Yingjie Luo
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Tengfei Xu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Xuesong Liu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| |
Collapse
|
2
|
Rapid Analysis of Raw Meal Composition Content Based on NIR Spectroscopy for Cement Raw Material Proportioning Control Process. Processes (Basel) 2022. [DOI: 10.3390/pr10122494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Due to fast analysis speed, analyzing composition content of cement raw meal utilizing near infrared (NIR) spectroscopy, combined with partial least squares regression (PLS), is a reliable alternative method for the cement industry to obtain qualified cement products. However, it has hardly been studied. The raw materials employed in different cement plants differ, and the spectral absorption intensity in the NIR range of the raw meal component is weaker than organic substances, although there are obvious absorption peaks, which place high demands on the generality of modeling and accuracy of the analytical model. An effective modeling procedure is proposed, which optimizes the quantitative analytical model from several modeling stages, and two groups of samples with different raw material types and origins are collected to validate it. For the samples in the prediction set from Qufu, the root mean square error of prediction (RMSEP) of CaO, SiO2, Al2O3, and Fe2O3 were 0.1910, 0.2307, 0.0921, and 0.0429, respectively; the average prediction errors for CaO, SiO2, Al2O3, and Fe2O3 were 0.171%, 0.193%, 0.069%, and 0.032%, respectively; for the samples in the prediction set from Linyi, the RMSEP of CaO, SiO2, Al2O3, and Fe2O3 were 0.1995, 0.1267, 0.0336 and 0.0242, respectively, the average prediction errors for CaO, SiO2, Al2O3, and Fe2O3 were 0.154%, 0.100%, 0.022%, and 0.018%, respectively. The standard methods for chemical analysis of cement require that the mean measurement error for CaO, SiO2, Al2O3, and Fe2O3 should be within 0.40%, 0.30%, 0.20%, and 0.15%, respectively. It is obvious that the results of both groups of samples fully satisfied the requirements of raw material proportioning control of the production line, demonstrating that the modeling procedure has excellent generality, the models established have high prediction accuracy, and the NIR spectroscopy combined with the proposed modeling procedure is a rapid and accurate alternative approach for the analysis of cement raw meal composition content.
Collapse
|
3
|
Xiao D, Le BT, Ha TTL. Iron ore identification method using reflectance spectrometer and a deep neural network framework. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2021; 248:119168. [PMID: 33229210 DOI: 10.1016/j.saa.2020.119168] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Revised: 10/28/2020] [Accepted: 10/30/2020] [Indexed: 06/11/2023]
Abstract
In the first selection stage of iron ore, the ore classification accuracy plays a decisive role in subsequent work. Therefore, how to identify iron ore quickly and accurately is an important task. Traditional chemical, physical and manual identification methods have the disadvantages of high costs and high time consumption. This research proposes a new iron ore identification method, that combines deep learning with visible-infrared reflectance spectroscopy to establish an iron ore classification model. We collected iron ore samples from the Anshan iron ore area and measured the spectral data with a spectrometer. Then, a deep neural network framework is proposed based on the convolution neural network and the improved extreme learning machine algorithm, and an iron ore classification model is established based on the framework. The results show that the proposed model can effectively identify the types of iron ore, and the overall accuracy reaches 98.11%.
Collapse
Affiliation(s)
- Dong Xiao
- College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China; Key Laboratory of Intelligent Diagnosis and Safety for Metallurgical Industry, Liaoning Province, Northeastern University, Shenyang 110819, China
| | - Ba Tuan Le
- Institute of Research and Development, Duy Tan University, Da Nang 550000, Viet Nam.
| | | |
Collapse
|
4
|
Liu J, Jin S, Bao C, Sun Y, Li W. Rapid determination of lignocellulose in corn stover based on near-infrared reflectance spectroscopy and chemometrics methods. BIORESOURCE TECHNOLOGY 2021; 321:124449. [PMID: 33285506 DOI: 10.1016/j.biortech.2020.124449] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Revised: 11/22/2020] [Accepted: 11/23/2020] [Indexed: 06/12/2023]
Abstract
In this study, a rapid detection method based on near-infrared reflectance spectroscopy was proposed for measuring the contents of cellulose, hemicellulose and lignin in corn stover. In the basis of strategies of variable selection, feature extraction and nonlinear modeling, BiPLS-PCA-SVM was constructed using backward interval partial least squares combined with principal component analysis and support vector machine, which was used to improve the performance of spectral regression calibration model. For BiPLS-PCA-SVM model, the determination coefficients, root mean squared error and residual predictive deviation for the validation set were 0.906, 0.900% and 3.213 for cellulose; 0.987, 0.797% and 9.071 for hemicellulose; and 0.936, 0.264% and 4.024 for lignin, correspondingly. The results indicate that near-infrared reflectance spectroscopy combined with BiPLS-PCA-SVM can provide a reliable alternative strategy to detect contents of lignocellulosic components for pretreated corn stover in the anaerobic digestion process.
Collapse
Affiliation(s)
- Jinming Liu
- College of Information and Electrical Engineering, Heilongjiang Bayi Agricultural University, Daqing 163319, PR China
| | - Shuo Jin
- College of Information and Electrical Engineering, Heilongjiang Bayi Agricultural University, Daqing 163319, PR China
| | - Changhao Bao
- College of Information and Electrical Engineering, Heilongjiang Bayi Agricultural University, Daqing 163319, PR China
| | - Yong Sun
- College of Engineering, Northeast Agricultural University, Harbin 150030, PR China.
| | - Wenzhe Li
- College of Engineering, Northeast Agricultural University, Harbin 150030, PR China
| |
Collapse
|
5
|
Yang B, Chen C, Cheng C, Cheng H, Yan Z, Chen F, Zhu Z, Zhang H, Yue F, Lv X. Detection of breast cancer of various clinical stages based on serum FT-IR spectroscopy combined with multiple algorithms. Photodiagnosis Photodyn Ther 2021; 33:102199. [PMID: 33515764 DOI: 10.1016/j.pdpdt.2021.102199] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Revised: 01/20/2021] [Accepted: 01/22/2021] [Indexed: 12/15/2022]
Abstract
BACKGROUND Breast cancer screening is time consuming, requires expensive equipment, and has demanding requirements for doctors. Hence, a large number of breast cancer patients may miss screening and early treatment, which greatly threatens their health around the world. Infrared spectroscopy may be able to be used as a screening tool for breast cancer detection. Fourier transform infrared (FT-IR) spectroscopy of serum was combined with traditional machine learning algorithms to achieve an auxiliary diagnosis that could quickly and accurately distinguish patients with different stages of breast cancer, including stage 1 disease, from control subjects without breast cancer. MATERIALS AND METHODS FT-IR spectroscopy were performed on the serum of 114 non-cancer control subjects, 35 patients with stage I, 43 patients with stage II, and 29 patients with stage III & IV breast cancer. Due to the experimental sample imbalance, we used the oversampling to process the four classes of sample. The oversampling selected Synthetic Minority Oversampling Technique (SMOTE). Subsequently, we used the random discarding method in undersampling to do experiments as well. The average FT-IR spectroscopy results for the four groups showed differences in phospholipids, nucleic acids, lipids, and proteins between non-cancer control subjects and breast cancer patients at different stages. Based on these differences, four classification models were used to classify stage I, II, III & IV breast cancer patients and non-cancer control subjects. First, standard normal variate transformation (SNV) was used to preprocess the original data, and then partial least squares (PLS) was used for feature extraction. Finally, the five models were established including extreme learning machine (ELM), k-nearest neighbor (KNN), genetic algorithms based on support vector machine (GA-SVM), particle swarm optimization-support vector machine (PSO-SVM) and grid search-support vector machine (GS-SVM). CONCLUSION In oversampling experiment, the GS-SVM classifier obtained the highest average classification accuracy of 95.45 %; the diagnostic accuracy of non-cancer control subjects was 100 %; breast cancer stage I was 90 %; breast cancer stage II was 84.62 %; and breast cancer stage III & IV was 100 %. In undersampling experiment, the GA-SVM model obtained the highest average classification accuracy of 100 %; the diagnostic accuracy of non-cancer control subjects was 100 %; breast cancer stage I was 100 %; breast cancer stage II was 100 %; and breast cancer stage III & IV was 100 %. The results show that FT-IR spectroscopy combined with powerful classification algorithms has great potential in distinguishing patients with different stages of breast cancer from non-cancer control subjects. In addition, this research provides a reference for future multiclassification studies of cervical cancer, ovarian cancer and other female high-incidence cancers through serum FT-IR spectroscopy.
Collapse
Affiliation(s)
- Bo Yang
- College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
| | - Chen Chen
- College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
| | - Chen Cheng
- College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China.
| | - Hong Cheng
- The Affiliated Tumor Hospital of Xinjiang Medical University, Urumqi 830000, China
| | - Ziwei Yan
- College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
| | - Fangfang Chen
- College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
| | - Zhimin Zhu
- College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
| | - Huiting Zhang
- College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
| | - Feilong Yue
- College of Software, Xinjiang University, Urumqi 830046, China
| | - Xiaoyi Lv
- College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China; College of Software, Xinjiang University, Urumqi 830046, China; Key Laboratory of Signal Detection and Processing, Xinjiang University, Urumqi 830046, China.
| |
Collapse
|