1
|
Liu T, Zhang H, Wu J, Liu W, Fang Y. Wastewater treatment process enhancement based on multi-objective optimization and interpretable machine learning. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 364:121430. [PMID: 38875983 DOI: 10.1016/j.jenvman.2024.121430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 04/22/2024] [Accepted: 06/07/2024] [Indexed: 06/16/2024]
Abstract
Optimization and control of wastewater treatment process (WTP) can contribute to cost reduction and efficiency. A wastewater treatment process multi-objective optimization (WTPMO) framework is proposed in this paper to provide suggestions for decision-making in setting parameters of WTP. Firstly, the prediction models based on Extreme Gradient Boosting (XGB) with Bayesian optimization (BO) are developed for predicting effluent water quality (EQ) and energy consumption (EC) for different influent quality and process parameter settings. Then, the SHapley Additive exPlanations (SHAP) algorithm is used to complement the interpretability of machine learning to quantitatively evaluate the impact of different features on the predicted targets. Finally, the Non-dominated Sorting Genetic Algorithm II (NSGA-II) with the Technique for Ordering Preferences on Similarity of Ideal Solutions (TOPSIS) is introduced to solve and make decisions on the multi-objective optimization problem. The WTPMO applicability is validated on Benchmark Simulation Model 1 (BSM1). The results show that BOXGB achieves accurate prediction for EQ and EC with R2 values of 0.923 and 0.965, respectively, indicating that BO can effectively select the model hyperparameters in XGB. Based on SHAP supplemented the interpretability of the model to fully explain how the influent water quality and decision variables affect the EQ and EC of the WTP. In addition, the optimized process parameters are determined based on NSGA-II and TOPSIS, and the EC optimization rate is 1.552% while guaranteeing water quality compliance. Overall, this research can effectively achieve the optimization of WTP, ensure that the effluent water quality meets the standards while reducing energy consumption, assist Wastewater treatment plants (WWTPs) to achieve more intelligent and efficient operation and maintenance management, and provide strong support for environmental protection and sustainable development goals.
Collapse
Affiliation(s)
- Tianxiang Liu
- National Center of Technology Innovation for Digital Construction, Huazhong University of Science & Technology, Wuhan, Hubei, 430074, China; School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Heng Zhang
- National Center of Technology Innovation for Digital Construction, Huazhong University of Science & Technology, Wuhan, Hubei, 430074, China; School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Junhao Wu
- National Center of Technology Innovation for Digital Construction, Huazhong University of Science & Technology, Wuhan, Hubei, 430074, China; School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Wenli Liu
- National Center of Technology Innovation for Digital Construction, Huazhong University of Science & Technology, Wuhan, Hubei, 430074, China; School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.
| | - Yihai Fang
- Department of Civil Engineering, Monash University, Clayton, 3800, Victoria, Australia
| |
Collapse
|
2
|
Yang J, Hu X, Feng L, Liu Z, Murtazt A, Qin W, Zhou M, Liu J, Bi Y, Qian J, Zhang W. AI-Enabled Portable E-Nose Regression Predicting Harmful Molecules in a Gas Mixture. ACS Sens 2024; 9:2925-2934. [PMID: 38836922 DOI: 10.1021/acssensors.4c00050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2024]
Abstract
The biomimetic electronic nose (e-nose) technology is a novel technology used for the identification and monitoring of complex gas molecules, and it is gaining significance in this field. However, due to the complexity and multiplicity of gas mixtures, the accuracy of electronic noses in predicting gas concentrations using traditional regression algorithms is not ideal. This paper presents a solution to the difficulty by introducing a fusion network model that utilizes a transformer-based multikernel feature fusion (TMKFF) module combined with a 1DCNN_LSTM network to enhance the accuracy of regression prediction for gas mixture concentrations using a portable electronic nose. The experimental findings demonstrate that the regression prediction performance of the fusion network is significantly superior to that of single models such as convolutional neural network (CNN) and long short-term memory (LSTM). The present study demonstrates the efficacy of our fusion network model in accurately predicting the concentrations of multiple target gases, such as SO2, NO2, and CO, in a gas mixture. Specifically, our algorithm exhibits substantial benefits in enhancing the prediction performance of low-concentration SO2 gas, which is a noteworthy achievement. The determination coefficient (R2) values of 93, 98, and 99% correspondingly demonstrate that the model is very capable of explaining the variation in the concentration of the target gases. The root-mean-square errors (RMSE) are 0.0760, 0.0711, and 3.3825, respectively, while the mean absolute errors (MAE) are 0.0507, 0.0549, and 2.5874, respectively. These results indicate that the model has relatively small prediction errors. The method we have developed holds significant potential for practical applications in detecting atmospheric pollution detection and other molecular detection areas in complex environments.
Collapse
Affiliation(s)
- Jilei Yang
- Anhui Province Key Laboratory of Measuring Theory and Precision Instrument, School of Instrument Science and Optoelectronics Engineering, Hefei University of Technology, Hefei 230009, China
| | - Xuefeng Hu
- Anhui Province Key Laboratory of Measuring Theory and Precision Instrument, School of Instrument Science and Optoelectronics Engineering, Hefei University of Technology, Hefei 230009, China
| | - Lihang Feng
- College of Electrical Engineering and Control Science, Nanjing Tech University, Nanjing 210009, China
- Anhui Six-Dimensional Sensor Technology Ltd., Fuyang, Anhui 232100, China
| | - Zhiyuan Liu
- Anhui Province Key Laboratory of Measuring Theory and Precision Instrument, School of Instrument Science and Optoelectronics Engineering, Hefei University of Technology, Hefei 230009, China
| | - Adil Murtazt
- School of Physics and Optoelectronic Engineering, Guangdong University of Technology, Guangzhou, Guangdong 510006, China
| | - Weiwei Qin
- Anhui Province Key Laboratory of Measuring Theory and Precision Instrument, School of Instrument Science and Optoelectronics Engineering, Hefei University of Technology, Hefei 230009, China
| | - Ming Zhou
- Anhui Province Key Laboratory of Measuring Theory and Precision Instrument, School of Instrument Science and Optoelectronics Engineering, Hefei University of Technology, Hefei 230009, China
| | - Jiaming Liu
- Anhui Province Key Laboratory of Measuring Theory and Precision Instrument, School of Instrument Science and Optoelectronics Engineering, Hefei University of Technology, Hefei 230009, China
| | - Yali Bi
- Anhui Province Key Laboratory of Measuring Theory and Precision Instrument, School of Instrument Science and Optoelectronics Engineering, Hefei University of Technology, Hefei 230009, China
| | - Jingui Qian
- Anhui Province Key Laboratory of Measuring Theory and Precision Instrument, School of Instrument Science and Optoelectronics Engineering, Hefei University of Technology, Hefei 230009, China
| | - Wei Zhang
- College of Integrated Circuit Science and Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| |
Collapse
|
3
|
Sun Z, Wang Z, Qi X, Wang D, Gu X, Wang J, Lu H, Chen Y. Understanding key contributing factors on the severity of traffic violations by elderly drivers: a hybrid approach of latent class analysis and XGBoost based SHAP. Int J Inj Contr Saf Promot 2024; 31:273-293. [PMID: 38284989 DOI: 10.1080/17457300.2023.2300479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 12/24/2023] [Indexed: 01/30/2024]
Abstract
Traffic violation is one of the leading causes of traffic crashes. In the context of global aging, it is important to study traffic violations by elderly drivers for improving traffic safety in preparation for a worldwide aging population. In this study, a hybrid approach of Latent Class Analysis (LCA) and XGBoost based SHAP is proposed to identify hidden clusters and to understand the key contributing factors on the severity of traffic violations by elderly drivers, based on the police-reported traffic violation dataset of Beijing (China). First, LCA is applied to segment the dataset into several latent homogeneous clusters, then XGBoost based SHAP is established on each cluster to identify feature contributions and the interaction effects of the key contributing factors on the severity of traffic violations by elderly drivers. Two comparison groups were set up to analyze factors, which are responsible for the different severities of traffic violations. The results show that elderly drivers can be classified into four groups by age, urban or not, license, and season; factors such as less annual number of traffic violations, national & provincial highway, night and winter are key contributing factors for higher severity of traffic violations, which are consistent with common cognition; key contributing factors for all clusters are similar but not identical, for example, more annual number of traffic violations contribute to more severe violation for all clusters except for Cluster 2; some factors which are not key contributing factors may affect the severity of traffic violations when they are combined with other factors, for example, the combination of lower annual number of traffic violations and county & township highway contributes to more severe violation for Cluster 1. These findings can help government to formulate targeted countermeasures to decrease the severity of traffic violations by specific elderly groups and improve road service for the driving population.
Collapse
Affiliation(s)
- Zhiyuan Sun
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing, China
| | - Zhicheng Wang
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing, China
| | - Xin Qi
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing, China
| | - Duo Wang
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing, China
| | - Xin Gu
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing, China
| | - Jianyu Wang
- Beijing Key Laboratory of General Aviation Technology, Beijing University of Civil Engineering and Architecture, Beijing, China
| | - Huapu Lu
- Institute of Transportation Engineering, Tsinghua University, Beijing, China
| | - Yanyan Chen
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing, China
| |
Collapse
|
4
|
Yao S, Wu Q, Kang Q, Chen YW, Lu Y. An interpretable XGBoost-based approach for Arctic navigation risk assessment. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2024; 44:459-476. [PMID: 37330273 DOI: 10.1111/risa.14175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Revised: 03/14/2023] [Accepted: 05/07/2023] [Indexed: 06/19/2023]
Abstract
The Northern Sea Route (NSR) makes travel between Europe and Asia shorter and quicker than a southern transit via the Strait of Malacca and Suez Canal. It provides greater access to Arctic resources such as oil and gas. As global warming accelerates, melting Arctic ice caps are likely to increase traffic in the NSR and enhance its commercial viability. Due to the harsh Arctic environment imposing threats to the safety of ship navigation, it is necessary to assess Arctic navigation risk to maintain shipping safety. Currently, most studies are focused on the conventional assessment of the risk, which lacks the validation based on actual data. In this study, actual data about Arctic navigation environment and related expert judgments were used to generate a structured data set. Based on the structured data set, extreme gradient boosting (XGBoost) and alternative methods were used to establish models for the assessment of Arctic navigation risk, which were validated using cross-validation. The results show that compared with alternative models, XGBoost models have the best performance in terms of mean absolute errors and root mean squared errors. The XGBoost models can learn and reproduce expert judgments and knowledge for the assessment of Arctic navigation risk. Feature importance (FI) and shapley additive explanations (SHAP) are used to further interpret the relationship between input data and predictions. The application of XGBoost, FI, and SHAP is aimed to improve the safety of Arctic shipping using advanced artificial intelligence techniques. The validated assessment enhances the quality and robustness of assessment.
Collapse
Affiliation(s)
- Shuaiyu Yao
- Department of Control Science and Engineering, Tongji University, Shanghai, China
| | - Qinhao Wu
- Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, The Netherlands
| | - Qi Kang
- Department of Control Science and Engineering, Tongji University, Shanghai, China
| | - Yu-Wang Chen
- Alliance Manchester Business School (AMBS), The University of Manchester, Manchester, UK
| | - Yi Lu
- COSCO Shipping Special Transportation Co., Ltd, Guangzhou, China
| |
Collapse
|
5
|
Saberi S, Nasiri H, Ghorbani O, Friswell MI, Castro SGP. Explainable Artificial Intelligence to Investigate the Contribution of Design Variables to the Static Characteristics of Bistable Composite Laminates. MATERIALS (BASEL, SWITZERLAND) 2023; 16:5381. [PMID: 37570085 PMCID: PMC10419828 DOI: 10.3390/ma16155381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 07/12/2023] [Accepted: 07/26/2023] [Indexed: 08/13/2023]
Abstract
Material properties, geometrical dimensions, and environmental conditions can greatly influence the characteristics of bistable composite laminates. In the current work, to understand how each input feature contributes to the curvatures of the stable equilibrium shapes of bistable laminates and the snap-through force to change these configurations, the correlation between these inputs and outputs is studied using a novel explainable artificial intelligence (XAI) approach called SHapley Additive exPlanations (SHAP). SHAP is employed to explain the contribution and importance of the features influencing the curvatures and the snap-through force since XAI models change the data into a form that is more convenient for users to understand and interpret. The principle of minimum energy and the Rayleigh-Ritz method is applied to obtain the responses of the bistable laminates used as the input datasets in SHAP. SHAP effectively evaluates the importance of the input variables to the parameters. The results show that the transverse thermal expansion coefficient and moisture variation have the most impact on the model's output for the transverse curvatures and snap-through force. The eXtreme Gradient Boosting (XGBoost) and Finite Element (FM) methods are also employed to identify the feature importance and validate the theoretical approach, respectively.
Collapse
Affiliation(s)
- Saeid Saberi
- Department of Mechanical Engineering, Isfahan University of Technology, Isfahan 84156-83111, Iran;
| | - Hamid Nasiri
- Department of Computer Engineering, Amirkabir University of Technology (Tehran Polytechnic), Tehran 159163-4311, Iran;
| | - Omid Ghorbani
- Department of Engineering, Kharazmi University, Tehran 15719-14911, Iran;
| | | | - Saullo G. P. Castro
- Department of Aerospace Structures and Materials, Delft University of Technology, Kluyverweg 1, 2629HS Delft, The Netherlands
| |
Collapse
|
6
|
Yin J, Zhao Y, Peng Z, Ba F, Peng P, Liu X, Rong Q, Guo Y, Zhang Y. Rapid Identification Method for CH 4/CO/CH 4-CO Gas Mixtures Based on Electronic Nose. SENSORS (BASEL, SWITZERLAND) 2023; 23:2975. [PMID: 36991686 PMCID: PMC10058281 DOI: 10.3390/s23062975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 03/05/2023] [Accepted: 03/08/2023] [Indexed: 06/19/2023]
Abstract
The inherent cross-sensitivity of semiconductor gas sensors makes them extremely challenging to accurately detect mixed gases. In order to solve this problem, this paper designed an electronic nose (E-nose) with seven gas sensors and proposed a rapid method for identifying CH4, CO, and their mixtures. Most reported methods for E-nose were based on analyzing the entire response process and employing complex algorithms, such as neural network, which result in long time-consuming processes for gas detection and identification. To overcome these shortcomings, this paper firstly proposes a way to shorten the gas detection time by analyzing only the start stage of the E-nose response instead of the entire response process. Subsequently, two polynomial fitting methods for extracting gas features are designed according to the characteristics of the E-nose response curves. Finally, in order to shorten the time consumption of calculation and reduce the complexity of the identification model, linear discriminant analysis (LDA) is introduced to reduce the dimensionality of the extracted feature datasets, and an XGBoost-based gas identification model is trained using the LDA optimized feature datasets. The experimental results show that the proposed method can shorten the gas detection time, obtain sufficient gas features, and achieve nearly 100% identification accuracy for CH4, CO, and their mixed gases.
Collapse
Affiliation(s)
- Jianxin Yin
- School of Mechanical and Automotive Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
| | - Yongli Zhao
- School of Mechanical and Automotive Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
| | - Zhi Peng
- School of Mechanical and Automotive Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
| | - Fushuai Ba
- School of Mechanical and Automotive Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
| | - Peng Peng
- School of Mechanical and Automotive Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
| | - Xiaolong Liu
- School of Materials, Sun Yat-sen University, Shenzhen 518107, China
| | - Qian Rong
- School of Materials, Sun Yat-sen University, Shenzhen 518107, China
| | - Youmin Guo
- School and Materials Science and Technology, Anhui University, Hefei 230601, China
| | - Yafei Zhang
- School of Mechanical and Automotive Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
| |
Collapse
|
7
|
Huo D, Zhang J, Dai X, Zhang P, Zhang S, Yang X, Wang J, Liu M, Sun X, Chen H. A Bio-Inspired Spiking Neural Network with Few-Shot Class-Incremental Learning for Gas Recognition. SENSORS (BASEL, SWITZERLAND) 2023; 23:2433. [PMID: 36904636 PMCID: PMC10006916 DOI: 10.3390/s23052433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Revised: 02/17/2023] [Accepted: 02/20/2023] [Indexed: 06/18/2023]
Abstract
The sensitivity and selectivity profiles of gas sensors are always changed by sensor drifting, sensor aging, and the surroundings (e.g., temperature and humidity changes), which lead to a serious decline in gas recognition accuracy or even invalidation. To address this issue, the practical solution is to retrain the network to maintain performance, leveraging its rapid, incremental online learning capacity. In this paper, we develop a bio-inspired spiking neural network (SNN) to recognize nine types of flammable and toxic gases, which supports few-shot class-incremental learning, and can be retrained quickly with a new gas at a low accuracy cost. Compared with gas recognition approaches such as support vector machine (SVM), k-nearest neighbor (KNN), principal component analysis (PCA) +SVM, PCA+KNN, and artificial neural network (ANN), our network achieves the highest accuracy of 98.75% in five-fold cross-validation for identifying nine types of gases, each with five different concentrations. In particular, the proposed network has a 5.09% higher accuracy than that of other gas recognition algorithms, which validates its robustness and effectiveness for real-life fire scenarios.
Collapse
Affiliation(s)
- Dexuan Huo
- School of Integrated Circuits, Tsinghua University, Beijing 100084, China
| | - Jilin Zhang
- School of Integrated Circuits, Tsinghua University, Beijing 100084, China
| | - Xinyu Dai
- School of Integrated Circuits, Tsinghua University, Beijing 100084, China
| | - Pingping Zhang
- Suzhou Huiwen Nanotechnology Co., Ltd., Suzhou 215004, China
| | - Shumin Zhang
- Suzhou Huiwen Nanotechnology Co., Ltd., Suzhou 215004, China
| | - Xiao Yang
- Suzhou Huiwen Nanotechnology Co., Ltd., Suzhou 215004, China
| | - Jiachuang Wang
- State Key Laboratory of Transducer Technology, Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, Shanghai 200050, China
| | - Mengwei Liu
- State Key Laboratory of Transducer Technology, Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, Shanghai 200050, China
| | - Xuhui Sun
- Suzhou Huiwen Nanotechnology Co., Ltd., Suzhou 215004, China
| | - Hong Chen
- School of Integrated Circuits, Tsinghua University, Beijing 100084, China
| |
Collapse
|
8
|
Zhou F, Du X, Li W, Lu Z, Wu J. NIDD: an intelligent network intrusion detection model for nursing homes. JOURNAL OF CLOUD COMPUTING 2022. [DOI: 10.1186/s13677-022-00361-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
AbstractIn nursing homes using technologies such as IoT, big data, cloud computing, and machine learning, there is a constant risk of attacks such as Brute Force FTP, Brute Force SSH, Web Attack, Infiltration, and Botnet during data communication between individual terminals and the cloud server. Therefore, effectively identifying network communication data is essential to protect data communication security between individual terminals and the cloud server. Aiming at the data mentioned above regarding communication security issues, we propose an intelligent intrusion detection model NIDD (Network Intelligent Data Detection) model that combines deep convolution generation adversarial network (DCGAN) with Light Gradient Boosting Machine (LightGBM) and Shapley Additive exPlanations (SHAP). The NIDD model first generates new attack samples by learning the feature distribution of the existing attack sample data and effectively expands the rare attack samples. Secondly, we use the Light Gradient Boosting Machine (LightGBM) algorithm as the base classifier to train the dataset and start to build the intrusion detection model. Then use Shapley Additive exPlanations (SHAP) to analyze the contribution of the classification results, and adjust the model parameters according to the analysis results. Finally, we obtain the optimal model for the intelligent detection model of network intrusion. This paper conducts experimental tests on the NSL-KDD dataset. The experimental results show that the NIDD model built based on Light Gradient Boosting Machine can detect Brute Force FTP, Brute Force SSH, DoS, Heartbleed, Web Attack, Infiltration, Botnet, PROBE, R2L, and U2R attacks with an accuracy of 99.76%. Finally, we re-verified the NIDD model on the CIC-IDC-2018 dataset. The results once again proved that the NIDD model could solve the data communication security between the nursing robot and the cloud server and the data before the IoT terminal and the cloud server. Communication security provides a sufficient guarantee.
Collapse
|
9
|
High-Frequency Direction Forecasting of the Futures Market Using a Machine-Learning-Based Method. FUTURE INTERNET 2022. [DOI: 10.3390/fi14060180] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Futures price-movement-direction forecasting has always been a significant and challenging subject in the financial market. In this paper, we propose a combination approach that integrates the XGBoost (eXtreme Gradient Boosting), SMOTE (Synthetic Minority Oversampling Technique), and NSGA-II (Non-dominated Sorting Genetic Algorithm-II) methods. We applied the proposed approach on the direction prediction and simulation trading of rebar futures, which are traded on the Shanghai Futures Exchange. Firstly, the minority classes of the high-frequency rebar futures price change magnitudes are oversampled using the SMOTE algorithm to overcome the imbalance problem of the class data. Then, XGBoost is adopted to construct a multiclassification model for the price-movement-direction prediction. Next, the proposed approach employs NSGA-II to optimize the parameters of the pre-designed trading rule for trading simulation. Finally, the price-movement direction is predicted, and we conducted the high-frequency trading based on the optimized XGBoost model and the trading rule, with the classification and trading performances empirically evaluated by four metrics over four testing periods. Meanwhile, the LIME (Local Interpretable Model-agnostic Explanations) is applied as a model explanation approach to quantify the prediction contributions of features to the forecasting samples. From the experimental results, we found that the proposed approach performed best in terms of direction prediction accuracy, profitability, and return–risk ratio. The proposed approach could be beneficial for decision-making of the rebar traders and related companies engaged in rebar futures trading.
Collapse
|
10
|
XGBoost-DNN Mixed Model for Predicting Driver’s Estimation on the Relative Motion States during Lane-Changing Decisions: A Real Driving Study on the Highway. SUSTAINABILITY 2022. [DOI: 10.3390/su14116829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
This study is conducted on a real live highway to investigate the driver’s performance in estimating the speed and distance of vehicles behind the target lane during lane changes. Data on the participants’ estimated and actual data on the rear car were collected in the experiment. Ridge regression is used to analyze the effects of both the driver’s features, as well as the relative and absolute motion characteristics between the target vehicle and the subject vehicle, on the driver’s estimation outcomes. Finally, a mixed algorithm of extreme gradient boosting (XGBoost) and deep neural network (DNN) was proposed in this paper for establishing driver’s speed estimation and distance prediction models. Compared with other machine learning models, the XGBoost-DNN prediction model performs more accurate prediction performance in both classification scenarios. It is worth mentioning that the XGBoost-DNN mixed model exhibits a prediction accuracy approximately two percentage points higher than that of the XGBoost model. In the two-classification scenarios, the accuracy estimations of XGBoost-DNN speed and distance prediction models are 91.03% and 92.46%, respectively. In the three-classification scenarios, the accuracy estimations of XGBoost-DNN speed and distance prediction models are 87.18% and 87.59%, respectively. This study can provide a theoretical basis for the development of warning rules for lane-change warning systems as well as insights for understanding lane-change decision failures.
Collapse
|
11
|
Automated detection of COVID-19 cases from chest X-ray images using deep neural network and XGBoost. Radiography (Lond) 2022; 28:732-738. [PMID: 35410707 PMCID: PMC8958100 DOI: 10.1016/j.radi.2022.03.011] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2021] [Revised: 02/25/2022] [Accepted: 03/21/2022] [Indexed: 11/20/2022]
Abstract
Introduction In late 2019 and after the COVID-19 pandemic in the world, many researchers and scholars tried to provide methods for detecting COVID-19 cases. Accordingly, this study focused on identifying patients with COVID-19 from chest X-ray images. Methods In this paper, a method for diagnosing coronavirus disease from X-ray images was developed. In this method, DenseNet169 Deep Neural Network (DNN) was used to extract the features of X-ray images taken from the patients’ chests. The extracted features were then given as input to the Extreme Gradient Boosting (XGBoost) algorithm to perform the classification task. Results Evaluation of the proposed approach and its comparison with the methods presented in recent years revealed that this method was more accurate and faster than the existing ones and had an acceptable performance for detecting COVID-19 cases from X-ray images. The experiments showed 98.23% and 89.70% accuracy, 99.78% and 100% specificity, 92.08% and 95.20% sensitivity in two and three-class problems, respectively. Conclusion This study aimed to detect people with COVID-19, focusing on non-clinical approaches. The developed method could be employed as an initial detection tool to assist the radiologists in more accurate and faster diagnosing the disease. Implication for practice The proposed method's simple implementation, along with its acceptable accuracy, allows it to be used in COVID-19 diagnosis. Moreover, the gradient-based class activation mapping (Grad-CAM) can be used to represent the deep neural network's decision area on a heatmap. Radiologists might use this heatmap to evaluate the chest area more accurately.
Collapse
|
12
|
Chehreh Chelgani S, Nasiri H, Tohry A. Modeling of particle sizes for industrial HPGR products by a unique explainable AI tool- A “Conscious Lab” development. ADV POWDER TECHNOL 2021. [DOI: 10.1016/j.apt.2021.09.020] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
13
|
Shi R, Xu X, Li J, Li Y. Prediction and analysis of train arrival delay based on XGBoost and Bayesian optimization. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107538] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
14
|
Extracting Information on Rocky Desertification from Satellite Images: A Comparative Study. REMOTE SENSING 2021. [DOI: 10.3390/rs13132497] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Rocky desertification occurs in many karst terrains of the world and poses major challenges for regional sustainable development. Remotely sensed data can provide important information on rocky desertification. In this study, three common open-access satellite image datasets (Sentinel-2B, Landsat-8, and Gaofen-6) were used for extracting information on rocky desertification in a typical karst region (Guangnan County, Yunnan) of southwest China, using three machine-learning algorithms implemented in the Python programming language: random forest (RF), bagged decision tree (BDT), and extremely randomized trees (ERT). Comparative analyses of the three data sources and three algorithms show that: (1) The Sentinel-2B image has the best capability for extracting rocky desertification information, with an overall accuracy (OA) of 85.21% using the ERT method. This can be attributed to the higher spatial resolution of the Sentinel-2B image than that of Landsat-8 and Gaofen-6 images and Gaofen-6’s lack of the shortwave infrared (SWIR) bands suitable for mapping carbonate rocks. (2) The ERT method has the best classification results of rocky desertification. Compared with the RF and BDT methods, the ERT method has stronger randomness in modeling and can effectively identify important feature factors for extracting information on rocky desertification. (3) The combination of the Sentinel-2B images and the ERT method provides an effective, efficient, and free approach to information extraction for mapping rocky desertification. The study can provide a useful reference for effective mapping of rocky desertification in similar karst environments of the world, in terms of both satellite image sources and classification algorithms. It also provides important information on the total area and spatial distribution of different levels of rocky desertification in the study area to support decision making by local governments for sustainable development.
Collapse
|
15
|
Research on a Gas Concentration Prediction Algorithm Based on Stacking. SENSORS 2021; 21:s21051597. [PMID: 33668797 PMCID: PMC7956455 DOI: 10.3390/s21051597] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 02/14/2021] [Accepted: 02/20/2021] [Indexed: 11/17/2022]
Abstract
Machine learning algorithms play an important role in the detection of toxic, flammable and explosive gases, and they are extremely important for the study of mixed gas classification and concentration prediction methods. To solve the problem of low prediction accuracy of gas concentration regression prediction algorithms, a gas concentration prediction algorithm based on a stacking model is proposed in the current research. In this paper, the stochastic forest, extreme random regression tree and gradient boosting decision tree (GBDT) regression algorithms are selected as the base learning devices and use the stacking algorithm to take the output of each base learning device as input to train a new model to produce a final output. Through the stacking model, the grid search algorithm is studied to automatically optimize the parameters so that the performance of the entire system can reach the optimal parameters. Through experimental simulation, the gas concentration prediction algorithm based on stacking model has better prediction effect than other integrated frame algorithms and the accuracy of mixed gas concentration prediction is improved.
Collapse
|
16
|
Jang KW, Choi JH, Jeon JH, Kim HS. Combustible Gas Classification Modeling using Support Vector Machine and Pairing Plot Scheme. SENSORS 2019; 19:s19225018. [PMID: 31744238 PMCID: PMC6891470 DOI: 10.3390/s19225018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 11/14/2019] [Accepted: 11/15/2019] [Indexed: 11/16/2022]
Abstract
Combustible gases, such as CH4 and CO, directly or indirectly affect the human body. Thus, leakage detection of combustible gases is essential for various industrial sites and daily life. Many types of gas sensors are used to identify these combustible gases, but since gas sensors generally have low selectivity among gases, coupling issues often arise which adversely affect gas detection accuracy. To solve this problem, we built a decoupling algorithm with different gas sensors using a machine learning algorithm. Commercially available semiconductor sensors were employed to detect CH4 and CO, and then support vector machine (SVM) applied as a supervised learning algorithm for gas classification. We also introduced a pairing plot scheme to more effectively classify gas type. The proposed model classified CH4 and CO gases 100% correctly at all levels above the minimum concentration the gas sensors could detect. Consequently, SVM with pairing plot is a memory efficient and promising method for more accurate gas classification.
Collapse
|