1
|
Hadavimoghaddam F, Rozhenko A, Mohammadi MR, Mostajeran Gortani M, Pourafshary P, Hemmati-Sarapardeh A. Modeling crude oil pyrolysis process using advanced white-box and black-box machine learning techniques. Sci Rep 2023; 13:22649. [PMID: 38114589 PMCID: PMC10730853 DOI: 10.1038/s41598-023-49349-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 12/07/2023] [Indexed: 12/21/2023] Open
Abstract
Accurate prediction of fuel deposition during crude oil pyrolysis is pivotal for sustaining the combustion front and ensuring the effectiveness of in-situ combustion enhanced oil recovery (ISC EOR). Employing 2071 experimental TGA datasets from 13 diverse crude oil samples extracted from the literature, this study sought to precisely model crude oil pyrolysis. A suite of robust machine learning techniques, encompassing three black-box approaches (Categorical Gradient Boosting-CatBoost, Gaussian Process Regression-GPR, Extreme Gradient Boosting-XGBoost), and a white-box approach (Genetic Programming-GP), was employed to estimate crude oil residue at varying temperature intervals during TGA runs. Notably, the XGBoost model emerged as the most accurate, boasting a mean absolute percentage error (MAPE) of 0.7796% and a determination coefficient (R2) of 0.9999. Subsequently, the GPR, CatBoost, and GP models demonstrated commendable performance. The GP model, while displaying slightly higher error in comparison to the black-box models, yielded acceptable results and proved suitable for swift estimation of crude oil residue during pyrolysis. Furthermore, a sensitivity analysis was conducted to reveal the varying influence of input parameters on residual crude oil during pyrolysis. Among the inputs, temperature and asphaltenes were identified as the most influential factors in the crude oil pyrolysis process. Higher temperatures and oil °API gravity were associated with a negative impact, leading to a decrease in fuel deposition. On the other hand, increased values of asphaltenes, resins, and heating rates showed a positive impact, resulting in an increase in fuel deposition. These findings underscore the importance of precise modeling for fuel deposition during crude oil pyrolysis, offering insights that can significantly benefit ISC EOR practices.
Collapse
Affiliation(s)
- Fahimeh Hadavimoghaddam
- Key Laboratory of Continental Shale Hydrocarbon Accumulation and Efficient Development, Ministry of Education, Northeast Petroleum University, Daqing, 163318, China
- Ufa State Petroleum Technological University, Ufa, 450064, Russia
| | - Alexei Rozhenko
- Plekhanov Russian University of Economics, Moscow, 117997, Russia
| | | | | | - Peyman Pourafshary
- School of Mining and Geosciences, Nazarbayev University, Astana, Kazakhstan
| | - Abdolhossein Hemmati-Sarapardeh
- Department of Petroleum Engineering, Shahid Bahonar University of Kerman, Kerman, Iran.
- State Key Laboratory of Petroleum Resources and Prospecting, China University of Petroleum (Beijing), Beijing, China.
| |
Collapse
|
2
|
Guan S, Wang Y, Liu L, Gao J, Xu Z, Kan S. Ultra-short-term wind power prediction method combining financial technology feature engineering and XGBoost algorithm. Heliyon 2023; 9:e16938. [PMID: 37484352 PMCID: PMC10361039 DOI: 10.1016/j.heliyon.2023.e16938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Revised: 05/30/2023] [Accepted: 06/01/2023] [Indexed: 07/25/2023] Open
Abstract
The input features of existing wind power time-series data prediction models are difficult to indicate the potential relationships between data, and the prediction methods are based on deep learning, which makes the convergence of the models slow and difficult to be applied to the actual production environment. To solve the above problems, an ultra-short-term wind power prediction model based on the XGBoost algorithm combined with financial technical index feature engineering and variational ant colony algorithm is proposed. The model innovatively applies financial technical indicators from financial time series data to wind power time series data and creates a class of model input features that can highly condense the potential relationships between time series data. A bionic algorithm is used to search for the best computational parameters for financial technical indicators to reduce the reliance on financial experts' experience. Taking the German power company Tennet wind power data set as an example, the prediction model proposed in this study has an mean absolute error of 0.859 and a root mean square error of 1.329, and it takes only 244 ms to complete the prediction. Thus, this study provides a new solution for ultra-short-term wind power prediction.
Collapse
Affiliation(s)
- Shijie Guan
- School of Data Science and Application, Inner Mongolia University of Technology, Hohhot 010080, China
- Software Service Engineering Technology Research Center, Inner Mongolia Autonomous Region, Hohhot 010080, China
| | - Yongsheng Wang
- School of Data Science and Application, Inner Mongolia University of Technology, Hohhot 010080, China
- Software Service Engineering Technology Research Center, Inner Mongolia Autonomous Region, Hohhot 010080, China
| | - Limin Liu
- School of Data Science and Application, Inner Mongolia University of Technology, Hohhot 010080, China
- Software Service Engineering Technology Research Center, Inner Mongolia Autonomous Region, Hohhot 010080, China
| | - Jing Gao
- School of Computer and Information, Inner Mongolia Agricultural University, Hohhot 010018, China
| | - Zhiwei Xu
- School of Data Science and Application, Inner Mongolia University of Technology, Hohhot 010080, China
- Software Service Engineering Technology Research Center, Inner Mongolia Autonomous Region, Hohhot 010080, China
| | - Sijia Kan
- School of Natural Sciences, The University of Manchester, Manchester, M13 9PL, UK
| |
Collapse
|
3
|
Kim H, Kim S. A study on frost prediction model using machine learning. KOREAN JOURNAL OF APPLIED STATISTICS 2022. [DOI: 10.5351/kjas.2022.35.4.543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Hyojeoung Kim
- Department of Applied Statistics, University of Chung-Ang
| | - Sahm Kim
- Department of Applied Statistics, University of Chung-Ang
| |
Collapse
|
4
|
Active power control strategy for wind farms based on power prediction errors distribution considering regional data. PLoS One 2022; 17:e0273257. [PMID: 36001548 PMCID: PMC9401127 DOI: 10.1371/journal.pone.0273257] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 08/02/2022] [Indexed: 11/20/2022] Open
Abstract
One of the renewable energy resources, wind energy is widely used due to its wide distribution, large reserves, green and clean energy, and it is also an important part of large-scale grid integration. However, wind power has strong randomness, volatility, anti-peaking characteristics, and the problem of low wind power prediction accuracy, which brings serious challenges to the power system. Based on the difference of power prediction error and confidence interval between different new energy power stations, an optimal control strategy for active power of wind farms was proposed. Therefore, we focus on solving the problem of wind power forecasting and improving the accuracy of wind power prediction. Due to the prediction error of wind power generation, the power control cannot meet the control target. An optimal control strategy for active power of wind farms is proposed based on the difference in power prediction error and confidence interval between different new energy power stations. The strategy used historical data to evaluate the prediction error distribution and confidence interval of wind power. We use confidence interval constraints to create a wind power active optimization model that realize active power distribution and complementary prediction errors among wind farms with asymmetric error distribution. Combined with the actual data of a domestic (Cox’s Bazar, Bangladesh) wind power base, a simulation example is designed to verify the rationality and effectiveness of the proposed strategy.
Collapse
|
5
|
Degradation Trend Prediction of Pumped Storage Unit Based on MIC-LGBM and VMD-GRU Combined Model. ENERGIES 2022. [DOI: 10.3390/en15020605] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The harsh operating environment aggravates the degradation of pumped storage units (PSUs). Degradation trend prediction (DTP) provides important support for the condition-based maintenance of PSUs. However, the complexity of the performance degradation index (PDI) sequence poses a severe challenge of the reliability of DTP. Additionally, the accuracy of healthy model is often ignored, resulting in an unconvincing PDI. To solve these problems, a combined DTP model that integrates the maximal information coefficient (MIC), light gradient boosting machine (LGBM), variational mode decomposition (VMD) and gated recurrent unit (GRU) is proposed. Firstly, MIC-LGBM is utilized to generate a high-precision healthy model. MIC is applied to select the working parameters with the most relevance, then the LGBM is utilized to construct the healthy model. Afterwards, a performance degradation index (PDI) is generated based on the LGBM healthy model and monitoring data. Finally, the VMD-GRU prediction model is designed to achieve precise DTP under the complex PDI sequence. The proposed model is verified by applying it to a PSU located in Zhejiang province, China. The results reveal that the proposed model achieves the highest precision healthy model and the best prediction performance compared with other comparative models. The absolute average (|AVG|) and standard deviation (STD) of fitting errors are reduced to 0.0275 and 0.9245, and the RMSE, MAE, and R2 are 0.00395, 0.0032, and 0.9226 respectively, on average for two operating conditions.
Collapse
|
6
|
Oeing J, Neuendorf LM, Bittorf L, Krieger W, Kockmann N. Flooding Prevention in Distillation and Extraction Columns with Aid of Machine Learning Approaches. CHEM-ING-TECH 2021. [DOI: 10.1002/cite.202100051] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Jonas Oeing
- TU Dortmund University Department of Biochemical and Chemical Engineering Laboratory of Equipment Design Emil-Figge-Straße 68 44227 Dortmund Germany
| | - Laura Maria Neuendorf
- TU Dortmund University Department of Biochemical and Chemical Engineering Laboratory of Equipment Design Emil-Figge-Straße 68 44227 Dortmund Germany
| | - Lukas Bittorf
- TU Dortmund University Department of Biochemical and Chemical Engineering Laboratory of Equipment Design Emil-Figge-Straße 68 44227 Dortmund Germany
| | - Waldemar Krieger
- TU Dortmund University Department of Biochemical and Chemical Engineering Laboratory of Equipment Design Emil-Figge-Straße 68 44227 Dortmund Germany
| | - Norbert Kockmann
- TU Dortmund University Department of Biochemical and Chemical Engineering Laboratory of Equipment Design Emil-Figge-Straße 68 44227 Dortmund Germany
| |
Collapse
|
7
|
A Novel Ultra-Short-Term PV Power Forecasting Method Based on DBN-Based Takagi-Sugeno Fuzzy Model. ENERGIES 2021. [DOI: 10.3390/en14206447] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Forecasting uncertainties limit the development of photovoltaic (PV) power generation. New forecasting technologies are urgently needed to improve the accuracy of power generation forecasting. In this paper, a novel ultra-short-term PV power forecasting method is proposed based on a deep belief network (DBN)-based Takagi-Sugeno (T-S) fuzzy model. Firstly, the correlation analysis is used to filter redundant information. Furthermore, a T-S fuzzy model, which integrates fuzzy c-means (FCM) for the fuzzy division of input variables and DBN for fuzzy subsets forecasting, is developed. Finally, the proposed method is compared to a benchmark DBN method and the T-S fuzzy model in case studies. The numerical results show the feasibility and flexibility of the proposed ultra-short-term PV power forecasting approach.
Collapse
|
8
|
Forecasting of 10-Second Power Demand of Highly Variable Loads for Microgrid Operation Control. ENERGIES 2021. [DOI: 10.3390/en14051290] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
This paper addresses very short-term (10 s) forecasting of power demand of highly variable loads. The main purpose of this study is to develop methods useful for this type of forecast. We have completed a comprehensive study using two different time series, which are very difficult to access in practice, of 10 s power demand characterized by big dynamics of load changes. This is an emerging and promising forecasting research topic, yet to be more widely recognized in the forecasting research community. This problem is particularly important in microgrids, i.e., small energy micro-systems. Power demand forecasting, like forecasting of renewable power generation, is of key importance, especially in island mode operation of microgrids. This is due to the necessity of ensuring reliable power supplies to consumers. Inaccurate very short-term forecasts can cause improper operation of microgrids or increase costs/decrease profits in the electricity market. This paper presents a detailed statistical analysis of data for two sample low voltage loads characterized by large variability, which are located in a sewage treatment plant. The experience of the authors of this paper is that very short-term forecasting is very difficult for such loads. Special attention has been paid to different forecasting methods, which can be applied to this type of forecast, and to the selection of explanatory variables in these methods. Some of the ensemble models (eight selected models belonging to the following classes of methods: random forest regression, gradient boosted trees, weighted averaging ensemble, machine learning) proposed in the scope of choice of methods sets constituting the models set are unique models developed by the authors of this study. The obtained forecasts are presented and analyzed in detail. Moreover, qualitative analysis of the forecasts obtained has been carried out. We analyze various measures of forecasts quality. We think that some of the presented forecasting methods are promising for practical applications, i.e., for microgrid operation control, because of their accuracy and stability. The analysis of usefulness of various forecasting methods for two independent time series is an essential, very valuable element of the study carried out. Thanks to this, reliability of conclusions concerning the preferred methods has considerably increased.
Collapse
|
9
|
Hon KK, Ng CW, Chan PW. Machine learning based multi-index prediction of aviation turbulence over the Asia-Pacific. MACHINE LEARNING WITH APPLICATIONS 2020. [DOI: 10.1016/j.mlwa.2020.100008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
|
10
|
Noise Prediction Using Machine Learning with Measurements Analysis. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10186619] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The noise prediction using machine learning is a special study that has recently received increased attention. This is particularly true in workplaces with noise pollution, which increases noise exposure for general laborers. This study attempts to analyze the noise equivalent level (Leq) at the National Synchrotron Radiation Research Center (NSRRC) facility and establish a machine learning model for noise prediction. This study utilized the gradient boosting model (GBM) as the learning model in which past noise measurement records and many other features are integrated as the proposed model makes a prediction. This study analyzed the time duration and frequency of the collected Leq and also investigated the impact of training data selection. The results presented in this paper indicate that the proposed prediction model works well in almost noise sensors and frequencies. Moreover, the model performed especially well in sensor 8 (125 Hz), which was determined to be a serious noise zone in the past noise measurements. The results also show that the root-mean-square-error (RMSE) of the predicted harmful noise was less than 1 dBA and the coefficient of determination (R2) value was greater than 0.7. That is, the working field showed a favorable noise prediction performance using the proposed method. This positive result shows the ability of the proposed approach in noise prediction, thus providing a notification to the laborer to prevent long-term exposure. In addition, the proposed model accurately predicts noise future pollution, which is essential for laborers in high-noise environments. This would keep employees healthy in avoiding noise harmful positions to prevent people from working in that environment.
Collapse
|
11
|
Comparative Analysis of Rainfall Prediction Models Using Machine Learning in Islands with Complex Orography: Tenerife Island. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9224931] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
We present a comparative study between predictive monthly rainfall models for islands of complex orography using machine learning techniques. The models have been developed for the island of Tenerife (Canary Islands). Weather forecasting is influenced both by the local geographic characteristics as well as by the time horizon comprised. Accuracy of mid-term rainfall prediction on islands with complex orography is generally low when carried out with atmospheric models. Predictive models based on algorithms such as Random Forest or Extreme Gradient Boosting among others were analyzed. The predictors used in the models include weather predictors measured in two main meteorological stations, reanalysis predictors from the National Oceanic and Atmospheric Administration, and the global predictor North Atlantic Oscillation, all of them obtained over a period of time of more than four decades. When comparing the proposed models, we evaluated accuracy, kappa and interpretability of the model obtained, as well as the relevance of the predictors used. The results show that global predictors such as the North Atlantic Oscillation Index (NAO) have a very low influence, while the local Geopotential Height (GPH) predictor is relatively more important. Machine learning prediction models are a relevant proposition for predicting medium-term precipitation in similar geographical regions.
Collapse
|