1
|
Mu L, Bi S, Ding X, Xu Y. Transformer-based ozone multivariate prediction considering interpretable and priori knowledge: A case study of Beijing, China. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 366:121883. [PMID: 39047437 DOI: 10.1016/j.jenvman.2024.121883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 06/15/2024] [Accepted: 07/13/2024] [Indexed: 07/27/2024]
Abstract
Ozone pollution is the focus of current environmental governance in China and high-quality prediction of ozone concentration is the prerequisite to effective policymaking. The studied ozone pollution time series exhibits distinct seasonality and secular trends and is associated with various factors. This study developed an interpretable hybrid model by combining STL decomposition and the Transformer (STL-Transformer) with the prior information of ozone time series and global multi-source information as prediction basis. The STL decomposition decomposes ozone time series into trend, seasonal, and remainder components. Then, the three components, along with other air quality and meteorological data, are integrated into the input sequence of the Transformer. The experiment results show that the STL-Transformer outperforms the other five state-of-the-art models, including the standard Transformer. Specially, the univariate forecasting for ozone relies on mimicking the patterns and trends that have occurred in the past. In contrast, multivariate forecasting can effectively capture complex relationships and dependencies involving multiple variables. The method successfully grasps the prior and global multi-source information and simultaneously improves the interpretability of ozone prediction with high precision. This study provides new insights for air pollution forecasting and has reliable theoretical value and practical significance for environmental governance.
Collapse
Affiliation(s)
- Liangliang Mu
- Faculty of Information Science and Engineering, Ocean University of China, Qingdao, 266100, China
| | - Suhuan Bi
- School of Information and Control Engineering, Qingdao University of Technology, Qingdao, 266520, China.
| | - Xiangqian Ding
- Faculty of Information Science and Engineering, Ocean University of China, Qingdao, 266100, China
| | - Yan Xu
- Ocean University of China, Qingdao, 266100, China; Qingdao Financial Research Institute, Dongbei University of Finance and Economics, Qingdao, 266100, China.
| |
Collapse
|
2
|
Ma Z, Wang B, Luo W, Jiang J, Liu D, Wei H, Luo H. Air pollutant prediction model based on transfer learning two-stage attention mechanism. Sci Rep 2024; 14:7385. [PMID: 38548823 PMCID: PMC10978953 DOI: 10.1038/s41598-024-57784-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 03/21/2024] [Indexed: 04/01/2024] Open
Abstract
Atmospheric pollution significantly impacts the regional economy and human health, and its prediction has been increasingly emphasized. The performance of traditional prediction methods is limited due to the lack of historical data support in new atmospheric monitoring sites. Therefore, this paper proposes a two-stage attention mechanism model based on transfer learning (TL-AdaBiGRU). First, the first stage of the model utilizes a temporal distribution characterization algorithm to segment the air pollutant sequences into periods. It introduces a temporal attention mechanism to assign self-learning weights to the period segments in order to filter out essential period features. Then, in the second stage of the model, a multi-head external attention mechanism is introduced to mine the network's hidden layer key features. Finally, the adequate knowledge learned by the model at the source domain site is migrated to the new site to improve the prediction capability of the new site. The results show that (1) the model is modeled from the data distribution perspective, and the critical information within the sequence of periodic segments is mined in depth. (2) The model employs a unique two-stage attention mechanism to capture complex nonlinear relationships in air pollutant data. (3) Compared with the existing models, the mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE) of the model decreased by 14%, 13%, and 4%, respectively, and the prediction accuracy was greatly improved.
Collapse
Affiliation(s)
- Zhanfei Ma
- School of Information Science and Technology, Baotou Teachers' College, Baotou, 014010, Inner Mongolia, China
- School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou, 014010, Inner Mongolia, China
| | - Bisheng Wang
- School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou, 014010, Inner Mongolia, China.
| | - Wenli Luo
- School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou, 014010, Inner Mongolia, China
| | - Jing Jiang
- School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou, 014010, Inner Mongolia, China
| | - Dongxiang Liu
- School of Information Science and Technology, Baotou Teachers' College, Baotou, 014010, Inner Mongolia, China
| | - Hui Wei
- School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou, 014010, Inner Mongolia, China
| | - HaoYe Luo
- School of Information Science and Technology, Baotou Teachers' College, Baotou, 014010, Inner Mongolia, China
| |
Collapse
|
3
|
Du X, Yuan Z, Huang D, Ma W, Yang J, Mo J. Importance of secondary decomposition in the accurate prediction of daily-scale ozone pollution by machine learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 904:166963. [PMID: 37696411 DOI: 10.1016/j.scitotenv.2023.166963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 08/17/2023] [Accepted: 09/08/2023] [Indexed: 09/13/2023]
Abstract
Machine learning (ML) models have been proven as a reliable tool in predicting ambient pollution concentrations at various places in the world. However, their performance in predicting the maximum daily 8-h averaged ozone (MDA8 O3), the metric often used for O3 pollution assessment and management, is relatively poorer. This is largely resulted from more irregular data fluctuations of the MDA8 O3 levels governed collectively by the synoptic condition, local photochemistry, and long-range transport. In order to improve the prediction accuracy of MDA8 O3, this study developed a secondary decomposition ML model framework which coupled the complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) as the primary decomposition, the variational mode decomposition (VMD) as secondary decomposition, and the gate recurrent unit (GRU) ML model. By applying this secondary decomposition model framework on MDA8 O3 prediction for the first time, we showed that the prediction accuracy of MDA8 O3 is largely improved from R2 of 0.46 and RMSE of 30.4 μg/m3 for GRU without decomposition to R2 of 0.91 and RMSE of 12.6 μg/m3 over the Pearl River Delta of China. We also found that the prediction accuracy rate of O3 pollution non-attainments, an essential indicator for initiating contingency O3 pollution control, improved greatly from 14.9 % for GRU without decomposition to 72.5 %. The performance of O3 pollution non-attainment prediction is relatively higher in southwestern PRD, which is mainly due to greater number and severity of O3 non-attainments in southwestern cities located downwind of the emission hotspot area at central PRD. This study underscored the importance of secondary decomposition in accurately predicting daily-scale O3 concentration and non-attainments over the PRD, which can be extended to other photochemically active region worldwide to improve their O3 prediction accuracy and assist in O3 contingency control.
Collapse
Affiliation(s)
- Xinyue Du
- School of Environment and Energy, South China University of Technology, Guangzhou 510006, China
| | - Zibing Yuan
- School of Environment and Energy, South China University of Technology, Guangzhou 510006, China.
| | - Daojian Huang
- South China Institute of Environmental Sciences, Ministry of Ecology and Environment of China, Guangzhou 510655, China.
| | - Wei Ma
- School of Environment and Energy, South China University of Technology, Guangzhou 510006, China
| | - Jun Yang
- School of Environment and Energy, South China University of Technology, Guangzhou 510006, China
| | - Jianbin Mo
- School of Environment and Energy, South China University of Technology, Guangzhou 510006, China
| |
Collapse
|
4
|
Li J, Jang JC, Zhu Y, Lin CJ, Wang S, Xing J, Dong X, Li J, Zhao B, Zhang B, Yuan Y. Development of a recurrent spatiotemporal deep-learning method coupled with data fusion for correction of hourly ozone forecasts. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2023; 335:122291. [PMID: 37527757 DOI: 10.1016/j.envpol.2023.122291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/28/2023] [Revised: 07/14/2023] [Accepted: 07/28/2023] [Indexed: 08/03/2023]
Abstract
Ambient ozone (O3) predictions can be very challenging mainly due to the highly nonlinear photochemistry among its precursors, and meteorological conditions and regional transport can further complicate the O3 formation processes. The emission-based chemical transport models (CTM) are broadly used to predict O3 formation, but they may deviate from observations due to input uncertainties such as emissions and meteorological data, in addition to the treatment of O3 nonlinear chemistry. In this study, an innovative recurrent spatiotemporal deep-learning (RSDL) method with model-monitor coupled convolutional recurrent neural networks (ConvRNN) has been developed to improve O3 predictions of CTM. The RSDL method was first used to build the ConvRNN within a 24-h scale to characterize the spatiotemporal relationships between the monitored O3 data and CTM simulations, and then incorporated the recurrent pattern to achieve 72-h multi-site forecasts based on a pilot study over the Pearl River Delta (PRD) region of China. The results showed that the RSDL method predicted O3 with high accuracy over this case study, with an increase of 27.54% in the correlation coefficient (R) average for all sites as well as an increase in R of 0.14-0.21 for all cities compared to CTM. Moreover, the regional distribution of CTM was further improved by the RSDL predictions with the data fusion technique, which greatly reduced the underpredictions of O3 concentrations, particularly in high O3-level areas (concentrations >160 μg/m3), with a 33.55% reduction in the mean absolute error (MAE).
Collapse
Affiliation(s)
- Jie Li
- Guangdong Provincial Key Laboratory of Atmospheric Environment and Pollution Control, School of Environment and Energy, South China University of Technology, Guangzhou Higher Education Mega Center, Guangzhou, 510006, China
| | - Ji-Cheng Jang
- Guangdong Provincial Key Laboratory of Atmospheric Environment and Pollution Control, School of Environment and Energy, South China University of Technology, Guangzhou Higher Education Mega Center, Guangzhou, 510006, China
| | - Yun Zhu
- Guangdong Provincial Key Laboratory of Atmospheric Environment and Pollution Control, School of Environment and Energy, South China University of Technology, Guangzhou Higher Education Mega Center, Guangzhou, 510006, China.
| | - Che-Jen Lin
- Department of Civil and Environmental Engineering, Lamar University, Beaumont, TX, 77710, USA
| | - Shuxiao Wang
- State Key Joint Laboratory of Environment Simulation and Pollution Control, School of Environment, Tsinghua University, Beijing, 100084, China
| | - Jia Xing
- State Key Joint Laboratory of Environment Simulation and Pollution Control, School of Environment, Tsinghua University, Beijing, 100084, China
| | - Xinyi Dong
- Joint International Research Laboratory of Atmospheric and Earth System Sciences and Institute for Climate and Global Change Research, School of Atmospheric Sciences, Nanjing University, Nanjing, 210023, China
| | - Jinying Li
- Guangdong Provincial Key Laboratory of Atmospheric Environment and Pollution Control, School of Environment and Energy, South China University of Technology, Guangzhou Higher Education Mega Center, Guangzhou, 510006, China
| | - Bin Zhao
- State Key Joint Laboratory of Environment Simulation and Pollution Control, School of Environment, Tsinghua University, Beijing, 100084, China
| | - Bingyao Zhang
- Guangdong Provincial Key Laboratory of Atmospheric Environment and Pollution Control, School of Environment and Energy, South China University of Technology, Guangzhou Higher Education Mega Center, Guangzhou, 510006, China
| | - Yingzhi Yuan
- Guangdong Provincial Key Laboratory of Atmospheric Environment and Pollution Control, School of Environment and Energy, South China University of Technology, Guangzhou Higher Education Mega Center, Guangzhou, 510006, China
| |
Collapse
|
5
|
Cheng M, Fang F, Navon IM, Zheng J, Zhu J, Pain C. Assessing uncertainty and heterogeneity in machine learning-based spatiotemporal ozone prediction in Beijing-Tianjin- Hebei region in China. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 881:163146. [PMID: 37011680 DOI: 10.1016/j.scitotenv.2023.163146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2022] [Revised: 03/03/2023] [Accepted: 03/25/2023] [Indexed: 06/01/2023]
Abstract
Accurate prediction of spatiotemporal ozone concentration is of great significance to effectively establish advanced early warning systems and regulate air pollution control. However, the comprehensive assessment of uncertainty and heterogeneity in spatiotemporal ozone prediction remains unknown. Here, we systematically analyze the hourly and daily spatiotemporal predictive performances using convolutional long short term memory (ConvLSTM) and deep convolutional generative adversarial network (DCGAN) models over the Beijing-Tianjin-Hebei region in China from 2013 to 2018. In extensive scenarios, our results show that the machine learning-based (ML-based) models achieve better spatiotemporal ozone concentration prediction performance with multiple meteorological conditions. A further comparison to the air pollution model-Nested Air Quality Prediction Modelling System (NAQPMS) and monitoring observations, the ConvLSTM model demonstrates the practical feasibility of identifying high ozone concentration distribution and capturing spatiotemporal ozone variation patterns at a high spatial resolution (here 15 km × 15 km).
Collapse
Affiliation(s)
- Meiling Cheng
- Applied Modelling and Computation Group, Department of Earth Science and Engineering, Imperial College London, SW7 2BP, UK
| | - Fangxin Fang
- Applied Modelling and Computation Group, Department of Earth Science and Engineering, Imperial College London, SW7 2BP, UK.
| | - Ionel Michael Navon
- Department of Scientific Computing, Florida State University, Tallahassee, FL 32306-4120, USA
| | - Jie Zheng
- Center for Excellence in Regional Atmospheric Environment, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China
| | - Jiang Zhu
- International Center for Climate and Environment Sciences, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China
| | - Christopher Pain
- Applied Modelling and Computation Group, Department of Earth Science and Engineering, Imperial College London, SW7 2BP, UK
| |
Collapse
|
6
|
Construction and application of effluent quality prediction model with insufficient data based on transfer learning algorithm in wastewater treatment plants. Biochem Eng J 2023. [DOI: 10.1016/j.bej.2023.108807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
|
7
|
Spatiotemporal prediction of O3 concentration based on the KNN-Prophet-LSTM model. Heliyon 2022; 8:e11670. [DOI: 10.1016/j.heliyon.2022.e11670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 10/19/2022] [Accepted: 11/10/2022] [Indexed: 11/23/2022] Open
|
8
|
Wang N, Zhang Y, Li L, Wang H, Zhao Y, Wu G, Li M, Zhou Z, Wang X, Yu JZ, Zhou Y. Ambient particle characteristics by single particle aerosol mass spectrometry at a coastal site in Hong Kong: a case study affected by the sea-land breeze. PeerJ 2022; 10:e14116. [PMID: 36325180 PMCID: PMC9620973 DOI: 10.7717/peerj.14116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 09/04/2022] [Indexed: 01/21/2023] Open
Abstract
The sea-land breeze (SLB) circulation plays a vital role in the transport of atmospheric pollutants in coastal cities. In this study, a single particle aerosol mass spectrometer (SPAMS) and combined bulk aerosol instruments were deployed to investigate the ambient particle characteristic at a suburban coastal site in Hong Kong from February 22 to March 10, 2013. Significant SLB circulations were captured from March 6-10, 2013, during the campaign. During the SLB periods, air quality worsened, with PM2.5 concentrations reaching a peak of 55.6 μg m-3 and an average value of 42.8 ± 4.5 μg m-3. A total of 235,894 particles were measured during the SLB stage. Eight major sources were identified by investigating the mixing states of the total particles, including the coal-burning related particles (48.1%), biomass burning particles (6.7%), vehicle emission-related particles (16.4%), sea salt (9.2%), ship emission particles (2.7%), dust/steeling industries (3.7%), waste incineration (6.3%), and road dust (3.9%). It was noteworthy that the PM2.5 concentrations and particle numbers increased sharply during the transition of land wind to the sea breeze. Meanwhile, the continental sourced pollutants recirculated back to land resulting in a cumulative increase in pollutants. Both individual and bulk measurements support the above results, with high contributions from coal burning, biomass burning, bulk K+, and NO3 -, which were probably from the regional transportation from the nearby area. In contrast, the ship and vehicle emissions increased during the SLB period, with a high sulfate concentration partially originating from the ship emission. In this study, field evidence of continental-source pollutants backflow to land with the evolution of sea breeze was observed and helped our current understanding of the effect of SLB on air quality in the coastal city.
Collapse
Affiliation(s)
- Nana Wang
- College of Oceanic and Atmospheric Sciences, Ocean University of Qingdao, Qingdao, China
| | - Yanjing Zhang
- College of Oceanic and Atmospheric Sciences, Ocean University of Qingdao, Qingdao, China
| | - Lei Li
- Institute of Atmospheric Environment Safety and Pollution Control, Jinan University, Guangdong, China
| | - Houwen Wang
- College of Oceanic and Atmospheric Sciences, Ocean University of Qingdao, Qingdao, China
| | - Yunhui Zhao
- College of Oceanic and Atmospheric Sciences, Ocean University of Qingdao, Qingdao, China
| | - Guanru Wu
- College of Oceanic and Atmospheric Sciences, Ocean University of Qingdao, Qingdao, China
| | - Mei Li
- Institute of Atmospheric Environment Safety and Pollution Control, Jinan University, Guangdong, China
| | - Zhen Zhou
- Institute of Atmospheric Environment Safety and Pollution Control, Jinan University, Guangdong, China
| | - Xinfeng Wang
- Environment Research Institute, Shandong University, Qingdao, China
| | - Jian Zhen Yu
- Division of Environment, Hong Kong University of Science and Technology, Kowloon, Hong Kong,Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Yang Zhou
- College of Oceanic and Atmospheric Sciences, Ocean University of Qingdao, Qingdao, China
| |
Collapse
|
9
|
PM2.5 forecasting for an urban area based on deep learning and decomposition method. Sci Rep 2022; 12:17565. [PMID: 36266317 PMCID: PMC9584903 DOI: 10.1038/s41598-022-21769-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 09/30/2022] [Indexed: 01/13/2023] Open
Abstract
Rapid growth in industrialization and urbanization have resulted in high concentration of air pollutants in the environment and thus causing severe air pollution. Excessive emission of particulate matter to ambient air has negatively impacted the health and well-being of human society. Therefore, accurate forecasting of air pollutant concentration is crucial to mitigate the associated health risk. This study aims to predict the hourly PM2.5 concentration for an urban area in Malaysia using a hybrid deep learning model. Ensemble empirical mode decomposition (EEMD) was employed to decompose the original sequence data of particulate matter into several subseries. Long short-term memory (LSTM) was used to individually forecast the decomposed subseries considering the influence of air pollutant parameters for 1-h ahead forecasting. Then, the outputs of each forecast were aggregated to obtain the final forecasting of PM2.5 concentration. This study utilized two air quality datasets from two monitoring stations to validate the performance of proposed hybrid EEMD-LSTM model based on various data distributions. The spatial and temporal correlation for the proposed dataset were analysed to determine the significant input parameters for the forecasting model. The LSTM architecture consists of two LSTM layers and the data decomposition method is added in the data pre-processing stage to improve the forecasting accuracy. Finally, a comparison analysis was conducted to compare the performance of the proposed model with other deep learning models. The results illustrated that EEMD-LSTM yielded the highest accuracy results among other deep learning models, and the hybrid forecasting model was proved to have superior performance as compared to individual models.
Collapse
|