Al Yammahi A, Aung Z. Forecasting the concentration of NO2 using statistical and machine learning methods: A case study in the UAE.
Heliyon 2023;
9:e12584. [PMID:
36793966 PMCID:
PMC9922785 DOI:
10.1016/j.heliyon.2022.e12584]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 10/06/2022] [Accepted: 12/15/2022] [Indexed: 12/26/2022] Open
Abstract
Nitrogen dioxide (NO2) is the most active pollutant gas emitted in the industrial era and is highly correlated with human activities. Tracking NO2 emissions and predicting their concentrations represent important steps toward controlling pollution and setting rules to protect people's health indoors, such as in factories, and in outdoor environments. The concentration of NO2 was affected by the COVID-19 lockdown period and decreased because of restrictions on outdoor activities. In this study, the concentration of NO2 was predicted at 14 ground stations in the United Arab Emirates (UAE) during December 2020 based on training over a full time period of two years (2019-2020). Statistical and machine learning models, such as autoregressive integrated moving average (ARIMA), seasonal autoregressive integrated moving average (SARIMA), long short-term memory (LSTM), and nonlinear autoregressive neural network (NAR-NN), are used with both open- and closed-loop architectures. The mean absolute percentage error (MAPE) was used to evaluate the performance of the models, and the results ranged from "very good" (MAPE of 8.64% at the Liwa station with the closed loop) to "acceptable" (MAPE of 42.45% at the Khadejah School station with the open loop). The results show that the predictions based on the open loop are generally better than those based on the closed loop because they yield statistically significantly lower MAPE values. For both loop types, we selected stations exhibiting the lowest, medium, and highest MAPE values as representative cases. In addition, we demonstrated that the MAPE value is highly correlated with the relative standard deviation of NO2 concentration values.
Collapse