1
|
Tao C, Jia M, Wang G, Zhang Y, Zhang Q, Wang X, Wang Q, Wang W. Time-sensitive prediction of NO 2 concentration in China using an ensemble machine learning model from multi-source data. J Environ Sci (China) 2024; 137:30-40. [PMID: 37980016 DOI: 10.1016/j.jes.2023.02.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 02/02/2023] [Accepted: 02/13/2023] [Indexed: 11/20/2023]
Abstract
Nitrogen dioxide (NO2) poses a critical potential risk to environmental quality and public health. A reliable machine learning (ML) forecasting framework will be useful to provide valuable information to support government decision-making. Based on the data from 1609 air quality monitors across China from 2014-2020, this study designed an ensemble ML model by integrating multiple types of spatial-temporal variables and three sub-models for time-sensitive prediction over a wide range. The ensemble ML model incorporates a residual connection to the gated recurrent unit (GRU) network and adopts the advantage of Transformer, extreme gradient boosting (XGBoost) and GRU with residual connection network, resulting in a 4.1%±1.0% lower root mean square error over XGBoost for the test results. The ensemble model shows great prediction performance, with coefficient of determination of 0.91, 0.86, and 0.77 for 1-hr, 3-hr, and 24-hr averages for the test results, respectively. In particular, this model has achieved excellent performance with low spatial uncertainty in Central, East, and North China, the major site-dense zones. Through the interpretability analysis based on the Shapley value for different temporal resolutions, we found that the contribution of atmospheric chemical processes is more important for hourly predictions compared with the daily scale predictions, while the impact of meteorological conditions would be ever-prominent for the latter. Compared with existing models for different spatiotemporal scales, the present model can be implemented at any air quality monitoring station across China to facilitate achieving rapid and dependable forecast of NO2, which will help developing effective control policies.
Collapse
Affiliation(s)
- Chenliang Tao
- Big Data Research Center for Ecology and Environment, Environment Research Institute, Shandong University, Qingdao 266237, China
| | - Man Jia
- Shandong Provincial Eco-environment Monitoring Center, Jinan 250101, China
| | - Guoqiang Wang
- Big Data Research Center for Ecology and Environment, Environment Research Institute, Shandong University, Qingdao 266237, China
| | - Yuqiang Zhang
- Big Data Research Center for Ecology and Environment, Environment Research Institute, Shandong University, Qingdao 266237, China
| | - Qingzhu Zhang
- Big Data Research Center for Ecology and Environment, Environment Research Institute, Shandong University, Qingdao 266237, China.
| | - Xianfeng Wang
- Shandong Provincial Eco-environment Monitoring Center, Jinan 250101, China.
| | - Qiao Wang
- Big Data Research Center for Ecology and Environment, Environment Research Institute, Shandong University, Qingdao 266237, China
| | - Wenxing Wang
- Big Data Research Center for Ecology and Environment, Environment Research Institute, Shandong University, Qingdao 266237, China
| |
Collapse
|
2
|
Barzegar Y, Gorelova I, Bellini F, D’Ascenzo F. Drinking Water Quality Assessment Using a Fuzzy Inference System Method: A Case Study of Rome (Italy). INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:6522. [PMID: 37569062 PMCID: PMC10418417 DOI: 10.3390/ijerph20156522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 07/26/2023] [Accepted: 07/28/2023] [Indexed: 08/13/2023]
Abstract
Drinking water quality assessment is a major issue today, as it is crucial to supply safe drinking water to ensure the well-being of society. Predicting drinking water quality helps strengthen water management and fight water pollution; technologies and practices for drinking water quality assessment are continuously improving; artificial intelligence methods prove their efficiency in this domain. This research effort seeks a hierarchical fuzzy model for predicting drinking water quality in Rome (Italy). The Mamdani fuzzy inference system is applied with different defuzzification methods. The proposed model includes three fuzzy intermediate models and one fuzzy final model. Each model consists of three input parameters and 27 fuzzy rules. A water quality assessment model is developed with a dataset that considers nine parameters (alkalinity, hardness, pH, Ca, Mg, fluoride, sulphate, nitrates, and iron). These nine parameters of drinking water are anticipated to be within the acceptable limits set to protect human health. Fuzzy-logic-based methods have been demonstrated to be appropriate to address uncertainty and subjectivity in drinking water quality assessment; they are an effective method for managing complicated, uncertain water systems and predicting drinking water quality. The proposed method can provide an effective solution for complex systems; this method can be modified easily to improve performance.
Collapse
Affiliation(s)
| | - Irina Gorelova
- Department of Management, Sapienza University of Rome, 00161 Rome, Italy; (Y.B.); (F.B.); (F.D.)
| | | | | |
Collapse
|
3
|
Machine learning methods to predict cadmium (Cd) concentration in rice grain and support soil management at a regional scale. FUNDAMENTAL RESEARCH 2023. [DOI: 10.1016/j.fmre.2023.02.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/12/2023] Open
|
4
|
Johnson DP, Lulla V. Predicting COVID-19 community infection relative risk with a Dynamic Bayesian Network. Front Public Health 2022; 10:876691. [PMID: 36388264 PMCID: PMC9650227 DOI: 10.3389/fpubh.2022.876691] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Accepted: 10/10/2022] [Indexed: 01/21/2023] Open
Abstract
As COVID-19 continues to impact the United States and the world at large it is becoming increasingly necessary to develop methods which predict local scale spread of the disease. This is especially important as newer variants of the virus are likely to emerge and threaten community spread. We develop a Dynamic Bayesian Network (DBN) to predict community-level relative risk of COVID-19 infection at the census tract scale in the U.S. state of Indiana. The model incorporates measures of social and environmental vulnerability-including environmental determinants of COVID-19 infection-into a spatial temporal prediction of infection relative risk 1-month into the future. The DBN significantly outperforms five other modeling techniques used for comparison and which are typically applied in spatial epidemiological applications. The logic behind the DBN also makes it very well-suited for spatial-temporal prediction and for "what-if" analysis. The research results also highlight the need for further research using DBN-type approaches that incorporate methods of artificial intelligence into modeling dynamic processes, especially prominent within spatial epidemiologic applications.
Collapse
Affiliation(s)
- Daniel P. Johnson
- Department of Geography, Indiana University – Purdue University at Indianapolis, Indianapolis, IN, United States,*Correspondence: Daniel P. Johnson
| | - Vijay Lulla
- Center for Complex Networks and Systems Research, Indiana University, Bloomington, IN, United States
| |
Collapse
|
5
|
Iskandaryan D, Ramos F, Trilles S. Comparison of Nitrogen Dioxide Predictions During a Pandemic and Non-pandemic Scenario in the City of Madrid using a Convolutional LSTM Network. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS 2022. [DOI: 10.1142/s1469026822500146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Traditionally, machine learning technologies with the methods and capabilities available, combined with a geospatial dimension, can perform predictive analyzes of air quality with greater accuracy. However, air pollution is influenced by many external factors, one of which has recently been caused by the restrictions applied to curb the relentless advance of COVID-19. These sudden changes in air quality levels can negatively influence current forecasting models. This work compares air pollution forecasts during a pandemic and non-pandemic period under the same conditions. The ConvLSTM algorithm was applied to predict the concentration of nitrogen dioxide using data from the air quality and meteorological stations in Madrid. The proposed model was applied for two scenarios: pandemic (January–June 2020) and non-pandemic (January–June 2019), each with sub-scenarios based on time granularity (1-h, 12-h, 24-h and 48-h) and combination of features. The Root Mean Square Error was taken as the estimation metric, and the results showed that the proposed method outperformed a reference model, and the feature selection technique significantly improved the overall accuracy.
Collapse
Affiliation(s)
- Ditsuhi Iskandaryan
- Institute of New Imaging Technologies, Universitat Jaume I, Avinguda de Vicent Sos Baynat, s/n Castelló de la Plana 12071, Spain
| | - Francisco Ramos
- Institute of New Imaging Technologies, Universitat Jaume I, Avinguda de Vicent Sos Baynat, s/n Castelló de la Plana 12071, Spain
| | - Sergio Trilles
- Institute of New Imaging Technologies, Universitat Jaume I, Avinguda de Vicent Sos Baynat, s/n Castelló de la Plana 12071, Spain
| |
Collapse
|
6
|
Iskandaryan D, Ramos F, Trilles S. Bidirectional convolutional LSTM for the prediction of nitrogen dioxide in the city of Madrid. PLoS One 2022; 17:e0269295. [PMID: 35648766 PMCID: PMC9159618 DOI: 10.1371/journal.pone.0269295] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Accepted: 05/18/2022] [Indexed: 12/03/2022] Open
Abstract
Nitrogen dioxide is one of the pollutants with the most significant health effects. Advanced information on its concentration in the air can help to monitor and control further consequences more effectively, while also making it easier to apply preventive and mitigating measures. Machine learning technologies with available methods and capabilities, combined with the geospatial dimension, can perform predictive analyses with higher accuracy and, as a result, can serve as a supportive tool for productive management. One of the most advanced machine learning algorithms, Bidirectional convolutional LSTM, is being used in ongoing work to predict the concentration of nitrogen dioxide. The model has been validated to perform more accurate spatiotemporal analysis based on the integration of temporal and geospatial factors. The analysis was carried out according to two scenarios developed on the basis of selected features using data from the city of Madrid for the periods January-June 2019 and January-June 2020. Evaluation of the model's performance was conducted using the Root Mean Square Error and the Mean Absolute Error which emphasises the superiority of the proposed model over the reference models. In addition, the significance of a feature selection technique providing improved accuracy was underlined. In terms of execution time, due to the complexity of the Bidirectional convolutional LSTM architecture, convergence and generalisation of the data took longer, resulting in the superiority of the reference models.
Collapse
Affiliation(s)
- Ditsuhi Iskandaryan
- Institute of New Imaging Technologies (INIT), Universitat Jaume I, Castelló de la Plana, Castellón, Spain
| | - Francisco Ramos
- Institute of New Imaging Technologies (INIT), Universitat Jaume I, Castelló de la Plana, Castellón, Spain
| | - Sergio Trilles
- Institute of New Imaging Technologies (INIT), Universitat Jaume I, Castelló de la Plana, Castellón, Spain
| |
Collapse
|
7
|
Abstract
Nowadays, observing, recording, and modeling the dynamics of atmospheric pollutants represent actual study areas given the effects of pollution on the population and ecosystems. The existence of aberrant values may influence reports on air quality when they are based on average values over a period. This may also influence the quality of models, which are further used in forecasting. Therefore, correct data collection and analysis is necessary before modeling. This study aimed to detect aberrant values in a nitrogen oxide concentration series recorded in the interval 1 January–8 June 2016 in Timisoara, Romania, and retrieved from the official reports of the National Network for Monitoring the Air Quality, Romania. Four methods were utilized, including the interquartile range (IQR), isolation forest, local outlier factor (LOF) methods, and the generalized extreme studentized deviate (GESD) test. Autoregressive integrated moving average (ARIMA), Generalized Regression Neural Networks (GRNN), and hybrid ARIMA-GRNN models were built for the series before and after the removal of aberrant values. The results show that the first approach provided a good model (from a statistical viewpoint) for the series after the anomalies removal. The best model was obtained by the hybrid ARIMA-GRNN. For example, for the raw NO2 series, the ARIMA model was not statistically validated, whereas, for the series without outliers, the ARIMA(1,1,1) was validated. The GRNN model for the raw series was able to learn the data well: R2 = 76.135%, the correlation between the actual and predicted values (rap) was 0.8778, the mean standard errors (MSE) = 0.177, the mean absolute error MAE = 0.2839, and the mean absolute percentage error MAPE = 9.9786. Still, on the test set, the results were worse: MSE = 1.5101, MAE = 0.8175, rap = 0.4482. For the series without outliers, the model was able to learn the data in the training set better than for the raw series (R2 = 0.996), whereas, on the test set, the results were not very good (R2 = 0.473). The performances of the hybrid ARIMA–GRNN on the initial series were not satisfactory on the test (the pattern of the computed values was almost linear) but were very good on the series without outliers (the correlation between the predicted values on the test set was very close to 1). The same was true for the models built for O3.
Collapse
|
8
|
Attention-Based Distributed Deep Learning Model for Air Quality Forecasting. SUSTAINABILITY 2022. [DOI: 10.3390/su14063269] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Air quality forecasting has become an essential factor in facilitating sustainable development worldwide. Several countries have implemented monitoring stations to collect air pollution particle data and meteorological information using parameters such as hourly timespans. This research focuses on unravelling a new framework for air quality prediction worldwide and features Busan, South Korea as its model city. The paper proposes the application of an attention-based convolutional BiLSTM autoencoder model. The proposed deep learning model has been trained on a distributed framework, referred to data parallelism, to forecast the intensity of particle pollution (PM2.5 and PM10). The algorithm automatically learns the intrinsic correlation among the particle pollution in different locations. Each location’s meteorological and traffic data is extensively exploited to improve the model’s performance. The model has been trained using air quality particle data and car traffic information. The traffic information is obtained by a device which counts cars passing a specific area through the YOLO algorithm, and then sends the data to a stacked deep autoencoder to be encoded alongside the meteorological data before the final prediction. In addition, multiple one-dimensional CNN layers are used to obtain the local spatial features jointly with a stacked attention-based BiLSTM layer to figure out how air quality particles are correlated in space and time. The evaluation of the new attention-based convolutional BiLSTM autoencoder model was derived from data collected and retrieved from comprehensive experiments conducted in South Korea. The results not only show that the framework outperforms the previous models both on short- and long-term predictions but also indicate that traffic information can improve the accuracy of air quality forecasting. For instance, during PM2.5 prediction, the proposed attention-based model obtained the lowest MAE (5.02 and 22.59, respectively, for short-term and long-term prediction), RMSE (7.48 and 28.02) and SMAPE (17.98 and 39.81) among all the models, which indicates strong accuracy between observed and predicted values. It was also found that the newly proposed model had the lowest average training time compared to the baseline algorithms. Furthermore, the proposed framework was successfully deployed in a cloud server in order to provide future air quality information in real time and when needed.
Collapse
|
9
|
Determination of Water Depth in Ports Using Satellite Data Based on Machine Learning Algorithms. ENERGIES 2021. [DOI: 10.3390/en14092486] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
One of the fundamental maintenance tasks of ports is the periodic dredging of them. This is necessary to guarantee a minimum draft that will enable ships to access ports safely. The determination of bathymetries is the instrument that determines the need for dredging and permits an analysis of the behavior of the port bottom over time, in order to achieve adequate water depth. Satellite data processing to predict environmental parameters is used increasingly. Based on satellite data and using different machine learning algorithm techniques, this study has sought to estimate the seabed in ports, taking into account the fact that the port areas are strongly anthropized areas. The algorithms that were used were Support Vector Machine (SVM), Random Forest (RF) and the Multi-Adaptive Regression Splines (MARS). The study was carried out in the ports of Candás and Luarca in the Principality of Asturias. In order to validate the results obtained, data was acquired in situ by using a single beam provided. The results show that this type of methodology can be used to estimate coastal bathymetry. However, when deciding which system was best, priority was given to simplicity and robustness. The results of the SVM and RF algorithms outperform those of the MARS. RF performs better in Candás with a mean absolute error (MAE) of 0.27 cm, whereas SVM performs better in Luarca with a mean absolute error of 0.37 cm. It is suggested that this approach is suitable as a simpler and more cost-effective rough resolution alternative, for estimating the depth of turbid water in ports, than single-beam sonar, which is labor-intensive and polluting.
Collapse
|
10
|
Analysis of the Spatio-Temporal Evolution of Dredging from Satellite Images: A Case Study in the Principality of Asturias (Spain). JOURNAL OF MARINE SCIENCE AND ENGINEERING 2021. [DOI: 10.3390/jmse9030267] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
One of the fundamental tasks in the maintenance of port operations is periodic dredging. These dredging operations facilitate the elimination of sediments that the coastal dynamics introduce. Dredging operations are increasingly restrictive and costly due to environmental requirements. Understanding the condition of the seabed before and after dredging is essential. In addition, determining how the seabed has behaved in recent years is important to consider when planning future dredging operations. In order to analyze the behavior of sediment transport and the changes to the seabed due to sedimentation, studies of littoral dynamics are conducted to model the deposition of sediments. Another methodology that could be used to analyze the real behavior of sediments would be to study and compare port bathymetries collected periodically. The problem with this methodology is that it requires numerous bathymetric surveys to produce a sufficiently significant analysis. This study provides an effective solution for obtaining a dense time series of bathymetry mapping using satellite data, and enables the past behavior of the seabed to be examined. The methodology proposed in this work uses Sentinel-2A (10 m resolution) satellite images to obtain historical bathymetric series by the development of a random forest algorithm. From these historical bathymetric series, it is possible to determine how the seabed has behaved and how the entry of sediments into the study area occurs. This methodology is applied in the Port of Luarca (Principality of Asturias), obtaining satellite images and extracting successive bathymetry mapping utilizing the random forest algorithm. This work reveals how once the dock was dredged, the sediments were redeposited and the seabed recovered its level prior to dredging in less than 2 months.
Collapse
|
11
|
Abstract
Air pollution and its consequences are negatively impacting on the world population and the environment, which converts the monitoring and forecasting air quality techniques as essential tools to combat this problem. To predict air quality with maximum accuracy, along with the implemented models and the quantity of the data, it is crucial also to consider the dataset types. This study selected a set of research works in the field of air quality prediction and is concentrated on the exploration of the datasets utilised in them. The most significant findings of this research work are: (1) meteorological datasets were used in 94.6% of the papers leaving behind the rest of the datasets with a big difference, which is complemented with others, such as temporal data, spatial data, and so on; (2) the usage of various datasets combinations has been commenced since 2009; and (3) the utilisation of open data have been started since 2012, 32.3% of the studies used open data, and 63.4% of the studies did not provide the data.
Collapse
|
12
|
Dupont MF, Elbourne A, Cozzolino D, Chapman J, Truong VK, Crawford RJ, Latham K. Chemometrics for environmental monitoring: a review. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2020; 12:4597-4620. [PMID: 32966380 DOI: 10.1039/d0ay01389g] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Environmental monitoring is necessary to ensure the overall health and conservation of an ecosystem. However, ecosystems (e.g. air, water, soil), are complex, involving numerous processes (both native and external), inputs, contaminants, and living organisms. As such, monitoring an environmental system is not a trivial task. The data obtained from natural systems is often multifaceted and convoluted, as a multitude of inputs can be intertwined within the matrix of the information obtained as part of a study. This means that trends and important results can be easily overlooked by conventional and single dimensional data analysis protocols. Recently, chemometric methods have emerged as a powerful method for maximizing the details contained within a chemical data set. Specifically, chemometrics refers to the use of mathematical and statistical analysis methods to evaluate chemical data, beyond univariant analysis. This type of analysis can provide a quantitative description of environmental measurements, while also having the capacity to reveal previously overlooked trends in data sets. Applying chemometrics to environmental data allows us to identify and describe the inter-relationship of environmental drivers, sources of contamination, and their potential impact upon the environment. This review aims to provide a detailed understanding of chemometric techniques, how they are currently used in environmental monitoring, and how these techniques can be used to improve current practices. An enhanced ability to monitor environmental conditions and to predict trends would be greatly beneficial to government and research agencies in their ability to develop environmental policies and analytical procedures.
Collapse
|
13
|
Distributed Deep Features Extraction Model for Air Quality Forecasting. SUSTAINABILITY 2020. [DOI: 10.3390/su12198014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Several studies in environmental engineering emphasize the importance of air quality forecasting for sustainable development around the world. In this paper, we studied a new approach for air quality forecasting in Busan metropolitan city. We proposed a convolutional Bi-Directional Long-Short Term Memory (Bi-LSTM) autoencoder model trained using a distributed architecture to predict the concentration of the air quality particles (PM2.5 and PM10). The proposed deep learning model can automatically learn the intrinsic correlation among the pollutants in different location. Also, the meteorological and the pollution gas information at each location are fully utilized, which is beneficial for the performance of the model. We used multiple one-dimension convolutional neural network (CNN) layers to extract the local spatial features and a stacked Bi-LSTM layer to learn the spatiotemporal correlation of air quality particles. In addition, we used a stacked deep autoencoder to encode the essential transformation patterns of the pollution gas and the meteorological data, since they are very important for providing useful information that can significantly improve the prediction of the air quality particles. Finally, in order to reduce the training time and the resource consumption, we used a distributed deep leaning approach called data parallelism, which has never been used to tackle the problem of air quality forecasting. We evaluated our approach with extensive experiments based on the data collected in Busan metropolitan city. The results reveal the superiority of our framework over ten baseline models and display how the distributed deep learning model can significantly improve the training time and even the prediction accuracy.
Collapse
|
14
|
Stockwell WR, Saunders E, Goliff WS, Fitzgerald RM. A perspective on the development of gas-phase chemical mechanisms for Eulerian air quality models. JOURNAL OF THE AIR & WASTE MANAGEMENT ASSOCIATION (1995) 2020; 70:44-70. [PMID: 31750791 DOI: 10.1080/10962247.2019.1694605] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 10/21/2019] [Accepted: 11/13/2019] [Indexed: 06/10/2023]
Abstract
An essential component of a three-dimensional air quality model is its gas-phase mechanism. We present an overview of the necessary atmospheric chemistry and a discussion of the types of mechanisms with some specific examples such as the Master Chemical Mechanism, the Carbon Bond, SAPRC and the Regional Atmospheric Chemistry Mechanism (RACM). The first versions of the Carbon Bond and SAPRC mechanisms were developed through a hierarchy of chemical species approach that relied heavily on chemical environmental chamber data. Now a new approach has been proposed where the first step is to develop a highly detailed explicit mechanism such as the Master Chemical Mechanism and the second step is to test the detailed explicit mechanism against laboratory and field data. Finally, the detailed mechanism is condensed for use in a three-dimensional air quality model. Here it is argued that the development of highly detailed explicit mechanisms is very valuable for research, but we suggest that combining the hierarchy of chemical species and the detailed explicit mechanism approaches would be better than either alone.Implication: Many gas-phase mechanisms are available for urban, regional and global air quality modeling. A "hierarchy of chemical species approach," relying heavily on smog-chamber data was used for the development of the early series of mechanisms. Now the development of large, explicit master mechanisms that may be condensed is a significant, trend. However, a continuing problem with air quality mechanism development is due to the high complexity of atmospheric chemistry and the current availability of laboratory measurements. This problem requires a balance between completeness and speculation so that models maintain their utility for policymakers.
Collapse
Affiliation(s)
- William R Stockwell
- Department of Physics, University of Texas El Paso, El Paso, TX, USA
- Division of Atmospheric Sciences, Desert Research Institute, Nevada System of Higher Education, Reno, NV, USA
| | - Emily Saunders
- Science Systems and Applications, Inc. and Global Modeling Assimilation Office (GMAO), NASA Goddard Space Flight Center, Greenbelt, MD, USA
| | - Wendy S Goliff
- Chemistry Department, Riverside City College, Riverside, CA, USA
| | - Rosa M Fitzgerald
- Department of Physics, University of Texas El Paso, El Paso, TX, USA
| |
Collapse
|
15
|
Prediction of Ambient PM2.5 Concentrations Using a Correlation Filtered Spatial-Temporal Long Short-Term Memory Model. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app10010014] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Due to the increasingly serious air pollution problem, air quality prediction has been an important approach for air pollution control and prevention. Many prediction methods have been proposed in recent years to improve the prediction accuracy. However, most of the existing methods either did not consider the spatial relationships between monitoring stations or overlooked the strength of the correlation. Excluding the spatial correlation or including too much weak spatial inputs could influence the modeling and reduce the prediction accuracy. To overcome the limitation, this paper proposes a correlation filtered spatial-temporal long short-term memory (CFST-LSTM) model for air quality prediction. The model is designed based on the original LSTM model and is equipped with a spatial-temporal filter (STF) layer. This layer not only takes into account the spatial influence between stations, but also can extract highly correlated sequential data and drop weaker ones. To evaluate the proposed CFST-LSTM model, hourly PM2.5 concentration data of California are collected and preprocessed. Several experiments are conducted. The experimental results show that the CFST-LSTM model can effectively improve the prediction accuracy and has great generalization.
Collapse
|
16
|
Li X, Zhang X. Predicting ground-level PM 2.5 concentrations in the Beijing-Tianjin-Hebei region: A hybrid remote sensing and machine learning approach. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2019; 249:735-749. [PMID: 30933771 DOI: 10.1016/j.envpol.2019.03.068] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2018] [Revised: 02/13/2019] [Accepted: 03/17/2019] [Indexed: 06/09/2023]
Abstract
An accurate estimation of PM2.5 (fine particulate matters with diameters ≤ 2.5 μm) concentration is critical for health risk assessment and generating air pollution control strategies. In this study, a hybrid remote sensing and machine learning approach, named RSRF model is proposed to estimate daily ground-level PM2.5 concentrations, which integrates Random Forest (RF), one of machine learning (ML) models, and aerosol optical depth (AOD), one of remote sensing (RS) products. The proposed RSRF model provides an opportunity for an adequate characterization of real-time spatiotemporal PM2.5 distributions at uninhabited places and complex surfaces. It also offers advantages in handling complicated non-linear relationships among a large number of meteorological, environmental and air pollutant factors, as well as ever-increasing environmental data sets. The applicability of the proposed RSRF model is tested in the Beijing-Tianjin-Hebei region (BTH region) during 2015-2017. Deep Blue (DB) AOD from Aqua-retrieved Collection 6.1 (C_61) aerosol products of Moderate Resolution Imaging Spectroradiometer (MODIS) is validated with Aerosol Robotic Network. The validation results indicate C_61 DB AOD has a high correlation with ground based AOD in the BTH region. The proposed RSRF model performed well in characterizing spatiotemporal variations of annual and seasonal PM2.5 concentrations. It not only is useful to quantify the relationships between PM2.5 and relevant factors such as DB AOD, meteorological and air pollutant variables, but also can provide decision support for air pollution control at a regional environment during haze periods.
Collapse
Affiliation(s)
- Xintong Li
- School of Environmental Science and Engineering, Shandong University, Qingdao, Shandong, 266237, China
| | - Xiaodong Zhang
- School of Environmental Science and Engineering, Shandong University, Qingdao, Shandong, 266237, China.
| |
Collapse
|
17
|
Debnath J, Majumder D, Biswas A. Air quality assessment using weighted interval type-2 fuzzy inference system. ECOL INFORM 2018. [DOI: 10.1016/j.ecoinf.2018.06.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
18
|
|
19
|
Ghaemi Z, Alimohammadi A, Farnaghi M. LaSVM-based big data learning system for dynamic prediction of air pollution in Tehran. ENVIRONMENTAL MONITORING AND ASSESSMENT 2018; 190:300. [PMID: 29679160 PMCID: PMC5910457 DOI: 10.1007/s10661-018-6659-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/17/2017] [Accepted: 04/02/2018] [Indexed: 06/08/2023]
Abstract
Due to critical impacts of air pollution, prediction and monitoring of air quality in urban areas are important tasks. However, because of the dynamic nature and high spatio-temporal variability, prediction of the air pollutant concentrations is a complex spatio-temporal problem. Distribution of pollutant concentration is influenced by various factors such as the historical pollution data and weather conditions. Conventional methods such as the support vector machine (SVM) or artificial neural networks (ANN) show some deficiencies when huge amount of streaming data have to be analyzed for urban air pollution prediction. In order to overcome the limitations of the conventional methods and improve the performance of urban air pollution prediction in Tehran, a spatio-temporal system is designed using a LaSVM-based online algorithm. Pollutant concentration and meteorological data along with geographical parameters are continually fed to the developed online forecasting system. Performance of the system is evaluated by comparing the prediction results of the Air Quality Index (AQI) with those of a traditional SVM algorithm. Results show an outstanding increase of speed by the online algorithm while preserving the accuracy of the SVM classifier. Comparison of the hourly predictions for next coming 24 h, with those of the measured pollution data in Tehran pollution monitoring stations shows an overall accuracy of 0.71, root mean square error of 0.54 and coefficient of determination of 0.81. These results are indicators of the practical usefulness of the online algorithm for real-time spatial and temporal prediction of the urban air quality.
Collapse
Affiliation(s)
- Z. Ghaemi
- Faculty of Geodesy and Geomatics Engineering, K.N. Toosi University of Technology, No. 1346, ValiAsr Street, Mirdamad cross, Tehran, 19967-15433 Iran
| | - A. Alimohammadi
- Faculty of Geodesy and Geomatics Engineering, K.N. Toosi University of Technology, No. 1346, ValiAsr Street, Mirdamad cross, Tehran, 19967-15433 Iran
| | - M. Farnaghi
- Faculty of Geodesy and Geomatics Engineering, K.N. Toosi University of Technology, No. 1346, ValiAsr Street, Mirdamad cross, Tehran, 19967-15433 Iran
- GIS Center, Department of Physical Geography and Ecosystem Science, Lund University, Lund, Sweden
| |
Collapse
|
20
|
|
21
|
|
22
|
|
23
|
Saleh C, Dzakiyullah NR, Nugroho JB. Carbon dioxide emission prediction using support vector machine. ACTA ACUST UNITED AC 2016. [DOI: 10.1088/1757-899x/114/1/012148] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
24
|
Režnáková M, Tencer L, Cheriet M. SO-ARTIST: Self-Organized ART-2A inspired clustering for online Takagi–Sugeno fuzzy models. Appl Soft Comput 2015. [DOI: 10.1016/j.asoc.2015.02.022] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
25
|
Ha Q, Wahid H, Duc H, Azzi M. Enhanced radial basis function neural networks for ozone level estimation. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2014.12.048] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
26
|
Motai Y. Kernel association for classification and prediction: a survey. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2015; 26:208-223. [PMID: 25029489 DOI: 10.1109/tnnls.2014.2333664] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Kernel association (KA) in statistical pattern recognition used for classification and prediction have recently emerged in a machine learning and signal processing context. This survey outlines the latest trends and innovations of a kernel framework for big data analysis. KA topics include offline learning, distributed database, online learning, and its prediction. The structural presentation and the comprehensive list of references are geared to provide a useful overview of this evolving field for both specialists and relevant scholars.
Collapse
|
27
|
Wang P, Liu Y, Qin Z, Zhang G. A novel hybrid forecasting model for PM₁₀ and SO₂ daily concentrations. THE SCIENCE OF THE TOTAL ENVIRONMENT 2015; 505:1202-12. [PMID: 25461118 DOI: 10.1016/j.scitotenv.2014.10.078] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/27/2014] [Revised: 10/17/2014] [Accepted: 10/21/2014] [Indexed: 05/03/2023]
Abstract
Air-quality forecasting in urban areas is difficult because of the uncertainties in describing both the emission and meteorological fields. The use of incomplete information in the training phase restricts practical air-quality forecasting. In this paper, we propose a hybrid artificial neural network and a hybrid support vector machine, which effectively enhance the forecasting accuracy of an artificial neural network (ANN) and support vector machine (SVM) by revising the error term of the traditional methods. The hybrid methodology can be described in two stages. First, we applied the ANN or SVM forecasting system with historical data and exogenous parameters, such as meteorological variables. Then, the forecasting target was revised by the Taylor expansion forecasting model using the residual information of the error term in the previous stage. The innovation involved in this approach is that it sufficiently and validly utilizes the useful residual information on an incomplete input variable condition. The proposed method was evaluated by experiments using a 2-year dataset of daily PM₁₀ (particles with a diameter of 10 μm or less) concentrations and SO₂ (sulfur dioxide) concentrations from four air pollution monitoring stations located in Taiyuan, China. The theoretical analysis and experimental results demonstrated that the forecasting accuracy of the proposed model is very promising.
Collapse
Affiliation(s)
- Ping Wang
- Institute of Loess Plateau, Shanxi University, Taiyuan, 030006, PR China
| | - Yong Liu
- Institute of Loess Plateau, Shanxi University, Taiyuan, 030006, PR China.
| | - Zuodong Qin
- Institute of Loess Plateau, Shanxi University, Taiyuan, 030006, PR China
| | - Guisheng Zhang
- School of Economics and Management, Shanxi University, Taiyuan, 030006, PR China
| |
Collapse
|
28
|
Lu WZ, Wang D. Learning machines: Rationale and application in ground-level ozone prediction. Appl Soft Comput 2014. [DOI: 10.1016/j.asoc.2014.07.008] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
29
|
Liu C, Ding J, Toprac AJ, Chai T. Data-based adaptive online prediction model for plant-wide production indices. Knowl Inf Syst 2014. [DOI: 10.1007/s10115-014-0757-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
30
|
Kavousi-Fard A, Kavousi-Fard F. A new hybrid correction method for short-term load forecasting based on ARIMA, SVR and CSA. J EXP THEOR ARTIF IN 2013. [DOI: 10.1080/0952813x.2013.782351] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
31
|
Wenjian W, Husheng G, Yuanfeng J, Jingye B. Granular support vector machine based on mixed measure. Neurocomputing 2013. [DOI: 10.1016/j.neucom.2012.08.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
32
|
|
33
|
Elaissi I, Jaffel I, Taouali O, Messaoud H. Online prediction model based on the SVD-KPCA method. ISA TRANSACTIONS 2013; 52:96-104. [PMID: 23103049 DOI: 10.1016/j.isatra.2012.09.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2012] [Revised: 08/24/2012] [Accepted: 09/29/2012] [Indexed: 06/01/2023]
Abstract
This paper proposes a new method for online identification of a nonlinear system modelled on Reproducing Kernel Hilbert Space (RKHS). The proposed SVD-KPCA method uses the Singular Value Decomposition (SVD) technique to update the principal components. Then we use the Reduced Kernel Principal Component Analysis (RKPCA) to approach the principal components which represent the observations selected by the KPCA method.
Collapse
Affiliation(s)
- Ilyes Elaissi
- Unité de Recherche d'Automatique, Traitement de Signal et Image (ATSI), Monastir 5000, Tunisia.
| | | | | | | |
Collapse
|
34
|
López-Yáñez I, Argüelles-Cruz AJ, Camacho-Nieto O, Yáñez-Márquez C. Pollutants Time-Series Prediction using the Gamma Classifier. INT J COMPUT INT SYS 2011. [DOI: 10.1080/18756891.2011.9727822] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022] Open
|
35
|
Design and comparative study of online kernel methods identification of nonlinear system in RKHS space. Artif Intell Rev 2011. [DOI: 10.1007/s10462-011-9231-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
36
|
Cui L, Xu Y, Jia Q, Wu H, Yuan J. Prediction of the Profit Function for Industrial 2-Keto-L-Gulonic Acid Cultivation. Chem Eng Technol 2011. [DOI: 10.1002/ceat.201000507] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
37
|
Online identification of nonlinear system using reduced kernel principal component analysis. Neural Comput Appl 2010. [DOI: 10.1007/s00521-010-0461-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
38
|
|