1
|
Adhikary S, Tiwari SP, Banerjee S, Dwivedi AD, Rahman SM. Global marine phytoplankton dynamics analysis with machine learning and reanalyzed remote sensing. PeerJ 2024; 12:e17361. [PMID: 38737741 PMCID: PMC11088370 DOI: 10.7717/peerj.17361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 04/18/2024] [Indexed: 05/14/2024] Open
Abstract
Phytoplankton are the world's largest oxygen producers found in oceans, seas and large water bodies, which play crucial roles in the marine food chain. Unbalanced biogeochemical features like salinity, pH, minerals, etc., can retard their growth. With advancements in better hardware, the usage of Artificial Intelligence techniques is rapidly increasing for creating an intelligent decision-making system. Therefore, we attempt to overcome this gap by using supervised regressions on reanalysis data targeting global phytoplankton levels in global waters. The presented experiment proposes the applications of different supervised machine learning regression techniques such as random forest, extra trees, bagging and histogram-based gradient boosting regressor on reanalysis data obtained from the Copernicus Global Ocean Biogeochemistry Hindcast dataset. Results obtained from the experiment have predicted the phytoplankton levels with a coefficient of determination score (R2) of up to 0.96. After further validation with larger datasets, the model can be deployed in a production environment in an attempt to complement in-situ measurement efforts.
Collapse
Affiliation(s)
| | | | | | | | - Syed Masiur Rahman
- King Fahd University of Petroleum and Minerals (KFUPM), Dhahran, Saudi Arabia
| |
Collapse
|
2
|
Zhang D, Shi K, Wang W, Wang X, Zhang Y, Qin B, Zhu M, Dong B, Zhang Y. An optical mechanism-based deep learning approach for deriving water trophic state of China's lakes from Landsat images. WATER RESEARCH 2024; 252:121181. [PMID: 38301525 DOI: 10.1016/j.watres.2024.121181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 12/21/2023] [Accepted: 01/22/2024] [Indexed: 02/03/2024]
Abstract
Widespread eutrophication has been considered as the most serious environment problems in the world. Given the critical roles of lakes in human society and serious negative effects of water eutrophication on lake ecosystems, it is thus fundamentally important to monitor and assess water trophic status of lakes. However, a reliable model for accurately estimating the trophic state index (TSI) of lakes across a large-scale region is still lacking due to their high complexity. Here, we proposed an optical mechanism-based deep learning approach to remotely estimate TSI of lakes based on Landsat images. The approach consists of two steps: (1) determining the optical indicators of TSI and modeling the relationship between them, and (2) developing an approach for remotely deriving the determined optical indicator from Landsat images. With a large number of in situ datasets measured from lakes (2804 samples from 88 lakes) across China with various optical properties, we trained and validated three machine learning methods including deep neural network (DNN), k-nearest neighbors (KNN) and random forest (RF) to model TSI with the optical indicators and TSI and derive the determined optical indicator from Landsat images. The results showed that (1) the total absorption coefficients of optically active constituents at 440 nm (at-w(440)) performs best in characterizing TSI, and (2) DNN outperforms other models in the inversion of both TSI and at-w(440). Overall, our proposed optical mechanism-based deep learning approach demonstrated a robust and satisfactory performance in assessing TSI using Landsat images (root mean squared error (RMSE) = 5.95, mean absolute error (MAE) = 4.81). This highlights its merit as a nationally-adopted method in lake water TSI estimation, enabling the convenience of the acquisition of water eutrophic information in large scale, thereby assisting us in managing lake ecology. Therefore, we assessed water TSI of 961 lakes (>10 km2) across China using the proposed approach. The resulting at-w(440) and TSI ranged from 0.01 m-1 to 31.42 m-1 and from 6 to 96, respectively. Of all these studied lakes, 96 lakes (11.40 %) were oligotrophic, 338 lakes were mesotrophic (40.14 %), 360 lakes were eutrophic (42.76 %), and 48 were hypertrophic (5.70 %) in 2020.
Collapse
Affiliation(s)
- Dong Zhang
- Taihu Laboratory for Lake Ecosystem Research, State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, 73 East Beijing Road, Nanjing 210008, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Kun Shi
- Taihu Laboratory for Lake Ecosystem Research, State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, 73 East Beijing Road, Nanjing 210008, China.
| | - Weijia Wang
- Taihu Laboratory for Lake Ecosystem Research, State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, 73 East Beijing Road, Nanjing 210008, China
| | - Xiwen Wang
- Taihu Laboratory for Lake Ecosystem Research, State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, 73 East Beijing Road, Nanjing 210008, China
| | - Yunlin Zhang
- Taihu Laboratory for Lake Ecosystem Research, State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, 73 East Beijing Road, Nanjing 210008, China
| | - Boqiang Qin
- Taihu Laboratory for Lake Ecosystem Research, State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, 73 East Beijing Road, Nanjing 210008, China
| | - Mengyuan Zhu
- Taihu Laboratory for Lake Ecosystem Research, State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, 73 East Beijing Road, Nanjing 210008, China
| | - Baili Dong
- Taihu Laboratory for Lake Ecosystem Research, State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, 73 East Beijing Road, Nanjing 210008, China
| | - Yibo Zhang
- Taihu Laboratory for Lake Ecosystem Research, State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, 73 East Beijing Road, Nanjing 210008, China
| |
Collapse
|
3
|
Mitra B, Tiwari SP, Uddin MS, Mahmud K, Rahman SM. Decision tree ensemble with Bayesian optimization to predict the spatial dynamics of chlorophyll-a concentration: A case study in Bay of Bengal. MARINE POLLUTION BULLETIN 2024; 199:115945. [PMID: 38150980 DOI: 10.1016/j.marpolbul.2023.115945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Revised: 12/13/2023] [Accepted: 12/14/2023] [Indexed: 12/29/2023]
Abstract
An accurate prediction of the spatial distribution of phytoplankton biomass, as represented by Chlorophyll-a (CHL-a) concentrations, is important for assessing ecological conditions in the marine environment. This study developed a hyperparameter-optimized decision tree-based machine learning (ML) models to predict the geographical distribution of marine phytoplankton CHL-a in the Bay of Bengal. To predict CHL-a over a large spatial extent, satellite-derived remotely sensed data of ocean color features (CHL-a, colored dissolved organic matter, photosynthetically active radiation, particulate organic carbon) and climatic factors (nighttime sea surface temperature, surface absorbed longwave radiation, sea level pressure) from 2003 to 2022 are used to train and test the models. Results obtained from this study have shown the highest concentrations of CHL-a occurred near the Bay's coastal belts and river estuaries. Analysis revealed that aside from photosynthetically active radiation, organic components exhibited a stronger positive relationship with CHL-a than climatic features, which are correlated negatively. Results showed the chosen decision tree methods to all possess higher R2 and lower root mean square error (RMSE) errors. Furthermore, XGBoost outperforms all other models in predicting the geographic distribution of CHL-a. To assess the model efficacy on seasonal basis, a best performing XGBoost model was validated in the Bay of Bengal region which has shown a good performance in predicting the spatial distribution of Chl-a as well as the pixel values during the summer, winter and monsoon seasons. This study provides the best ML model to researchers for predicting CHL-a in the Bay of Bengal. Further it helps to improve our knowledge of CHL-a spatial dynamics and assist in monitoring marine resources in the Bay of Bengal. It worth noting that the water quality in the Indian Ocean is very dynamic in nature, therefore, additional efforts are needed to test the efficacy of this study model over different seasons and spatial gradients.
Collapse
Affiliation(s)
- Bijoy Mitra
- Department of Geography and Environmental Studies, University of Chittagong, Chittagong 4331, Bangladesh
| | - Surya Prakash Tiwari
- Applied Research Center for Environment and Marine Studies, Research Institute, King Fahd University of Petroleum & Minerals, Dhahran 31261, Kingdom of Saudi Arabia.
| | - Mohammed Sakib Uddin
- Department of Geography and Environmental Studies, University of Chittagong, Chittagong 4331, Bangladesh
| | - Khaled Mahmud
- Department of Geography and Environmental Studies, University of Chittagong, Chittagong 4331, Bangladesh
| | - Syed Masiur Rahman
- Applied Research Center for Environment and Marine Studies, Research Institute, King Fahd University of Petroleum & Minerals, Dhahran 31261, Kingdom of Saudi Arabia
| |
Collapse
|
4
|
Zhang Y, Kong X, Deng L, Liu Y. Monitor water quality through retrieving water quality parameters from hyperspectral images using graph convolution network with superposition of multi-point effect: A case study in Maozhou River. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 342:118283. [PMID: 37290307 DOI: 10.1016/j.jenvman.2023.118283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 05/06/2023] [Accepted: 05/26/2023] [Indexed: 06/10/2023]
Abstract
Quantitative prediction by unmanned aerial vehicle (UAV) remote sensing on water quality parameters (WQPs) including phosphorus, nitrogen, chemical oxygen demand (COD), biochemical oxygen demand (BOD), and chlorophyll a (Chl-a), total suspended solids (TSS), and turbidity provides a flexible and effective approach to monitor the variation in water quality. In this study, a deep learning-based method integrating graph convolution network (GCN), gravity model variant, and dual feedback machine involving parametric probability analysis and spatial distribution pattern analysis, named Graph Convolution Network with Superposition of Multi-point Effect (SMPE-GCN) has been developed to calculate concentrations of WQPs through UAV hyperspectral reflectance data on large scale efficiently. With an end-to-end structure, our proposed method has been applied to assisting environmental protection department to trace potential pollution sources in real time. The proposed method is trained on a real-world dataset and its effectiveness is validated on an equal amount of testing dataset with respect to three evaluation metrics including root of mean squared error (RMSE), mean absolute percent error (MAPE), and coefficient of determination (R2). The experimental results demonstrate that our proposed model achieves better performance in comparison with state-of-the-art baseline models in terms of RMSE, MAPE, and R2. The proposed method is applicable for quantifying seven various WQPs and has achieved good performance for each WQP. The resulting MAPE ranges from 7.16% to 10.96% and R2 ranges from 0.80 to 0.94 for all WQPs. This approach brings a novel and systematic insight into real-time quantitative water quality monitoring of urban rivers, and provides a unified framework for in-situ data acquisition, feature engineering, data conversion, and data modeling for further research. It provides fundamental support to assist environmental managers to efficiently monitor water quality of urban rivers.
Collapse
Affiliation(s)
- Yishan Zhang
- College of Mining Engineering, Taiyuan University of Technology, Taiyuan, Shanxi, 030024, China; Institute of Remote Sensing and Geographic Information, Peking University, Beijing, 100871, China.
| | - Xin Kong
- College of Environmental Science and Engineering, Taiyuan University of Technology, Taiyuan, Shanxi, 030024, China
| | - Licui Deng
- Shenzhen Huahan Technology Company, Shenzhen, Guangdong, 518057, China
| | - Yawei Liu
- Shenzhen Huahan Technology Company, Shenzhen, Guangdong, 518057, China
| |
Collapse
|
5
|
Niu J, Feng Z, He M, Xie M, Lv Y, Zhang J, Sun L, Liu Q, Hu BX. Incorporating marine particulate carbon into machine learning for accurate estimation of coastal chlorophyll-a. MARINE POLLUTION BULLETIN 2023; 192:115089. [PMID: 37267869 DOI: 10.1016/j.marpolbul.2023.115089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Revised: 05/18/2023] [Accepted: 05/19/2023] [Indexed: 06/04/2023]
Abstract
Accurate predictions of coastal ocean chlorophyll-a (Chl-a) concentrations are necessary for dynamic water quality monitoring, with eutrophication as a critical factor. Prior studies that used the driven-data method have typically overlooked the relationship between Chl-a and marine particulate carbon. To address this gap, marine particulate carbon was incorporated into machine learning (ML) and deep learning (DL) models to estimate Chl-a concentrations in the Yang Jiang coastal ocean of China. Incorporating particulate organic carbon (POC) and particulate inorganic carbon (PIC) as predictors can lead to successful Chl-a estimation. The Gaussian process regression (GPR) model significantly outperforming the DL model in terms of stability and robustness. A lower POC/Chl-a ratio was observed in coastal areas, in contrast to the higher ratios detected in the southern regions of the study area. This study highlights the efficacy of the GPR model for estimating Chl-a and the importance of considering POC in modeling Chl-a concentrations.
Collapse
Affiliation(s)
- Jie Niu
- College of Resources and Environmental Engineering, Guizhou University, Guiyang 550025, China
| | - Ziyang Feng
- Research Center of Red Tides and Marine Biology, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Mingxia He
- School of Water Resources and Environment, China University of Geosciences, Beijing 10083, China.
| | - Mengyu Xie
- School of Environment, Jinan University, Guangzhou 510632, China
| | - Yanqun Lv
- School of Environment, Jinan University, Guangzhou 510632, China
| | - Juan Zhang
- College of Geographic and Environmental Science, Tianjin Normal University, Tianjin 300387, China
| | - Liwei Sun
- Institute of Environmental and Ecological Engineering, Guangdong University of Technology, Guangzhou 510006, China
| | - Qi Liu
- Research Center of Red Tides and Marine Biology, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Bill X Hu
- School of Water Conservancy and Environment, University of Jinan, Jinan 250022, China
| |
Collapse
|