1
|
Lasko K, O'Neill FD, Sava E. Automated Mapping of Land Cover Type within International Heterogenous Landscapes Using Sentinel-2 Imagery with Ancillary Geospatial Data. SENSORS (BASEL, SWITZERLAND) 2024; 24:1587. [PMID: 38475125 DOI: 10.3390/s24051587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/31/2023] [Revised: 02/01/2024] [Accepted: 02/27/2024] [Indexed: 03/14/2024]
Abstract
A near-global framework for automated training data generation and land cover classification using shallow machine learning with low-density time series imagery does not exist. This study presents a methodology to map nine-class, six-class, and five-class land cover using two dates (winter and non-winter) of a Sentinel-2 granule across seven international sites. The approach uses a series of spectral, textural, and distance decision functions combined with modified ancillary layers (such as global impervious surface and global tree cover) to create binary masks from which to generate a balanced set of training data applied to a random forest classifier. For the land cover masks, stepwise threshold adjustments were applied to reflectance, spectral index values, and Euclidean distance layers, with 62 combinations evaluated. Global (all seven scenes) and regional (arid, tropics, and temperate) adaptive thresholds were computed. An annual 95th and 5th percentile NDVI composite was used to provide temporal corrections to the decision functions, and these corrections were compared against the original model. The accuracy assessment found that the regional adaptive thresholds for both the two-date land cover and the temporally corrected land cover could accurately map land cover type within nine-class (68.4% vs. 73.1%), six-class (79.8% vs. 82.8%), and five-class (80.1% vs. 85.1%) schemes. Lastly, the five-class and six-class models were compared with a manually labeled deep learning model (Esri), where they performed with similar accuracies (five classes: Esri 80.0 ± 3.4%, region corrected 85.1 ± 2.9%). The results highlight not only performance in line with an intensive deep learning approach, but also that reasonably accurate models can be created without a full annual time series of imagery.
Collapse
Affiliation(s)
- Kristofer Lasko
- Geospatial Research Laboratory, Engineer Research and Development Center, 7701 Telegraph Road, Bldg 2592, Alexandria, VA 22315, USA
| | - Francis D O'Neill
- Geospatial Research Laboratory, Engineer Research and Development Center, 7701 Telegraph Road, Bldg 2592, Alexandria, VA 22315, USA
| | - Elena Sava
- Geospatial Research Laboratory, Engineer Research and Development Center, 7701 Telegraph Road, Bldg 2592, Alexandria, VA 22315, USA
| |
Collapse
|
2
|
Modelling and evaluation of land use changes through satellite images in a multifunctional catchment: Social, economic and environmental implications. ECOL INFORM 2022. [DOI: 10.1016/j.ecoinf.2022.101777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
3
|
Limiting the Collection of Ground Truth Data for Land Use and Land Cover Maps with Machine Learning Algorithms. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2022. [DOI: 10.3390/ijgi11060333] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Land use and land cover (LULC) classification maps help understand the state and trends of agricultural production and provide insights for applications in environmental monitoring. One of the major downfalls of the LULC technique is inherently linked to its need for ground truth data to cross-validate maps. This paper aimed at evaluating the efficiency of machine learning (ML) in limiting the use of ground truth data for LULC maps. This was accomplished by (1) extracting reliable LULC information from Sentinel-2 and Landsat-8 s images, (2) generating remote sensing indices used to train ML algorithms, and (3) comparing the results with ground truth data. The remote sensing indices that were tested include the difference vegetation index (DVI), the normalized difference vegetation index (NDVI), the normalized built-up index (NDBI), the urban index (UI), and the normalized bare land index (NBLI). Extracted vegetation indices were evaluated on three ML algorithms, namely, random forest (RF), k-nearest neighbour (K-NN), and k dimensional-tree (KD-Tree). The accuracy of these algorithms was assessed with standard statistical measures and ground truth data randomly collected in Prince Edward Island, Canada. Results showed that high kappa coefficient values were achieved by K-NN (82% and 74%), KD-Tree (80% and 78%), and RF (83% and 73%) for Sentinel-2A and Landsat-8 imagery, respectively. RF was a better classifier than K-NN and KD-Tree and had the highest overall accuracy with Sentinel-2A satellite images (92%). This approach provides the basis for limiting the collection of ground truth data and thus reduces the labour cost, time, and resources needed to collect ground truth data for LULC maps.
Collapse
|
4
|
Extracting tea plantations in complex landscapes using Sentinel-2 imagery and machine learning algorithms. COMMUNITY ECOL 2022. [DOI: 10.1007/s42974-022-00077-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
5
|
Can a Hierarchical Classification of Sentinel-2 Data Improve Land Cover Mapping? REMOTE SENSING 2022. [DOI: 10.3390/rs14040989] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Monitoring of land cover plays an important role in effective environmental management, assessment of natural resources, environmental protection, urban planning and sustainable development. Increasing demand for accurate and repeatable information on land cover and land cover changes causes rapid development of the advanced, machine learning algorithms dedicated to land cover mapping using satellite images. Free and open access to Sentinel-2 data, characterized with high spatial and temporal resolution, increased the potential to map and to monitor land surface with high accuracy and frequency. Despite a considerable number of approaches towards land cover classification based on satellite data, there is still a challenge to clearly separate complex land cover classes, for example grasslands, arable land and wetlands. The aim of this study is to examine, whether a hierarchal classification of Sentinel-2 data can improve the accuracy of land cover mapping and delineation of complex land cover classes. The study is conducted in the Lodz Province, in central Poland. The pixel-based land cover classification is carried out using the machine learning Random Forest (RF) algorithm, based on a time series of Sentinel-2 imagery acquired in 2020. The following nine land cover classes are mapped: sealed surfaces, woodland broadleaved, woodland coniferous, shrubs, permanent herbaceous (grassy cover), periodically herbaceous (i.e., arable land), mosses, non-vegetated (bare soil) and water bodies. The land cover classification is conducted following two approaches: (1) flat, where all land cover classes are classified together, and (2) hierarchical, where the stratification is applied to first separate the most stable land cover classes and then classifying the most problematic once. The national databases served as the source of the reference sampling plots for the classification process. The process of selection and verification of the reference sampling plots is performed automatically. To assess the stability of the classification models the classification processes are performed iteratively. The results of this study confirmed that the hierarchical approach gave more accurate results compared to the commonly used flat approach. The median of the overall accuracy (OA) of the hierarchical classification was higher by 3–9 percentage points compared to the flat one. Of interest, the OA of the hierarchical classification reached 0.93–0.99, whereas the flat approach reached 0.90. Individual classes are also better classified in the hierarchical approach.
Collapse
|
6
|
Patel P, Chaudhary S, Parmar H. Analyze the Impact of Weather Parameters for Crop Yield Prediction Using Deep Learning. BIG DATA ANALYTICS 2022. [DOI: 10.1007/978-3-031-24094-2_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
|
7
|
From Land Cover Map to Land Use Map: A Combined Pixel-Based and Object-Based Approach Using Multi-Temporal Landsat Data, a Random Forest Classifier, and Decision Rules. REMOTE SENSING 2021. [DOI: 10.3390/rs13091700] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
It is essential to produce land cover maps and land use maps separately for different purposes. This study was conducted to generate such maps in Binh Duong province, Vietnam, using a novel combination of pixel-based and object-based classification techniques and geographic information system (GIS) analysis on multi-temporal Landsat images. Firstly, the connection between land cover and land use was identified; thereafter, the land cover map and land use function regions were extracted with a random forest classifier. Finally, a land use map was generated by combining the land cover map and the land use function regions in a set of decision rules. The results showed that land cover and land use were linked by spectral, spatial, and temporal characteristics, and this helped effectively convert the land cover map into a land use map. The final land cover map attained an overall accuracy (OA) = 93.86%, with producer’s accuracy (PA) and user’s accuracy (UA) of its classes ranging from 73.91% to 100%. Meanwhile, the final land use map achieved OA = 93.45%, and the UA and PA ranged from 84% to 100%. The study demonstrated that it is possible to create high-accuracy maps based entirely on free multi-temporal satellite imagery that promote the reproducibility and proactivity of the research as well as cost-efficiency and time savings.
Collapse
|
8
|
Machine Learning-Based Processing Proof-of-Concept Pipeline for Semi-Automatic Sentinel-2 Imagery Download, Cloudiness Filtering, Classifications, and Updates of Open Land Use/Land Cover Datasets. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2021. [DOI: 10.3390/ijgi10020102] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Land use and land cover are continuously changing in today’s world. Both domains, therefore, have to rely on updates of external information sources from which the relevant land use/land cover (classification) is extracted. Satellite images are frequent candidates due to their temporal and spatial resolution. On the contrary, the extraction of relevant land use/land cover information is demanding in terms of knowledge base and time. The presented approach offers a proof-of-concept machine-learning pipeline that takes care of the entire complex process in the following manner. The relevant Sentinel-2 images are obtained through the pipeline. Later, cloud masking is performed, including the linear interpolation of merged-feature time frames. Subsequently, four-dimensional arrays are created with all potential training data to become a basis for estimators from the scikit-learn library; the LightGBM estimator is then used. Finally, the classified content is applied to the open land use and open land cover databases. The verification of the provided experiment was conducted against detailed cadastral data, to which Shannon’s entropy was applied since the number of cadaster information classes was naturally consistent. The experiment showed a good overall accuracy (OA) of 85.9%. It yielded a classified land use/land cover map of the study area consisting of 7188 km2 in the southern part of the South Moravian Region in the Czech Republic. The developed proof-of-concept machine-learning pipeline is replicable to any other area of interest so far as the requirements for input data are met.
Collapse
|
9
|
Abstract
This paper studies the oil spill, which occurred in the Norilsk and Taimyr region of Russia due to the collapse of the fuel tank at the power station on May 29, 2020. We monitored the snow, ice, water, vegetation and wetland of the region using data from the Multi-Spectral Instruments (MSI) of Sentinel-2 satellite. We analyzed the spectral band absorptions of Sentinel-2 data acquired before, during and after the incident, developed true and false-color composites (FCC), decorrelated spectral bands and used the indices, i.e. Snow Water Index (SWI), Normalized Difference Water Index (NDWI) and Normalized Difference Vegetation Index (NDVI). The results of decorrelated spectral bands 3, 8, and 11 of Sentinel-2 well confirmed the results of SWI, NDWI, NDVI, and FCC images showing the intensive snow and ice melt between May 21 and 31, 2020. We used Sentinel-2 results, field photographs, analysis of the 1980-2020 daily air temperature and precipitation data, permafrost observations and modeling to explore the hypothesis that either the long-term dynamics of the frozen ground, changing climate and environmental factors, or abnormal weather conditions may have caused or contributed to the collapse of the oil tank.
Collapse
|
10
|
Abd El-Hamid HT, Kaloop MR, Abdalla EM, Hu JW, Zarzoura F. Assessment and prediction of land-use/land-cover change around Blue Nile and White Nile due to flood hazards in Khartoum, Sudan, based on geospatial analysis. GEOMATICS, NATURAL HAZARDS AND RISK 2021; 12:1258-1286. [DOI: 10.1080/19475705.2021.1923577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 04/23/2021] [Accepted: 04/23/2021] [Indexed: 09/02/2023]
Affiliation(s)
| | - Mosbeh R. Kaloop
- Department of Civil and Environmental Engineering, Incheon National University, Incheon, Korea
- Incheon Disaster Prevention Research Center, Incheon National University, Incheon, Korea
- Public Works Engineering Department, Mansoura University, Mansoura, Egypt
| | - Elhamem M. Abdalla
- National Geological Council, Khartoum, Sudan
- Department of Information Engineering, Ningxia University, Ningxia, China
| | - Jong Wan Hu
- Department of Civil and Environmental Engineering, Incheon National University, Incheon, Korea
- Incheon Disaster Prevention Research Center, Incheon National University, Incheon, Korea
| | - Fawzi Zarzoura
- Public Works Engineering Department, Mansoura University, Mansoura, Egypt
| |
Collapse
|
11
|
RUESVMs: An Ensemble Method to Handle the Class Imbalance Problem in Land Cover Mapping Using Google Earth Engine. REMOTE SENSING 2020. [DOI: 10.3390/rs12213484] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Timely and accurate Land Cover (LC) information is required for various applications, such as climate change analysis and sustainable development. Although machine learning algorithms are most likely successful in LC mapping tasks, the class imbalance problem is known as a common challenge in this regard. This problem occurs during the training phase and reduces classification accuracy for infrequent and rare LC classes. To address this issue, this study proposes a new method by integrating random under-sampling of majority classes and an ensemble of Support Vector Machines, namely Random Under-sampling Ensemble of Support Vector Machines (RUESVMs). The performance of RUESVMs for LC classification was evaluated in Google Earth Engine (GEE) over two different case studies using Sentinel-2 time-series data and five well-known spectral indices, including the Normalized Difference Vegetation Index (NDVI), Green Normalized Difference Vegetation Index (GNDVI), Soil-Adjusted Vegetation Index (SAVI), Normalized Difference Built-up Index (NDBI), and Normalized Difference Water Index (NDWI). The performance of RUESVMs was also compared with the traditional SVM and combination of SVM with three benchmark data balancing techniques namely the Random Over-Sampling (ROS), Random Under-Sampling (RUS), and Synthetic Minority Over-sampling Technique (SMOTE). It was observed that the proposed method considerably improved the accuracy of LC classification, especially for the minority classes. After adopting RUESVMs, the overall accuracy of the generated LC map increased by approximately 4.95 percentage points, and this amount for the geometric mean of producer’s accuracies was almost 3.75 percentage points, in comparison to the most accurate data balancing method (i.e., SVM-SMOTE). Regarding the geometric mean of users’ accuracies, RUESVMs also outperformed the SVM-SMOTE method with an average increase of 6.45 percentage points.
Collapse
|