1
|
Yuan Y, Yang L, Chang K, Huang Y, Yang H, Wang J. DSCA-PSPNet: Dynamic spatial-channel attention pyramid scene parsing network for sugarcane field segmentation in satellite imagery. FRONTIERS IN PLANT SCIENCE 2024; 14:1324491. [PMID: 38298601 PMCID: PMC10829042 DOI: 10.3389/fpls.2023.1324491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 12/20/2023] [Indexed: 02/02/2024]
Abstract
Sugarcane plays a vital role in many global economies, and its efficient cultivation is critical for sustainable development. A central challenge in sugarcane yield prediction and cultivation management is the precise segmentation of sugarcane fields from satellite imagery. This task is complicated by numerous factors, including varying environmental conditions, scale variability, and spectral similarities between crops and non-crop elements. To address these segmentation challenges, we introduce DSCA-PSPNet, a novel deep learning model with a unique architecture that combines a modified ResNet34 backbone, the Pyramid Scene Parsing Network (PSPNet), and newly proposed Dynamic Squeeze-and-Excitation Context (D-scSE) blocks. Our model effectively adapts to discern the importance of both spatial and channel-wise information, providing superior feature representation for sugarcane fields. We have also created a comprehensive high-resolution satellite imagery dataset from Guangxi's Fusui County, captured on December 17, 2017, which encompasses a broad spectrum of sugarcane field characteristics and environmental conditions. In comparative studies, DSCA-PSPNet outperforms other state-of-the-art models, achieving an Intersection over Union (IoU) of 87.58%, an accuracy of 92.34%, a precision of 93.80%, a recall of 93.21%, and an F1-Score of 92.38%. Application tests on an RTX 3090 GPU, with input image resolutions of 512 × 512, yielded a prediction time of 4.57ms, a parameter size of 22.57MB, GFLOPs of 11.41, and a memory size of 84.47MB. An ablation study emphasized the vital role of the D-scSE module in enhancing DSCA-PSPNet's performance. Our contributions in dataset generation and model development open new avenues for tackling the complexities of sugarcane field segmentation, thus contributing to advances in precision agriculture. The source code and dataset will be available on the GitHub repository https://github.com/JulioYuan/DSCA-PSPNet/tree/main.
Collapse
Affiliation(s)
- Yujian Yuan
- School of Computer, Electronics, and Information, Guangxi University, Nanning, China
- Guangxi Key Laboratory of Multimedia Communications and Network Technology, School of Computer, Electronics, and Information, Guangxi University, Nanning, China
| | - Lina Yang
- School of Computer, Electronics, and Information, Guangxi University, Nanning, China
- Guangxi Key Laboratory of Multimedia Communications and Network Technology, School of Computer, Electronics, and Information, Guangxi University, Nanning, China
| | - Kan Chang
- School of Computer, Electronics, and Information, Guangxi University, Nanning, China
| | - Youju Huang
- Guangxi Institute of Remote Sensing of Natural Resources, Nanning, China
| | - Haoyan Yang
- School of Computer, Electronics, and Information, Guangxi University, Nanning, China
| | - Jiale Wang
- School of Computer, Electronics, and Information, Guangxi University, Nanning, China
| |
Collapse
|