1
|
Wen C, Ma Z, Ren J, Zhang T, Zhang L, Chen H, Su H, Yang C, Chen H, Guo W. A generalized model for accurate wheat spike detection and counting in complex scenarios. Sci Rep 2024; 14:24189. [PMID: 39407029 PMCID: PMC11480395 DOI: 10.1038/s41598-024-75523-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Accepted: 10/07/2024] [Indexed: 10/19/2024] Open
Abstract
Wheat is a crucial crop worldwide, and accurate detection and counting of wheat spikes are vital for yield estimation and breeding. However, these tasks are daunting in complex field environments. To tackle this, we introduce RIA-SpikeNet, a model designed to detect and count wheat spikes in such conditions. First, we introduce an Implicit Decoupling Detection Head to incorporate more implicit knowledge, enabling the model to better distinguish visually similar wheat spikes. Second, Asymmetric Loss is employed as the confidence loss function, enhancing the learning weights of positive and hard samples, thus improving performance in complex scenes. Lastly, the backbone network is modified through reparameterization and the use of larger convolutional kernels, expanding the effective receptive field and improving shape information extraction. These enhancements significantly improve the model's ability to detect and count wheat spikes accurately. RIA-SpikeNet outperforms the state-of-the-art YOLOv8 detection model, achieving a competitive 81.54% mAP and 90.29% R2. The model demonstrates superior performance in challenging scenarios, providing an effective tool for wheat spike yield estimation in field environments and valuable support for wheat production and breeding efforts.
Collapse
Affiliation(s)
- Changji Wen
- College of Information and Technology, Jilin Agricultural University, Changchun, China.
| | - Zhenyu Ma
- College of Information and Technology, Jilin Agricultural University, Changchun, China
| | - Junfeng Ren
- College of Information and Technology, Jilin Agricultural University, Changchun, China
| | - Tian Zhang
- College of Information and Technology, Jilin Agricultural University, Changchun, China
| | - Long Zhang
- College of Information and Technology, Jilin Agricultural University, Changchun, China
| | - Hongrui Chen
- College of Information and Technology, Jilin Agricultural University, Changchun, China
| | - Hengqiang Su
- College of Information and Technology, Jilin Agricultural University, Changchun, China
| | - Ce Yang
- College of Food, Agricultural and Natural Resource Sciences, University of Minnesota, Twin Cities, Minnesota, MN, USA
| | - Hongbing Chen
- College of Information and Technology, Jilin Agricultural University, Changchun, China
- Key Laboratory of Urban Land Resources Monitoring and Simulation, Ministry of Natural Resources, Shenzhen, China
| | - Wei Guo
- Graduate School of Agriculture and Life Sciences, The University of Tokyo, Tokyo, Japan.
| |
Collapse
|
2
|
Mustafa G, Liu Y, Khan IH, Hussain S, Jiang Y, Liu J, Arshad S, Osman R. Establishing a knowledge structure for yield prediction in cereal crops using unmanned aerial vehicles. FRONTIERS IN PLANT SCIENCE 2024; 15:1401246. [PMID: 39184579 PMCID: PMC11341481 DOI: 10.3389/fpls.2024.1401246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Accepted: 07/15/2024] [Indexed: 08/27/2024]
Abstract
Recently, a rapid advancement in using unmanned aerial vehicles (UAVs) for yield prediction (YP) has led to many YP research findings. This study aims to visualize the intellectual background, research progress, knowledge structure, and main research frontiers of the entire YP domain for main cereal crops using VOSviewer and a comprehensive literature review. To develop visualization networks of UAVs related knowledge for YP of wheat, maize, rice, and soybean (WMRS) crops, the original research articles published between January 2001 and August 2023 were retrieved from the web of science core collection (WOSCC) database. Significant contributors have been observed to the growth of YP-related research, including the most active countries, prolific publications, productive writers and authors, the top contributing institutions, influential journals, papers, and keywords. Furthermore, the study observed the primary contributions of YP for WMRS crops using UAVs at the micro, meso, and macro levels and the degree of collaboration and information sources for YP. Moreover, the policy assistance from the People's Republic of China, the United States of America, Germany, and Australia considerably advances the knowledge of UAVs connected to YP of WMRS crops, revealed under investigation of grants and collaborating nations. Lastly, the findings of WMRS crops for YP are presented regarding the data type, algorithms, results, and study location. The remote sensing community can significantly benefit from this study by being able to discriminate between the most critical sub-domains of the YP literature for WMRS crops utilizing UAVs and to recommend new research frontiers for concentrating on the essential directions for subsequent studies.
Collapse
Affiliation(s)
- Ghulam Mustafa
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing, China
- College of Agriculture, Nanjing Agricultural University, Nanjing, China
| | - Yuhong Liu
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing, China
| | - Imran Haider Khan
- College of Agriculture, Nanjing Agricultural University, Nanjing, China
| | - Sarfraz Hussain
- College of Physics and Optoelectronic Engineering, Shenzhen University, Shenzhen, China
| | - Yuhan Jiang
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing, China
| | - Jiayuan Liu
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing, China
| | - Saeed Arshad
- College of Agriculture, Nanjing Agricultural University, Nanjing, China
| | - Raheel Osman
- Department of Agronomy, Iowa State University, Ames, IA, United States
| |
Collapse
|
3
|
Xu X, Zhou L, Yu H, Sun G, Fei S, Zhu J, Ma Y. Winter wheat ear counting based on improved YOLOv7x and Kalman filter tracking algorithm with video streaming. FRONTIERS IN PLANT SCIENCE 2024; 15:1346182. [PMID: 38952848 PMCID: PMC11215144 DOI: 10.3389/fpls.2024.1346182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 05/31/2024] [Indexed: 07/03/2024]
Abstract
Accurate and real-time field wheat ear counting is of great significance for wheat yield prediction, genetic breeding and optimized planting management. In order to realize wheat ear detection and counting under the large-resolution Unmanned Aerial Vehicle (UAV) video, Space to depth (SPD) module was added to the deep learning model YOLOv7x. The Normalized Gaussian Wasserstein Distance (NWD) Loss function is designed to create a new detection model YOLOv7xSPD. The precision, recall, F1 score and AP of the model on the test set are 95.85%, 94.71%, 95.28%, and 94.99%, respectively. The AP value is 1.67% higher than that of YOLOv7x, and 10.41%, 39.32%, 2.96%, and 0.22% higher than that of Faster RCNN, SSD, YOLOv5s, and YOLOv7. YOLOv7xSPD is combined with the Kalman filter tracking and the Hungarian matching algorithm to establish a wheat ear counting model with the video flow, called YOLOv7xSPD Counter, which can realize real-time counting of wheat ears in the field. In the video with a resolution of 3840×2160, the detection frame rate of YOLOv7xSPD Counter is about 5.5FPS. The counting results are highly correlated with the ground truth number (R2 = 0.99), and can provide model basis for wheat yield prediction, genetic breeding and optimized planting management.
Collapse
Affiliation(s)
- Xingmei Xu
- College of Information and Technology, Jilin Agricultural University, Changchun, Jilin, China
| | - Lei Zhou
- College of Information and Technology, Jilin Agricultural University, Changchun, Jilin, China
| | - Helong Yu
- College of Information and Technology, Jilin Agricultural University, Changchun, Jilin, China
| | - Guangyao Sun
- State Key Laboratory of Vegetable Biobreeding, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China
- College of Land Science and Technology, China Agricultural University, Beijing, China
| | - Shuaipeng Fei
- State Key Laboratory of Vegetable Biobreeding, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China
- College of Land Science and Technology, China Agricultural University, Beijing, China
| | - Jinyu Zhu
- State Key Laboratory of Vegetable Biobreeding, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yuntao Ma
- College of Land Science and Technology, China Agricultural University, Beijing, China
| |
Collapse
|
4
|
Zhang G, Wang Z, Liu B, Gu L, Zhen W, Yao W. A density map-based method for counting wheat ears. FRONTIERS IN PLANT SCIENCE 2024; 15:1354428. [PMID: 38751835 PMCID: PMC11094358 DOI: 10.3389/fpls.2024.1354428] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Accepted: 04/15/2024] [Indexed: 05/18/2024]
Abstract
Introduction Field wheat ear counting is an important step in wheat yield estimation, and how to solve the problem of rapid and effective wheat ear counting in a field environment to ensure the stability of food supply and provide more reliable data support for agricultural management and policy making is a key concern in the current agricultural field. Methods There are still some bottlenecks and challenges in solving the dense wheat counting problem with the currently available methods. To address these issues, we propose a new method based on the YOLACT framework that aims to improve the accuracy and efficiency of dense wheat counting. Replacing the pooling layer in the CBAM module with a GeM pooling layer, and then introducing the density map into the FPN, these improvements together make our method better able to cope with the challenges in dense scenarios. Results Experiments show our model improves wheat ear counting performance in complex backgrounds. The improved attention mechanism reduces the RMSE from 1.75 to 1.57. Based on the improved CBAM, the R2 increases from 0.9615 to 0.9798 through pixel-level density estimation, the density map mechanism accurately discerns overlapping count targets, which can provide more granular information. Discussion The findings demonstrate the practical potential of our framework for intelligent agriculture applications.
Collapse
Affiliation(s)
- Guangwei Zhang
- College of Information Science and Technology, Hebei Agricultural University, Baoding, China
- Hebei Key Laboratory of Agricultural Big Data, Hebei Agricultural University, Baoding, China
| | - Zhichao Wang
- College of Information Science and Technology, Hebei Agricultural University, Baoding, China
- Hebei Key Laboratory of Agricultural Big Data, Hebei Agricultural University, Baoding, China
| | - Bo Liu
- College of Information Science and Technology, Hebei Agricultural University, Baoding, China
- Hebei Key Laboratory of Agricultural Big Data, Hebei Agricultural University, Baoding, China
| | - Limin Gu
- State Key Laboratory of North China Crop Improvement and Regulation, Baoding, China
- College of Agronomy, Hebei Agricultural University, Baoding, China
- Key Laboratory of North China Water-savinssg Agriculture, Ministry of Agriculture and Rural Affairs, Baoding, Hebei, China
| | - Wenchao Zhen
- State Key Laboratory of North China Crop Improvement and Regulation, Baoding, China
- College of Agronomy, Hebei Agricultural University, Baoding, China
- Key Laboratory of North China Water-savinssg Agriculture, Ministry of Agriculture and Rural Affairs, Baoding, Hebei, China
| | - Wei Yao
- College of Information Science and Technology, Hebei Agricultural University, Baoding, China
- Hebei Key Laboratory of Agricultural Big Data, Hebei Agricultural University, Baoding, China
- Key Laboratory of North China Water-savinssg Agriculture, Ministry of Agriculture and Rural Affairs, Baoding, Hebei, China
| |
Collapse
|
5
|
Lai H, Chen L, Liu W, Yan Z, Ye S. STC-YOLO: Small Object Detection Network for Traffic Signs in Complex Environments. SENSORS (BASEL, SWITZERLAND) 2023; 23:5307. [PMID: 37300034 PMCID: PMC10255978 DOI: 10.3390/s23115307] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 05/29/2023] [Accepted: 06/01/2023] [Indexed: 06/12/2023]
Abstract
The detection of traffic signs is easily affected by changes in the weather, partial occlusion, and light intensity, which increases the number of potential safety hazards in practical applications of autonomous driving. To address this issue, a new traffic sign dataset, namely the enhanced Tsinghua-Tencent 100K (TT100K) dataset, was constructed, which includes the number of difficult samples generated using various data augmentation strategies such as fog, snow, noise, occlusion, and blur. Meanwhile, a small traffic sign detection network for complex environments based on the framework of YOLOv5 (STC-YOLO) was constructed to be suitable for complex scenes. In this network, the down-sampling multiple was adjusted, and a small object detection layer was adopted to obtain and transmit richer and more discriminative small object features. Then, a feature extraction module combining a convolutional neural network (CNN) and multi-head attention was designed to break the limitations of ordinary convolution extraction to obtain a larger receptive field. Finally, the normalized Gaussian Wasserstein distance (NWD) metric was introduced to make up for the sensitivity of the intersection over union (IoU) loss to the location deviation of tiny objects in the regression loss function. A more accurate size of the anchor boxes for small objects was achieved using the K-means++ clustering algorithm. Experiments on 45 types of sign detection results on the enhanced TT100K dataset showed that the STC-YOLO algorithm outperformed YOLOv5 by 9.3% in the mean average precision (mAP), and the performance of STC-YOLO was comparable with that of the state-of-the-art methods on the public TT100K dataset and CSUST Chinese Traffic Sign Detection Benchmark (CCTSDB2021) dataset.
Collapse
Affiliation(s)
| | - Liangyan Chen
- School of Electric and Electronic Engineering, Wuhan Polytechnic University, Wuhan 430023, China; (H.L.); (W.L.); (Z.Y.); (S.Y.)
| | | | | | | |
Collapse
|