1
|
Li L, Gao S, Wu F, An X. MBAN: multi-branch attention network for small object detection. PeerJ Comput Sci 2024; 10:e1965. [PMID: 38660186 PMCID: PMC11041927 DOI: 10.7717/peerj-cs.1965] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 03/06/2024] [Indexed: 04/26/2024]
Abstract
Recent years small object detection has seen remarkable advancement. However, small objects are difficult to accurately detect in complex scenes due to their low resolution. The downsampling operation inevitably leads to the loss of information for small objects. In order to solve these issues, this article proposes a novel Multi-branch Attention Network (MBAN) to improve the detection performance of small objects. Firstly, an innovative Multi-branch Attention Module (MBAM) is proposed, which consists of two parts, i.e. Multi-branch structure consisting of convolution and maxpooling, and the parameter-free SimAM attention mechanism. By combining these two parts, the number of network parameters is reduced, the information loss of small objects is reduced, and the representation of small object features is enhanced. Furthermore, to systematically solve the problem of small object localization, a pre-processing method called Adaptive Clustering Relocation (ACR) is proposed. To validate our network, we conducted extensive experiments on two benchmark datasets, i.e. NWPU VHR-10 and PASCAL VOC. The findings from the experiment demonstrates the significant performance gains of MBAN over most existing algorithms, the mAP of MBAN achieved 96.55% and 84.96% on NWPU VHR-10 and PASCAL VOC datasets, respectively, which proves that MBAN has significant performance in small object detection.
Collapse
Affiliation(s)
- Li Li
- School of Information and Electrical Engineering, Hebei University of Engineering, Handan, Hebei, China
| | - Shuaikun Gao
- School of Information and Electrical Engineering, Hebei University of Engineering, Handan, Hebei, China
| | - Fangfang Wu
- School of Information and Electrical Engineering, Hebei University of Engineering, Handan, Hebei, China
| | - Xin An
- School of Information and Electrical Engineering, Hebei University of Engineering, Handan, Hebei, China
| |
Collapse
|
2
|
Lee C, Liao Z, Li Y, Lai Q, Guo Y, Huang J, Li S, Wang Y, Shi R. Placental MRI segmentation based on multi-receptive field and mixed attention separation mechanism. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 242:107699. [PMID: 37769416 DOI: 10.1016/j.cmpb.2023.107699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 06/21/2023] [Accepted: 06/25/2023] [Indexed: 09/30/2023]
Abstract
OBJECTIVE To reduce the occurrence of massive bleeding during placental abruption in patients with placenta accrete, we established a medical imaging based on multi-receptive field and mixed attention separation mechanism (MRF-MAS) model to improve the accuracy of MRI placenta segmentation and provide a basis for subsequent placenta accreta. METHODS We propose a placenta MRI segmentation technology using the MRF-MAS framework to develop a medical image diagnostic technique. The model first uses the multi-receptive field feature structure to obtain multi-level information, and improves the expression of features at differing scales. Note that the hybrid attention mechanism combines channel attention and spatial attention, separates the input feature sets and computes the attention separately, and finally reorganizes the feature maps. To show that the model can improve the accuracy of segmenting the placenta, we adopt mean Intersection over Union (IoU), Dice similarity coefficient (Dice) and area under the receiver operating characteristic curve (AUC) with U-Net, Mask RCNN, Deeplab v3 for comparison. RESULTS The four models achieved different outcomes based on our placenta dataset, with our model IoU and Dice up to 0.8169 and 0.8992, which are 5.51% and 3.03% higher than the average of the three comparison models. CONCLUSION The model proposed by us is helpful to assist the imaging diagnosis and at the same time provides a quantitative reference for the precise treatment of placenta accreta, assists the Equationtion of the clinical operation plan of the physician, and promotes the precision medicine of placenta accreta.
Collapse
Affiliation(s)
- Cong Lee
- School of Computer Science and Engineering, Central South University, Changsha, Hunan 410083, China
| | - Zhifang Liao
- School of Computer Science and Engineering, Central South University, Changsha, Hunan 410083, China
| | - Yuanzhe Li
- Department of CT/MRI, The Second Affiliated Hospital of Fujian Medical University, Quanzhou 362000, China
| | - Qingquan Lai
- Department of CT/MRI, The Second Affiliated Hospital of Fujian Medical University, Quanzhou 362000, China
| | - Yingying Guo
- Department of CT/MRI, The Second Affiliated Hospital of Fujian Medical University, Quanzhou 362000, China
| | - Jing Huang
- Department of CT/MRI, The Second Affiliated Hospital of Fujian Medical University, Quanzhou 362000, China
| | - Shuting Li
- Department of CT/MRI, The Second Affiliated Hospital of Fujian Medical University, Quanzhou 362000, China
| | - Yi Wang
- Department of CT/MRI, The Second Affiliated Hospital of Fujian Medical University, Quanzhou 362000, China
| | - Ruizheng Shi
- Department of Cardiovascular Medicine, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China; National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China.
| |
Collapse
|
3
|
Wu Y, Li J. YOLOv4 with Deformable-Embedding-Transformer Feature Extractor for Exact Object Detection in Aerial Imagery. SENSORS (BASEL, SWITZERLAND) 2023; 23:s23052522. [PMID: 36904727 PMCID: PMC10007093 DOI: 10.3390/s23052522] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 02/21/2023] [Accepted: 02/23/2023] [Indexed: 05/27/2023]
Abstract
The deep learning method for natural-image object detection tasks has made tremendous progress in recent decades. However, due to multiscale targets, complex backgrounds, and high-scale small targets, methods from the field of natural images frequently fail to produce satisfactory results when applied to aerial images. To address these problems, we proposed the DET-YOLO enhancement based on YOLOv4. Initially, we employed a vision transformer to acquire highly effective global information extraction capabilities. In the transformer, we proposed deformable embedding instead of linear embedding and a full convolution feedforward network (FCFN) instead of a feedforward network in order to reduce the feature loss caused by cutting in the embedding process and improve the spatial feature extraction capability. Second, for improved multiscale feature fusion in the neck, we employed a depth direction separable deformable pyramid module (DSDP) rather than a feature pyramid network. Experiments on the DOTA, RSOD, and UCAS-AOD datasets demonstrated that our method's average accuracy (mAP) values reached 0.728, 0.952, and 0.945, respectively, which were comparable to the existing state-of-the-art methods.
Collapse
Affiliation(s)
| | - Jianjun Li
- Correspondence: ; Tel.: +86-137-5513-6109
| |
Collapse
|
4
|
Silva LA, Sales Mendes A, Sánchez San Blas H, Caetano Bastos L, Leopoldo Gonçalves A, Fabiano de Moraes A. Active Actions in the Extraction of Urban Objects for Information Quality and Knowledge Recommendation with Machine Learning. SENSORS (BASEL, SWITZERLAND) 2022; 23:138. [PMID: 36616734 PMCID: PMC9824459 DOI: 10.3390/s23010138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 12/16/2022] [Accepted: 12/20/2022] [Indexed: 06/17/2023]
Abstract
Due to the increasing urban development, it has become important for municipalities to permanently understand land use and ecological processes, and make cities smart and sustainable by implementing technological tools for land monitoring. An important problem is the absence of technologies that certify the quality of information for the creation of strategies. In this context, expressive volumes of data are used, requiring great effort to understand their structures, and then access information with the desired quality. This study are designed to provide an initial response to the need for mapping zones in the city of Itajaí (SC), Brazil. The solution proposes to aid object recognition employing object-based classifiers OneR, NaiveBayes, J48, IBk, and Hoeffding Tree algorithms used together with GeoDMA, and a first approach in the use of Region-based Convolutional Neural Network (R-CNN) and the YOLO algorithm. All this is to characterize vegetation zones, exposed soil zones, asphalt, and buildings within an urban and rural area. Through the implemented model for active identification of geospatial objects with similarity levels, it was possible to apply the data crossover after detecting the best classifier with accuracy (85%) and the kappa agreement coefficient (76%). The case study presents the dynamics of urban and rural expansion, where expressive volumes of data are obtained and submitted to different methods of cataloging and preparation to subsidize rapid control actions. Finally, the research describes a practical and systematic approach, evaluating the extraction of information to the recommendation of knowledge with greater scientific relevance. Allowing the methods presented to apply the calibration of values for each object, to achieve results with greater accuracy, which is proposed to help improve conservation and management decisions related to the zones within the city, leaving as a legacy the construction of a minimum technological infrastructure to support the decision.
Collapse
Affiliation(s)
- Luis Augusto Silva
- Expert Systems and Applications Lab (ESALAB), Faculty of Science, University of Salamanca, 37008 Salamanca, Spain
| | - André Sales Mendes
- Expert Systems and Applications Lab (ESALAB), Faculty of Science, University of Salamanca, 37008 Salamanca, Spain
| | - Héctor Sánchez San Blas
- Expert Systems and Applications Lab (ESALAB), Faculty of Science, University of Salamanca, 37008 Salamanca, Spain
| | - Lia Caetano Bastos
- Department of Knowledge Engineering and Management, Federal University of Santa Catarina, Florianopolis 88040-900, Brazil
| | - Alexandre Leopoldo Gonçalves
- Department of Knowledge Engineering and Management, Federal University of Santa Catarina, Florianopolis 88040-900, Brazil
| | - André Fabiano de Moraes
- Department of Knowledge Engineering and Management, Federal University of Santa Catarina, Florianopolis 88040-900, Brazil
- Department Information Technology, IT Institute Federal of Science Technology IFC, Camboriú 88340-055, Brazil
| |
Collapse
|
5
|
Zhang Q, Wang Y, Song L, Han M, Song H. Using an improved YOLOv5s network for the automatic detection of silicon on wheat straw epidermis of micrographs. J FIELD ROBOT 2022. [DOI: 10.1002/rob.22120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Qianru Zhang
- College of Mechanical and Electronic Engineering Northwest A&F University Yangling Shaanxi China
- Key Laboratory of Agricultural Internet of Things Ministry of Agriculture and Rural Affairs Yangling Shaanxi China
- Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Services Yangling China
| | - Yunfei Wang
- College of Mechanical and Electronic Engineering Northwest A&F University Yangling Shaanxi China
- Key Laboratory of Agricultural Internet of Things Ministry of Agriculture and Rural Affairs Yangling Shaanxi China
- Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Services Yangling China
| | - Lei Song
- College of Mechanical and Electronic Engineering Northwest A&F University Yangling Shaanxi China
- Key Laboratory of Agricultural Internet of Things Ministry of Agriculture and Rural Affairs Yangling Shaanxi China
- Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Services Yangling China
| | - Mengxuan Han
- College of Mechanical and Electronic Engineering Northwest A&F University Yangling Shaanxi China
- Key Laboratory of Agricultural Internet of Things Ministry of Agriculture and Rural Affairs Yangling Shaanxi China
- Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Services Yangling China
| | - Huaibo Song
- College of Mechanical and Electronic Engineering Northwest A&F University Yangling Shaanxi China
- Key Laboratory of Agricultural Internet of Things Ministry of Agriculture and Rural Affairs Yangling Shaanxi China
- Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Services Yangling China
| |
Collapse
|
6
|
YOLO-DSD: A YOLO-Based Detector Optimized for Better Balance between Accuracy, Deployability and Inference Time in Optical Remote Sensing Object Detection. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12157622] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
Many deep learning (DL)-based detectors have been developed for optical remote sensing object detection in recent years. However, most of the recent detectors are developed toward the pursuit of a higher accuracy, but little toward a balance between accuracy, deployability and inference time, which hinders the practical application for these detectors, especially in embedded devices. In order to achieve a higher detection accuracy and reduce the computational consumption and inference time simultaneously, a novel convolutional network named YOLO-DSD was developed based on YOLOv4. Firstly, a new feature extraction module, a dense residual (DenseRes) block, was proposed in a backbone network by utilizing a series-connected residual structure with the same topology for improving feature extraction while reducing the computational consumption and inference time. Secondly, convolution layer–batch normalization layer–leaky ReLu (CBL) ×5 modules in the neck, named S-CBL×5, were improved with a short-cut connection in order to mitigate feature loss. Finally, a low-cost novel attention mechanism called a dual channel attention (DCA) block was introduced to each S-CBL×5 for a better representation of features. The experimental results in the DIOR dataset indicate that YOLO-DSD outperforms YOLOv4 by increasing mAP0.5 from 71.3% to 73.0%, with a 23.9% and 29.7% reduction in Params and Flops, respectively, but a 50.2% improvement in FPS. In the RSOD dataset, the mAP0.5 of YOLO-DSD is increased from 90.0~94.0% to 92.6~95.5% under different input sizes. Compared with the SOTA detectors, YOLO-DSD achieves a better balance between the accuracy, deployability and inference time.
Collapse
|
7
|
An SAR Ship Object Detection Algorithm Based on Feature Information Efficient Representation Network. REMOTE SENSING 2022. [DOI: 10.3390/rs14143489] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
In the synthetic aperture radar (SAR) ship image, the target size is small and dense, the background is complex and changeable, the ship target is difficult to distinguish from the surrounding background, and there are many ship-like targets in the image. This makes it difficult for deep-learning-based target detection algorithms to obtain effective feature information, resulting in missed and false detection. The effective expression of the feature information of the target to be detected is the key to the target detection algorithm. How to improve the clear expression of image feature information in the network has always been a difficult point. Aiming at the above problems, this paper proposes a new target detection algorithm, the feature information efficient representation network (FIERNet). The algorithm can extract better feature details, enhance network feature fusion and information expression, and improve model detection capabilities. First, the convolution transformer feature extraction (CTFE) module is proposed, and a convolution transformer feature extraction network (CTFENet) is built with this module as a feature extraction block. The network enables the model to obtain more accurate and comprehensive feature information, weakens the interference of invalid information, and improves the overall performance of the network. Second, a new effective feature information fusion (EFIF) module is proposed to enhance the transfer and fusion of the main information of feature maps. Finally, a new frame-decoding formula is proposed to further improve the coincidence between the predicted frame and the target frame and obtain more accurate picture information. Experiments show that the method achieves 94.14% and 92.01% mean precision (mAP) on SSDD and SAR-ship datasets, and it works well on large-scale SAR ship images. In addition, FIERNet greatly reduces the occurrence of missed detection and false detection in SAR ship detection. Compared to other state-of-the-art object detection algorithms, FIERNet outperforms them on various performance metrics on SAR images.
Collapse
|
8
|
MGBM-YOLO: a Faster Light-Weight Object Detection Model for Robotic Grasping of Bolster Spring Based on Image-Based Visual Servoing. J INTELL ROBOT SYST 2022. [DOI: 10.1007/s10846-021-01564-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
9
|
Feature Enhancement-Based Ship Target Detection Method in Optical Remote Sensing Images. ELECTRONICS 2022. [DOI: 10.3390/electronics11040634] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Ship targets in ORSIs (Optical Remote Sensing Images) have the characteristics of various scales, and most of them are medium and small-scale targets. When the existing target detection algorithms are applied to ship target detection in ORSIs, the detection accuracy is low. There are two main reasons for the above problems, one is the mismatch of the receptive fields, and the other is the lack of feature information. For resolving the problem that multi-scale ship targets are difficult to detect, this paper proposes a ship target detection algorithm based on feature enhancement. Firstly, EIRM (Elastic Inception Residual Module) is proposed for feature enhancement, which can capture feature information of different dimensions and provide receptive fields of different scales for mid- and low-level feature maps. Secondly, the SandGlass-L block is proposed by replacing the ReLu6 activation function of the SandGlass block with the Leaky ReLu activation function. Leaky ReLu solves the problem of 0 output when ReLu6 has negative input, so the SandGlass-L block can retain more feature information. Finally, based on SandGlass-L, SGLPANet (SandGlass-L Path Aggregation Network) is proposed to alleviate the problem of information loss caused by dimension transformation and retain more feature information. The backbone network of the algorithm in this paper is CSPDarknet53, and the SPP module and EIRM act after the backbone network. The neck network is SGLPANet. Experiments on the NWPU VHR-10 dataset show that the algorithm in this paper can well solve the problem of low detection accuracy caused by mismatched receptive fields and missing feature information. It not only improves the accuracy of ship target detection, but also achieves good results when extended to other categories. At the same time, the extended experiments on the LEVIR dataset show that the algorithm also has certain applicability on different datasets.
Collapse
|
10
|
Zhou L, Deng G, Li W, Mi J, Lei B. A Lightweight SE-YOLOv3 Network for Multi-Scale Object Detection in Remote Sensing Imagery. INT J PATTERN RECOGN 2021. [DOI: 10.1142/s0218001421500373] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Current state-of-the-art detectors achieved impressive performance in detection accuracy with the use of deep learning. However, most of such detectors cannot detect objects in real time due to heavy computational cost, which limits their wide application. Although some one-stage detectors are designed to accelerate the detection speed, it is still not satisfied for task in high-resolution remote sensing images. To address this problem, a lightweight one-stage approach based on YOLOv3 is proposed in this paper, which is named Squeeze-and-Excitation YOLOv3 (SE-YOLOv3). The proposed algorithm maintains high efficiency and effectiveness simultaneously. With an aim to reduce the number of parameters and increase the ability of feature description, two customized modules, lightweight feature extraction and attention-aware feature augmentation, are embedded by utilizing global information and suppressing redundancy features, respectively. To meet the scale invariance, a spatial pyramid pooling method is used to aggregate local features. The evaluation experiments on two remote sensing image data sets, DOTA and NWPU VHR-10, reveal that the proposed approach achieves more competitive detection effect with less computational consumption.
Collapse
Affiliation(s)
- Lifang Zhou
- College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China
- Chongqing Key Laboratory of Image Cognition, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China
- Hubei Key Laboratory of Intelligent, Vision Based Monitoring for Hydroelectric Engineering, China Three Gorges University, Yichang 443002, P. R. China
- China Yichang Key Laboratory of Intelligent, Vision Based Monitoring for Hydroelectric Engineering, China Three Gorges University, Yichang 443002, P. R. China
| | - Guang Deng
- College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China
- Chongqing Key Laboratory of Image Cognition, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China
| | - Weisheng Li
- College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China
- Chongqing Key Laboratory of Image Cognition, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China
| | - Jianxun Mi
- College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China
- Chongqing Key Laboratory of Image Cognition, Chongqing University of Posts and Telecommunications, Chongqing 400065, P. R. China
| | - Bangjun Lei
- Hubei Key Laboratory of Intelligent, Vision Based Monitoring for Hydroelectric Engineering, China Three Gorges University, Yichang 443002, P. R. China
- China Yichang Key Laboratory of Intelligent, Vision Based Monitoring for Hydroelectric Engineering, China Three Gorges University, Yichang 443002, P. R. China
| |
Collapse
|
11
|
Aircraft Detection for Remote Sensing Image Based on Bidirectional and Dense Feature Fusion. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2021; 2021:7618828. [PMID: 34567103 PMCID: PMC8457952 DOI: 10.1155/2021/7618828] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 08/25/2021] [Accepted: 09/04/2021] [Indexed: 11/18/2022]
Abstract
Aircraft, as one of the indispensable transport tools, plays an important role in military activities. Therefore, it is a significant task to locate the aircrafts in the remote sensing images. However, the current object detection methods cause a series of problems when applied to the aircraft detection for the remote sensing image, for instance, the problems of low rate of detection accuracy and high rate of missed detection. To address the problems of low rate of detection accuracy and high rate of missed detection, an object detection method for remote sensing image based on bidirectional and dense feature fusion is proposed to detect aircraft targets in sophisticated environments. On the fundamental of the YOLOv3 detection framework, this method adds a feature fusion module to enrich the details of the feature map by mixing the shallow features with the deep features together. Experimental results on the RSOD-DataSet and NWPU-DataSet indicate that the new method raised in the article is capable of improving the problems of low rate of detection accuracy and high rate of missed detection. Meanwhile, the AP for the aircraft increases by 1.57% compared with YOLOv3.
Collapse
|
12
|
Oil Well Detection via Large-Scale and High-Resolution Remote Sensing Images Based on Improved YOLO v4. REMOTE SENSING 2021. [DOI: 10.3390/rs13163243] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Oil is an important resource for the development of modern society. Accurate detection of oil wells is of great significance to the investigation of oil exploitation status and the formulation of an exploitation plan. However, detecting small objects in large-scale and high-resolution remote sensing images, such as oil wells, is a challenging task due to the problems of large number, limited pixels, and complex background. In order to overcome this problem, first, we create our own oil well dataset to conduct experiments given the lack of a public dataset. Second, we provide a comparative assessment of two state-of-the-art object detection algorithms, SSD and YOLO v4, for oil well detection in our image dataset. The results show that both of them have good performance, but YOLO v4 has better accuracy in oil well detection because of its better feature extraction capability for small objects. In view of the fact that small objects are currently difficult to be detected in large-scale and high-resolution remote sensing images, this article proposes an improved algorithm based on YOLO v4 with sliding slices and discarding edges. The algorithm effectively solves the problems of repeated detection and inaccurate positioning of oil well detection in large-scale and high-resolution remote sensing images, and the accuracy of detection result increases considerably. In summary, this study investigates an appropriate algorithm for oil well detection, improves the algorithm, and achieves an excellent effect on a large-scale and high-resolution satellite image. It provides a new idea for small objects detection in large-scale and high-resolution remote sensing images.
Collapse
|
13
|
Aircraft Detection for Remote Sensing Images Based on Deep Convolutional Neural Networks. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING 2021. [DOI: 10.1155/2021/4685644] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Aircraft detection for remote sensing images, as one of the fields of computer vision, is one of the significant tasks of image processing based on deep learning. Recently, many high-performance algorithms for aircraft detection have been developed and applied in different scenarios. However, the proposed algorithms still have a series of problems; for instance, the algorithms will miss some small-scale aircrafts when applied to the remote sensing image. There are two main reasons for the problem; one reason is that the aircrafts in the remote sensing image are usually small in size, leading to detecting difficulty. The other reason is that the background of the remote sensing image is usually complex, so the algorithms applied to the scenario are easy to be affected by the background. To address the problem of small size, this paper proposes the Multiscale Detection Network (MSDN) which introduces a multiscale detection architecture to detect small-scale aircrafts. With the intention to resist the background noise, this paper proposes the Deeper and Wider Module (DAWM) which increases the perceptual field of the network to alleviate the affection. Besides, to address the two problems simultaneously, this paper introduces the DAWM into the MSDN and names the novel network structure as Multiscale Refined Detection Network (MSRDN). The experimental results show that the MSRDN method has detected the small-scale aircrafts that other algorithms missed and the performance indicators have higher performance than other algorithms.
Collapse
|
14
|
Multi-Sector Oriented Object Detector for Accurate Localization in Optical Remote Sensing Images. REMOTE SENSING 2021. [DOI: 10.3390/rs13101921] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Oriented object detection in optical remote sensing images (ORSIs) is a challenging task since the targets in ORSIs are displayed in an arbitrarily oriented manner and on small scales, and are densely packed. Current state-of-the-art oriented object detection models used in ORSIs primarily evolved from anchor-based and direct regression-based detection paradigms. Nevertheless, they still encounter a design difficulty from handcrafted anchor definitions and learning complexities in direct localization regression. To tackle these issues, in this paper, we proposed a novel multi-sector oriented object detection framework called MSO2-Det, which quantizes the scales and orientation prediction of targets in ORSIs via an anchor-free classification-to-regression approach. Specifically, we first represented the arbitrarily oriented bounding box as four scale offsets and angles in four quadrant sectors of the corresponding Cartesian coordinate system. Then, we divided the scales and angle space into multiple discrete sectors and obtained more accurate localization information by a coarse-granularity classification to fine-grained regression strategy. In addition, to decrease the angular-sector classification loss and accelerate the network’s convergence, we designed a smooth angular-sector label (SASL) that smoothly distributes label values with a definite tolerance radius. Finally, we proposed a localization-aided detection score (LADS) to better represent the confidence of a detected box by combining the category-classification score and the sector-selection score. The proposed MSO2-Det achieves state-of-the-art results on three widely used benchmarks, including the DOTA, HRSC2016, and UCAS-AOD data sets.
Collapse
|