1
|
Lin X, Huang X, Wang L. Underwater object detection method based on learnable query recall mechanism and lightweight adapter. PLoS One 2024; 19:e0298739. [PMID: 38416764 PMCID: PMC10901356 DOI: 10.1371/journal.pone.0298739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Accepted: 01/29/2024] [Indexed: 03/01/2024] Open
Abstract
With the rapid development of ocean observation technology, underwater object detection has begun to occupy an essential position in the fields of aquaculture, environmental monitoring, marine science, etc. However, due to the problems unique to underwater images such as severe noise, blurred objects, and multi-scale, deep learning-based target detection algorithms lack sufficient capabilities to cope with these challenges. To address these issues, we improve DETR to make it well suited for underwater scenarios. First, a simple and effective learnable query recall mechanism is proposed to mitigate the effect of noise and can significantly improve the detection performance of the object. Second, for underwater small and irregular object detection, a lightweight adapter is designed to provide multi-scale features for the encoding and decoding stages. Third, the regression mechanism of the bounding box is optimized using the combination loss of smooth L1 and CIoU. Finally, we validate the designed network against other state-of-the-art methods on the RUOD dataset. The experimental results show that the proposed method is effective.
Collapse
Affiliation(s)
- Xi Lin
- Institute of Logistics Science and Engineering, Shanghai Maritime University, Shanghai, People's Republic of China
| | - Xixia Huang
- Institute of Logistics Science and Engineering, Shanghai Maritime University, Shanghai, People's Republic of China
| | - Le Wang
- Institute of Logistics Science and Engineering, Shanghai Maritime University, Shanghai, People's Republic of China
| |
Collapse
|
2
|
Kim J. Camera-Based Net Avoidance Controls of Underwater Robots. SENSORS (BASEL, SWITZERLAND) 2024; 24:674. [PMID: 38276365 PMCID: PMC10820847 DOI: 10.3390/s24020674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 01/15/2024] [Accepted: 01/19/2024] [Indexed: 01/27/2024]
Abstract
Fishing nets are dangerous obstacles for an underwater robot whose aim is to reach a goal in unknown underwater environments. This paper proposes how to make the robot reach its goal, while avoiding fishing nets that are detected using the robot's camera sensors. For the detection of underwater nets based on camera measurements of the robot, we can use deep neural networks. Passive camera sensors do not provide the distance information between the robot and a net. Camera sensors only provide the bearing angle of a net, with respect to the robot's camera pose. There may be trailing wires that extend from a net, and the wires can entangle the robot before the robot detects the net. Moreover, light, viewpoint, and sea floor condition can decrease the net detection probability in practice. Therefore, whenever a net is detected by the robot's camera, we make the robot avoid the detected net by moving away from the net abruptly. For moving away from the net, the robot uses the bounding box for the detected net in the camera image. After the robot moves backward for a certain distance, the robot makes a large circular turn to approach the goal, while avoiding the net. A large circular turn is used, since moving close to a net is too dangerous for the robot. As far as we know, our paper is unique in addressing reactive control laws for approaching the goal, while avoiding fishing nets detected using camera sensors. The effectiveness of the proposed net avoidance controls is verified using simulations.
Collapse
Affiliation(s)
- Jonghoek Kim
- System Engineering Department, Sejong University, Seoul 5006, Republic of Korea
| |
Collapse
|
3
|
Yan S, Chen X, Wu Z, Tan M, Yu J. HybrUR: A Hybrid Physical-Neural Solution for Unsupervised Underwater Image Restoration. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2023; 32:5004-5016. [PMID: 37656642 DOI: 10.1109/tip.2023.3309408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/03/2023]
Abstract
Robust vision restoration of underwater images remains a challenge. Owing to the lack of well-matched underwater and in-air images, unsupervised methods based on the cyclic generative adversarial framework have been widely investigated in recent years. However, when using an end-to-end unsupervised approach with only unpaired image data, mode collapse could occur, and the color correction of the restored images is usually poor. In this paper, we propose a data- and physics-driven unsupervised architecture to perform underwater image restoration from unpaired underwater and in-air images. For effective color correction and quality enhancement, an underwater image degeneration model must be explicitly constructed based on the optically unambiguous physics law. Thus, we employ the Jaffe-McGlamery degeneration theory to design a generator and use neural networks to model the process of underwater visual degeneration. Furthermore, we impose physical constraints on the scene depth and degeneration factors for backscattering estimation to avoid the vanishing gradient problem during the training of the hybrid physical-neural model. Experimental results show that the proposed method can be used to perform high-quality restoration of unconstrained underwater images without supervision. On multiple benchmarks, the proposed method outperforms several state-of-the-art supervised and unsupervised approaches. We demonstrate that our method yields encouraging results in real-world applications.
Collapse
|
4
|
Mbani B, Buck V, Greinert J. An automated image-based workflow for detecting megabenthic fauna in optical images with examples from the Clarion-Clipperton Zone. Sci Rep 2023; 13:8350. [PMID: 37221273 DOI: 10.1038/s41598-023-35518-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 05/19/2023] [Indexed: 05/25/2023] Open
Abstract
Recent advances in optical underwater imaging technologies enable the acquisition of huge numbers of high-resolution seafloor images during scientific expeditions. While these images contain valuable information for non-invasive monitoring of megabenthic fauna, flora and the marine ecosystem, traditional labor-intensive manual approaches for analyzing them are neither feasible nor scalable. Therefore, machine learning has been proposed as a solution, but training the respective models still requires substantial manual annotation. Here, we present an automated image-based workflow for Megabenthic Fauna Detection with Faster R-CNN (FaunD-Fast). The workflow significantly reduces the required annotation effort by automating the detection of anomalous superpixels, which are regions in underwater images that have unusual properties relative to the background seafloor. The bounding box coordinates of the detected anomalous superpixels are proposed as a set of weak annotations, which are then assigned semantic morphotype labels and used to train a Faster R-CNN object detection model. We applied this workflow to example underwater images recorded during cruise SO268 to the German and Belgian contract areas for Manganese-nodule exploration, within the Clarion-Clipperton Zone (CCZ). A performance assessment of our FaunD-Fast model showed a mean average precision of 78.1% at an intersection-over-union threshold of 0.5, which is on a par with competing models that use costly-to-acquire annotations. In more detail, the analysis of the megafauna detection results revealed that ophiuroids and xenophyophores were among the most abundant morphotypes, accounting for 62% of all the detections within the surveyed area. Investigating the regional differences between the two contract areas further revealed that both megafaunal abundance and diversity was higher in the shallower German area, which might be explainable by the higher food availability in form of sinking organic material that decreases from east-to-west across the CCZ. Since these findings are consistent with studies based on conventional image-based methods, we conclude that our automated workflow significantly reduces the required human effort, while still providing accurate estimates of megafaunal abundance and their spatial distribution. The workflow is thus useful for a quick but objective generation of baseline information to enable monitoring of remote benthic ecosystems.
Collapse
Affiliation(s)
- Benson Mbani
- DeepSea Monitoring Group, GEOMAR Helmholtz Center for Ocean Research Kiel, Wischhofstraße 1-3, 24148, Kiel, Germany.
| | - Valentin Buck
- DeepSea Monitoring Group, GEOMAR Helmholtz Center for Ocean Research Kiel, Wischhofstraße 1-3, 24148, Kiel, Germany
| | - Jens Greinert
- DeepSea Monitoring Group, GEOMAR Helmholtz Center for Ocean Research Kiel, Wischhofstraße 1-3, 24148, Kiel, Germany
- Institute of Geosciences, Kiel University, Ludewig-Meyn-Str. 10-12, 24118, Kiel, Germany
| |
Collapse
|
5
|
Localisation of Unmanned Underwater Vehicles (UUVs) in Complex and Confined Environments: A Review. SENSORS 2020; 20:s20216203. [PMID: 33143242 PMCID: PMC7663020 DOI: 10.3390/s20216203] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Revised: 10/21/2020] [Accepted: 10/23/2020] [Indexed: 11/17/2022]
Abstract
The inspection of aquatic environments is a challenging activity, which is made more difficult if the environment is complex or confined, such as those that are found in nuclear storage facilities and accident sites, marinas and boatyards, liquid storage tanks, or flooded tunnels and sewers. Human inspections of these environments are often dangerous or infeasible, so remote inspection using unmanned underwater vehicles (UUVs) is used. Due to access restrictions and environmental limitations, such as low illumination levels, turbidity, and a lack of salient features, traditional localisation systems that have been developed for use in large bodies of water cannot be used. This means that UUV capabilities are severely restricted to manually controlled low-quality visual inspections, generating non-geospatially located data. The localisation of UUVs in these environments would enable the autonomous behaviour and the development of accurate maps. This article presents a review of the state-of-the-art in localisation technologies for these environments and identifies areas of future research to overcome the challenges posed.
Collapse
|
6
|
Autonomous Underwater Monitoring System for Detecting Life on the Seabed by Means of Computer Vision Cloud Services. REMOTE SENSING 2020. [DOI: 10.3390/rs12121981] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Autonomous underwater vehicles (AUVs) have increasingly played a key role in monitoring the marine environment, studying its physical-chemical parameters for the supervision of endangered species. AUVs now include a power source and an intelligent control system that allows them to autonomously carry out programmed tasks. Their navigation system is much more challenging than that of land-based applications, due to the lack of connected networks in the marine environment. On the other hand, due to the latest developments in neural networks, particularly deep learning (DL), the visual recognition systems can achieve impressive performance. Computer vision (CV) has especially improved the field of object detection. Although all the developed DL algorithms can be deployed in the cloud, the present cloud computing system is unable to manage and analyze the massive amount of computing power and data. Edge intelligence is expected to replace DL computation in the cloud, providing various distributed, low-latency and reliable intelligent services. This paper proposes an AUV model system designed to overcome latency challenges in the supervision and tracking process by using edge computing in an IoT gateway. The IoT gateway is used to connect the AUV control system to the internet. The proposed model successfully carried out a long-term monitoring mission in a predefined area of shallow water in the Mar Menor (Spain) to track the underwater Pinna nobilis (fan mussel) species. The obtained results clearly justify the proposed system’s design and highlight the cloud and edge architecture performances. They also indicate the need for a hybrid cloud/edge architecture to ensure a real-time control loop for better latency and accuracy to meet the system’s requirements.
Collapse
|
7
|
Feng H, Yin X, Xu L, Lv G, Li Q, Wang L. Underwater salient object detection jointly using improved spectral residual and Fuzzy c-Means. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2019. [DOI: 10.3233/jifs-179089] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Hui Feng
- College of Computer and Information, Hohai University, Nanjing, Jiangsu, China
| | - Xinghui Yin
- College of Computer and Information, Hohai University, Nanjing, Jiangsu, China
| | - Lizhong Xu
- College of Computer and Information, Hohai University, Nanjing, Jiangsu, China
| | - Guofang Lv
- College of Energy and Electrical Engineering, Hohai University, Nanjing, Jiangsu, China
| | - Qi Li
- College of Energy and Electrical Engineering, Hohai University, Nanjing, Jiangsu, China
| | - Lulu Wang
- College of Computer and Information, Hohai University, Nanjing, Jiangsu, China
| |
Collapse
|
8
|
|
9
|
Chen Z, Zhang Z, Dai F, Bu Y, Wang H. Monocular Vision-Based Underwater Object Detection. SENSORS 2017; 17:s17081784. [PMID: 28771194 PMCID: PMC5580077 DOI: 10.3390/s17081784] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/24/2017] [Revised: 07/25/2017] [Accepted: 07/31/2017] [Indexed: 11/16/2022]
Abstract
In this paper, we propose an underwater object detection method using monocular vision sensors. In addition to commonly used visual features such as color and intensity, we investigate the potential of underwater object detection using light transmission information. The global contrast of various features is used to initially identify the region of interest (ROI), which is then filtered by the image segmentation method, producing the final underwater object detection results. We test the performance of our method with diverse underwater datasets. Samples of the datasets are acquired by a monocular camera with different qualities (such as resolution and focal length) and setups (viewing distance, viewing angle, and optical environment). It is demonstrated that our ROI detection method is necessary and can largely remove the background noise and significantly increase the accuracy of our underwater object detection method.
Collapse
Affiliation(s)
- Zhe Chen
- College of Computer and Information, Hohai University, Nanjing 211100, Jiangsu, China.
- Key Laboratory of Trusted Cloud Computing and Big Data Analysis, Nanjing Xiaozhuang University, Nanjing 211100, Jiangsu, China.
| | - Zhen Zhang
- College of Computer and Information, Hohai University, Nanjing 211100, Jiangsu, China.
| | - Fengzhao Dai
- Laboratory of Information Optics and Opto-Electronic Technology, Shanghai Institute of Optics and Fine Mechanics, Shanghai 201800, China.
| | - Yang Bu
- Laboratory of Information Optics and Opto-Electronic Technology, Shanghai Institute of Optics and Fine Mechanics, Shanghai 201800, China.
| | - Huibin Wang
- College of Computer and Information, Hohai University, Nanjing 211100, Jiangsu, China.
| |
Collapse
|
10
|
Kia C, Arshad MR. Robotics Vision-based Heuristic Reasoning for Underwater Target Tracking and Navigation. INT J ADV ROBOT SYST 2005. [DOI: 10.5772/5782] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
This paper presents a robotics vision-based heuristic reasoning system for underwater target tracking and navigation. This system is introduced to improve the level of automation of underwater Remote Operated Vehicles (ROVs) operations. A prototype which combines computer vision with an underwater robotics system is successfully designed and developed to perform target tracking and intelligent navigation. This study focuses on developing image processing algorithms and fuzzy inference system for the analysis of the terrain. The vision system developed is capable of interpreting underwater scene by extracting subjective uncertainties of the object of interest. Subjective uncertainties are further processed as multiple inputs of a fuzzy inference system that is capable of making crisp decisions concerning where to navigate. The important part of the image analysis is morphological filtering. The applications focus on binary images with the extension of gray-level concepts. An open-loop fuzzy control system is developed for classifying the traverse of terrain. The great achievement is the system's capability to recognize and perform target tracking of the object of interest (pipeline) in perspective view based on perceived condition. The effectiveness of this approach is demonstrated by computer and prototype simulations. This work is originated from the desire to develop robotics vision system with the ability to mimic the human expert's judgement and reasoning when maneuvering ROV in the traverse of the underwater terrain.
Collapse
Affiliation(s)
- Chua Kia
- Underwater Robotics Research Group (URRG), School of Electrical and Electronics Engineering, Universiti Sains Malaysia, Penang, Malaysia
| | - Mohd Rizal Arshad
- Underwater Robotics Research Group (URRG), School of Electrical and Electronics Engineering, Universiti Sains Malaysia, Penang, Malaysia
| |
Collapse
|
11
|
|