1
|
Masmitja I, Martin M, O'Reilly T, Kieft B, Palomeras N, Navarro J, Katija K. Dynamic robotic tracking of underwater targets using reinforcement learning. Sci Robot 2023; 8:eade7811. [PMID: 37494462 DOI: 10.1126/scirobotics.ade7811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 06/26/2023] [Indexed: 07/28/2023]
Abstract
To realize the potential of autonomous underwater robots that scale up our observational capacity in the ocean, new techniques are needed. Fleets of autonomous robots could be used to study complex marine systems and animals with either new imaging configurations or by tracking tagged animals to study their behavior. These activities can then inform and create new policies for community conservation. The role of animal connectivity via active movement of animals represents a major knowledge gap related to the distribution of deep ocean populations. Tracking underwater targets represents a major challenge for observing biological processes in situ, and methods to robustly respond to a changing environment during monitoring missions are needed. Analytical techniques for optimal sensor placement and path planning to locate underwater targets are not straightforward in such cases. The aim of this study was to investigate the use of reinforcement learning as a tool for range-only underwater target-tracking optimization, whose promising capabilities have been demonstrated in terrestrial scenarios. To evaluate its usefulness, a reinforcement learning method was implemented as a path planning system for an autonomous surface vehicle while tracking an underwater mobile target. A complete description of an open-source model, performance metrics in simulated environments, and evaluated algorithms based on more than 15 hours of at-sea field experiments are presented. These efforts demonstrate that deep reinforcement learning is a powerful approach that enhances the abilities of autonomous robots in the ocean and encourages the deployment of algorithms like these for monitoring marine biological systems in the future.
Collapse
Affiliation(s)
- I Masmitja
- Institut de Ciències del Mar (ICM), CSIC, Barcelona 95062, Spain
- Research and Development, Bioinspiration Lab, Monterey Bay Aquarium Research Institute MBARI, Moss Landing, CA 95062, USA
| | - M Martin
- Knowledge Engineering and Machine Learning Group, Universitat Politècnica de Catalunya, Barcelona Tech., Barcelona 08034, Spain
- Barcelona Supercomputing Center (BSC), Barcelona 08034, Spain
| | - T O'Reilly
- Research and Development, Bioinspiration Lab, Monterey Bay Aquarium Research Institute MBARI, Moss Landing, CA 95062, USA
| | - B Kieft
- Research and Development, Bioinspiration Lab, Monterey Bay Aquarium Research Institute MBARI, Moss Landing, CA 95062, USA
| | - N Palomeras
- Computer vision and Robotics Institute, Universitat de Girona, Girona 17003, Spain
| | - J Navarro
- Institut de Ciències del Mar (ICM), CSIC, Barcelona 95062, Spain
| | - K Katija
- Research and Development, Bioinspiration Lab, Monterey Bay Aquarium Research Institute MBARI, Moss Landing, CA 95062, USA
| |
Collapse
|