1. Song C, He Z, Dong L. A Local-and-Global Attention Reinforcement Learning Algorithm for Multiagent Cooperative Navigation. IEEE Transactions on Neural Networks and Learning Systems 2024; 35:7767-7777. [PMID: 36383584] [DOI: 10.1109/tnnls.2022.3220798]
Abstract
Cooperative navigation is a crucial technology for multirobot systems to accomplish autonomous collaborative operations, and it remains a challenge for researchers. In this work, we propose a new multiagent reinforcement learning algorithm called multiagent local-and-global attention actor-critic (MLGA2C) for multiagent cooperative navigation. Inspired by the attention mechanism, we design a local-and-global attention module to dynamically extract and encode critical environmental features. Meanwhile, based on the centralized training and decentralized execution (CTDE) paradigm, we extend a new actor-critic method to handle feature encoding and make navigation decisions. We evaluate the proposed algorithm in two cooperative navigation scenarios: static target navigation and dynamic pedestrian target tracking. Experimental results show that our algorithm performs well in cooperative navigation tasks as the number of agents increases.
2. Zhang Q, Li R, Sun J, Wei L, Huang J, Tan Y. Dynamic 3D Point-Cloud-Driven Autonomous Hierarchical Path Planning for Quadruped Robots. Biomimetics (Basel) 2024; 9:259. [PMID: 38786469] [PMCID: PMC11117888] [DOI: 10.3390/biomimetics9050259]
Abstract
Aiming at effectively generating safe and reliable motion paths for quadruped robots, a hierarchical path planning approach driven by dynamic 3D point clouds is proposed in this article. The developed path planning model consists of two layers: a global path planning layer and a local path planning layer. At the global path planning layer, a new method is proposed for calculating the terrain potential field based on point cloud height segmentation, and a variable step size is employed to improve path smoothness. At the local path planning layer, a real-time prediction method for potential collision areas and a strategy for temporary target point selection are developed. Quadruped robot experiments were carried out in a complex outdoor environment. The experimental results verified that, for global path planning, the smoothness of the path is improved and the complexity of the traversed ground is reduced; the effective step size is increased by a maximum of 13.4 times, and the number of iterations is decreased by up to 1/6, compared with the traditional fixed-step-size planning algorithm. For local path planning, the path length is shortened by 20%, and more efficient dynamic obstacle avoidance and more stable velocity planning are achieved by using the improved dynamic window approach (DWA).
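The terrain-potential-field idea underlying this entry can be illustrated with a generic artificial-potential-field sketch. This is not the paper's height-segmentation method; the gains `k_att`, `k_rep` and the influence radius `d0` are illustrative assumptions.

```python
import math

def potential(pos, goal, obstacles, k_att=1.0, k_rep=100.0, d0=2.0):
    """Total artificial potential at a 2-D point: attractive pull toward the
    goal plus repulsive push from each obstacle within influence radius d0."""
    px, py = pos
    gx, gy = goal
    # Attractive term: grows quadratically with distance to the goal.
    u = 0.5 * k_att * ((px - gx) ** 2 + (py - gy) ** 2)
    for ox, oy in obstacles:
        d = math.hypot(px - ox, py - oy)
        if d < d0:  # obstacle only contributes inside its influence radius
            u += 0.5 * k_rep * (1.0 / d - 1.0 / d0) ** 2
    return u

def descend_step(pos, goal, obstacles, step=0.1, eps=1e-3):
    """One gradient-descent step on the potential, with the gradient
    approximated by central finite differences."""
    x, y = pos
    gx = (potential((x + eps, y), goal, obstacles)
          - potential((x - eps, y), goal, obstacles)) / (2 * eps)
    gy = (potential((x, y + eps), goal, obstacles)
          - potential((x, y - eps), goal, obstacles)) / (2 * eps)
    n = math.hypot(gx, gy) or 1.0  # normalize so `step` sets the stride
    return (x - step * gx / n, y - step * gy / n)
```

A variable step size, as in the entry, would correspond to making `step` depend on the local potential landscape rather than keeping it fixed.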
Affiliation(s)
- Qi Zhang
- School of Mechanical and Electronic Engineering, Wuhan University of Technology, Wuhan 430070, China; (Q.Z.); (J.S.); (L.W.); (Y.T.)
- Ruiya Li
- School of Mechanical and Electronic Engineering, Wuhan University of Technology, Wuhan 430070, China; (Q.Z.); (J.S.); (L.W.); (Y.T.)
- Robotics and Intelligent Manufacturing Engineering Research Center of Hubei Province, Wuhan 430070, China
- Jubiao Sun
- School of Mechanical and Electronic Engineering, Wuhan University of Technology, Wuhan 430070, China; (Q.Z.); (J.S.); (L.W.); (Y.T.)
- Li Wei
- School of Mechanical and Electronic Engineering, Wuhan University of Technology, Wuhan 430070, China; (Q.Z.); (J.S.); (L.W.); (Y.T.)
- Jun Huang
- School of Information Engineering, Wuhan University of Technology, Wuhan 430070, China
- Yuegang Tan
- School of Mechanical and Electronic Engineering, Wuhan University of Technology, Wuhan 430070, China; (Q.Z.); (J.S.); (L.W.); (Y.T.)
3. Deguale DA, Yu L, Sinishaw ML, Li K. Enhancing Stability and Performance in Mobile Robot Path Planning with PMR-Dueling DQN Algorithm. Sensors (Basel) 2024; 24:1523. [PMID: 38475059] [DOI: 10.3390/s24051523]
Abstract
Path planning for mobile robots in complex circumstances is still a challenging issue. This work introduces an improved deep reinforcement learning strategy for robot navigation that combines a dueling architecture, prioritized experience replay, and shaped rewards. In a grid world and two Gazebo simulation environments with static and dynamic obstacles, the Dueling Deep Q-Network with Modified Rewards and Prioritized Experience Replay (PMR-Dueling DQN) algorithm is compared against Q-learning, DQN, and DDQN in terms of path optimality, collision avoidance, and learning speed. To encourage the best routes, the shaped reward function takes into account target direction, obstacle avoidance, and distance. Prioritized replay concentrates training on important experiences, while the dueling architecture separates value and advantage learning. The results show that the PMR-Dueling DQN greatly increases convergence speed, stability, and overall performance across conditions. In both grid-world and Gazebo environments, the PMR-Dueling DQN achieved higher cumulative rewards. The combination of deep reinforcement learning with reward design, network architecture, and experience replay enables the PMR-Dueling DQN to surpass traditional approaches for robot path planning in complex environments.
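The two architectural ideas named in this abstract, the dueling value/advantage split and prioritized experience replay, can be sketched in a few lines. These follow the standard formulations, not necessarily the paper's exact variants; `alpha` and `eps` are conventional defaults.

```python
def dueling_q(value, advantages):
    """Dueling aggregation: Q(s,a) = V(s) + A(s,a) - mean_a A(s,a).
    Subtracting the mean advantage makes V and A identifiable."""
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]

def per_probabilities(td_errors, alpha=0.6, eps=1e-5):
    """Prioritized experience replay: sampling probability proportional to
    |TD error|^alpha, with eps keeping zero-error transitions sampleable."""
    pr = [(abs(e) + eps) ** alpha for e in td_errors]
    s = sum(pr)
    return [p / s for p in pr]
```

In a full agent these would sit inside the Q-network head and the replay buffer, respectively; the transitions with the largest TD errors are replayed most often.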
Affiliation(s)
- Lingli Yu
- School of Automation, Central South University, Changsha 410083, China
- Melikamu Liyih Sinishaw
- School of Computer Science and Engineering, Central South University, Changsha 410083, China
- Keyi Li
- School of Automation, Central South University, Changsha 410083, China
4. Xu M, Yang F, Fang Y, Li F, Yan R. Research on Time Series-Based Pipeline Ground Penetrating Radar Calibration Angle Prediction Algorithm. Sensors (Basel) 2024; 24:379. [PMID: 38257472] [PMCID: PMC10819543] [DOI: 10.3390/s24020379]
Abstract
The pipeline ground-penetrating radar is an indispensable detection device for ensuring underground space security. A wheeled pipeline robot is deployed to traverse the interior of urban underground drainage pipelines along their central axis. It is subject to influences such as resistance, speed, and human factors, leading to deviations in its posture. A guiding wheel is employed to rectify its roll angle and ensure the precise spatial positioning of defects both inside and outside the pipeline, as detected by the radar antenna. Given the robot's deflection factors and correction trajectories, the intelligent correction control of the pipeline ground-penetrating radar is a nonlinear multi-constraint optimization problem. Consequently, a time-series-based correction angle prediction algorithm is proposed. A long short-term memory (LSTM) deep learning model is applied to predict the correction angles and torque for the guiding wheel. This study compares the performance of LSTM with an autoregressive integrated moving average model under identical dataset conditions. The findings reveal a reduction of 4.11° and 8.25 N·m in mean absolute error, and a decrease of 10.66% and 7.27% in mean squared error, for the predicted correction angles and torques, respectively. These outcomes are achieved using a three-channel drainage pipeline ground-penetrating radar device with the top antenna operating at 1.2 GHz and the left/right antennas at 750 MHz. The LSTM prediction model enables the robot to correct its posture intelligently. Experimental results demonstrate an average correction time of 5 s and an average angular error of ±1°. It is verified that the model can correct the robot's attitude in real time with small errors, thereby enhancing the accuracy of ground-penetrating radar antennas in locating pipeline defects.
Affiliation(s)
- Maoxuan Xu
- School of Mechanical Electronic and Information Engineering, China University of Mining and Technology (Beijing), Beijing 100083, China
- Feng Yang
- School of Mechanical Electronic and Information Engineering, China University of Mining and Technology (Beijing), Beijing 100083, China
- Yuanjin Fang
- School of Mechanical Electronic and Information Engineering, China University of Mining and Technology (Beijing), Beijing 100083, China
- Fanruo Li
- School of Mechanical Electronic and Information Engineering, China University of Mining and Technology (Beijing), Beijing 100083, China
- Rui Yan
- Beijing Drainage Group Co., Ltd., Beijing 100044, China
5. Caccavale R, Ermini M, Fedeli E, Finzi A, Lippiello V, Tavano F. A multi-robot deep Q-learning framework for priority-based sanitization of railway stations. Appl Intell 2023; 53:1-19. [PMID: 37363385] [PMCID: PMC10111085] [DOI: 10.1007/s10489-023-04529-0]
Abstract
Sanitizing railway stations is a relevant issue, primarily due to the recent evolution of the COVID-19 pandemic. In this work, we propose a multi-robot approach to sanitizing railway stations based on a distributed deep Q-learning technique. The proposed framework relies on anonymous data from existing WiFi networks to dynamically estimate crowded areas within the station and to build a heatmap of prioritized areas to be sanitized. This heatmap is then provided to a team of cleaning robots, each endowed with a robot-specific convolutional neural network, that learn how to cooperate effectively and sanitize the station's areas according to the associated priorities. The proposed approach is evaluated in a realistic simulation of Italy's largest railway station, Roma Termini. In this setting, we consider different case studies to assess how the approach scales with the number of robots and how the trained system performs on a real dataset retrieved from a one-day recording of the station's WiFi network.
Affiliation(s)
- Riccardo Caccavale
- Department DIETI, Università degli Studi di Napoli “Federico II”, via Claudio 21, Naples, 80125 Italy
- Mirko Ermini
- Department Research and Development, Rete Ferroviaria Italiana, Via Curzio Malaparte 8, Firenze Osmannoro, 50145 Italy
- Eugenio Fedeli
- Department Research and Development, Rete Ferroviaria Italiana, Piazza della Croce Rossa 1, Roma, 00161 Italy
- Alberto Finzi
- Department DIETI, Università degli Studi di Napoli “Federico II”, via Claudio 21, Naples, 80125 Italy
- Vincenzo Lippiello
- Department DIETI, Università degli Studi di Napoli “Federico II”, via Claudio 21, Naples, 80125 Italy
- Fabrizio Tavano
- Department DIETI, Università degli Studi di Napoli “Federico II”, via Claudio 21, Naples, 80125 Italy
- Department Research and Development, Rete Ferroviaria Italiana, Via del Portonaccio 175, Roma, 00159 Italy
6. Orr J, Dutta A. Multi-Agent Deep Reinforcement Learning for Multi-Robot Applications: A Survey. Sensors (Basel) 2023; 23:3625. [PMID: 37050685] [PMCID: PMC10098527] [DOI: 10.3390/s23073625]
Abstract
Deep reinforcement learning has produced many success stories in recent years, in fields including mathematics, games, health care, and robotics. In this paper, we are especially interested in multi-agent deep reinforcement learning, where multiple agents present in the environment learn not only from their own experiences but also from each other, and in its applications to multi-robot systems. In many real-world scenarios, one robot might not be enough to complete a given task on its own, so we may need to deploy multiple robots that work together towards a common global objective. Although multi-agent deep reinforcement learning and its applications in multi-robot systems are of tremendous significance from both theoretical and applied standpoints, the latest survey in this domain dates to 2004, and covers only traditional learning applications, as deep reinforcement learning had not yet been invented. We classify the reviewed papers primarily by their multi-robot applications. Our survey also discusses several challenges that current research in this domain faces and provides a list of potential future applications involving multi-robot systems that can benefit from advances in multi-agent deep reinforcement learning.
7. Sahu B, Kumar Das P, Kumar R. A Modified Cuckoo Search Algorithm implemented with SCA and PSO for Multi-robot Cooperation and Path Planning. Cogn Syst Res 2023. [DOI: 10.1016/j.cogsys.2023.01.005]
8. Zheng L, Tang Y, Guo S, Ma Y, Deng L. Dynamic Analysis and Path Planning of a Turtle-Inspired Amphibious Spherical Robot. Micromachines (Basel) 2022; 13:2130. [PMID: 36557429] [PMCID: PMC9784272] [DOI: 10.3390/mi13122130]
Abstract
A dynamic path-planning algorithm based on a general constrained optimization problem (GCOP) model and a sequential quadratic programming (SQP) method with sensor input is proposed in this paper. In an unknown underwater space, the turtle-inspired amphibious spherical robot (ASR) can perform path-planning control movement and achieve collision avoidance. Owing to the special underwater environment, thrusters and diamond parallel legs (DPLs) are installed in the lower hemisphere to realise accurate motion control. A propulsion model for a novel water-jet thruster based on experimental analysis and a modified Denavit-Hartenberg (MDH) algorithm are developed for multiple degrees of freedom (MDOF) to realise high-precision and high-speed motion control. Simulations and experiments verify the effectiveness of the GCOP and SQP algorithms in realising reasonable path planning, and show that they improve the flexibility of underwater movement with a small estimation error.
Affiliation(s)
- Liang Zheng
- School of Electronic Information Science and Technology, Jilin Agricultural Science and Technology University, Jilin 132101, China
- You Tang
- School of Electronic Information Science and Technology, Jilin Agricultural Science and Technology University, Jilin 132101, China
- Shuxiang Guo
- Key Laboratory of Convergence Medical Engineering and System and Healthcare Technology, the Ministry of Industry Information Technology, School of Life Science, Beijing Institute of Technology, Haidian District, Beijing 100081, China
- Yuke Ma
- School of Artificial Intelligence, Changchun University of Science and Technology, Changchun 130022, China
- Lijin Deng
- School of Artificial Intelligence, Changchun University of Science and Technology, Changchun 130022, China
9. A Framework and Algorithm for Human-Robot Collaboration Based on Multimodal Reinforcement Learning. Comput Intell Neurosci 2022; 2022:2341898. [PMID: 36210974] [PMCID: PMC9534615] [DOI: 10.1155/2022/2341898]
Abstract
Despite the emergence of various human-robot collaboration frameworks, most are not sufficiently flexible to adapt to users with different habits. In this article, a Multimodal Reinforcement Learning Human-Robot Collaboration (MRLC) framework is proposed. It integrates reinforcement learning into human-robot collaboration and continuously adapts to the user's habits during collaboration to achieve human-robot cointegration. With the user's multimodal features as states, the MRLC framework collects the user's speech through natural language processing and employs it to determine the reward for the actions taken by the robot. Our experiments demonstrate that the MRLC framework can adapt to the user's habits after repeated learning and understands the user's intention better than traditional solutions.
10. Controlling Fleets of Autonomous Mobile Robots with Reinforcement Learning: A Brief Survey. Robotics 2022. [DOI: 10.3390/robotics11050085]
Abstract
Controlling a fleet of autonomous mobile robots (AMR) is a complex optimization problem. Many approaches have been proposed to solve it, ranging from heuristics, which usually do not find an optimum, to mathematical models, which are limited by their high computational effort. Machine Learning (ML) methods offer another potential avenue for solving such complex problems. The focus of this brief survey is on Reinforcement Learning (RL) as a particular type of ML. Due to its reward-based optimization, RL offers a good basis for the control of fleets of AMR. In this survey, different control approaches are investigated and the aspects of AMR fleet control with respect to RL are evaluated. As a result, six fundamental key problems should be put on the current research agenda to enable broader application in industry: (1) overcoming the “sim-to-real gap”, (2) increasing the robustness of algorithms, (3) improving data efficiency, (4) integrating different fields of application, (5) enabling heterogeneous fleets with different types of AMR, and (6) handling deadlocks.
11. Distributed Multi-Mobile Robot Path Planning and Obstacle Avoidance Based on ACO–DWA in Unknown Complex Terrain. Electronics 2022. [DOI: 10.3390/electronics11142144]
Abstract
Multi-robot systems are widely used in logistics, transportation, and other fields. We propose a distributed multi-mobile robot obstacle-avoidance algorithm to coordinate path planning and motion navigation among multiple robots operating in unknown terrain. This algorithm fuses ant colony optimization (ACO) and the dynamic window approach (DWA), and coordinates the multi-robot system through a priority strategy. Firstly, to ensure the optimality of robot motion in complex terrain, we propose dual-population heuristic functions and a sorted-ant pheromone update strategy to enhance the search capability of ACO, and obtain the globally optimal path through a redundant point deletion strategy. To address the robot's path-tracking accuracy and local target unreachability problems, an adaptive navigation strategy is presented. Furthermore, we propose an obstacle density evaluation function to reduce the robot's decision-making difficulty in high-density obstacle environments, and adaptively modify the evaluation function coefficients according to environmental characteristics. Finally, motion conflicts between robots are resolved by combining our obstacle avoidance and multi-robot priority strategies. The experimental results show that this algorithm realizes cooperative obstacle avoidance of AGVs in unknown environments with high safety and global optimality, providing a technical reference for distributed multi-robot systems in practical applications.
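The DWA component fused here can be sketched as a one-step velocity-sampling loop. This is a minimal illustration, not the paper's modified evaluation function; all weights, sample counts, and the collision threshold are assumptions.

```python
import math

def dwa_choose(pose, goal, obstacles, v_max=1.0, w_max=1.0, dt=1.0,
               w_heading=1.0, w_clear=1.0, w_speed=0.1):
    """Evaluate sampled (v, w) velocity pairs one step ahead and return the
    best pair. Score trades off goal heading, obstacle clearance, and speed."""
    x, y, th = pose
    best, best_score = (0.0, 0.0), -float("inf")
    for i in range(5):                      # linear-velocity samples
        v = v_max * i / 4
        for j in range(9):                  # angular-velocity samples
            w = -w_max + 2 * w_max * j / 8
            nth = th + w * dt               # simple one-step motion model
            nx = x + v * math.cos(nth) * dt
            ny = y + v * math.sin(nth) * dt
            clear = min((math.hypot(nx - ox, ny - oy) for ox, oy in obstacles),
                        default=10.0)
            if clear < 0.3:                 # would collide: discard sample
                continue
            # heading term: 0 when pointing straight at the goal (no angle
            # wrapping, acceptable for this small-angle sketch)
            heading = -abs(math.atan2(goal[1] - ny, goal[0] - nx) - nth)
            score = w_heading * heading + w_clear * min(clear, 2.0) + w_speed * v
            if score > best_score:
                best, best_score = (v, w), score
    return best
```

With the goal dead ahead and no obstacles, the loop selects full forward speed and zero rotation; an obstacle on the straight line forces a turn or a slowdown.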
12. Reinforcement Learning-Based Algorithm to Avoid Obstacles by the Anthropomorphic Robotic Arm. Appl Sci (Basel) 2022. [DOI: 10.3390/app12136629]
Abstract
In this paper, the application of a policy-gradient Reinforcement Learning (RL) method for obstacle avoidance is proposed. This method was successfully used to control the movements of a robot through trial-and-error interactions with its environment. An approach based on the Deep Deterministic Policy Gradient (DDPG) algorithm combined with Hindsight Experience Replay (HER) for avoiding obstacles is investigated. To ensure that the robot avoids obstacles and reaches the desired position as quickly and accurately as possible, a special approach to the training and architecture of two RL agents working simultaneously is proposed. This RL-based approach was first implemented in a simulation environment to control a 6-axis robot simulation model. The same algorithm was then used to control a real 6-DOF (degrees of freedom) robot, and the results obtained in simulation were compared with those obtained in laboratory conditions.
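The HER idea combined with DDPG in this entry can be sketched independently of the actor-critic machinery: failed episodes are relabeled as if the goal had been the state actually achieved, so they still yield useful reward signal. This uses the standard "final" relabeling strategy; the transition fields are hypothetical.

```python
def her_relabel(episode, reward_fn):
    """Hindsight Experience Replay, 'final' strategy: rewrite every transition
    in an episode with the goal replaced by the state achieved at the end,
    recomputing rewards against that hindsight goal."""
    achieved = episode[-1]["next_state"]
    relabeled = []
    for t in episode:
        r = dict(t)                       # keep the original transition intact
        r["goal"] = achieved
        r["reward"] = reward_fn(t["next_state"], achieved)
        relabeled.append(r)
    return relabeled
```

With a sparse reward (0 at the goal, -1 elsewhere), the relabeled final transition always earns 0, which is exactly why HER accelerates learning on sparse-reward reaching tasks.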
13. Yang J, Ni J, Li Y, Wen J, Chen D. The Intelligent Path Planning System of Agricultural Robot via Reinforcement Learning. Sensors (Basel) 2022; 22:4316. [PMID: 35746099] [PMCID: PMC9227048] [DOI: 10.3390/s22124316]
Abstract
Agricultural robots are one of the important means to promote agricultural modernization and improve agricultural efficiency. With the development of artificial intelligence technology and the maturity of Internet of Things (IoT) technology, higher requirements are placed on the intelligence of robots. Agricultural robots must have intelligent control functions in agricultural scenarios and be able to autonomously decide paths to complete agricultural tasks. In response to this requirement, this paper proposes a Residual-like Soft Actor Critic (R-SAC) algorithm for agricultural scenarios to realize safe obstacle avoidance and intelligent path planning. In addition, to alleviate the time-consuming exploration process of reinforcement learning, this paper proposes an offline expert-experience pre-training method, which improves training efficiency. Moreover, this paper optimizes the reward mechanism of the algorithm using multi-step TD-error, which addresses a potential dilemma during training. Experiments verify that our proposed method performs stably in both static and dynamic obstacle environments and is superior to other reinforcement learning algorithms. It is a stable and efficient path-planning method with clear application potential for agricultural robots.
Affiliation(s)
- Jiachen Yang
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China; (J.Y.); (J.N.); (J.W.); (D.C.)
- Jingfei Ni
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China; (J.Y.); (J.N.); (J.W.); (D.C.)
- Yang Li
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China; (J.Y.); (J.N.); (J.W.); (D.C.)
- College of Mechanical and Electrical Engineering, Shihezi University, Shihezi 832003, China
- Jiabao Wen
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China; (J.Y.); (J.N.); (J.W.); (D.C.)
- Desheng Chen
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China; (J.Y.); (J.N.); (J.W.); (D.C.)
14. A Path-Planning Approach Based on Potential and Dynamic Q-Learning for Mobile Robots in Unknown Environment. Comput Intell Neurosci 2022; 2022:2540546. [PMID: 35694567] [PMCID: PMC9184183] [DOI: 10.1155/2022/2540546]
Abstract
The path-planning approach plays an important role in determining how long mobile robots can travel. To solve the path-planning problem of mobile robots in an unknown environment, a potential and dynamic Q-learning (PDQL) approach is proposed, which combines Q-learning with an artificial potential field and a dynamic reward function to generate a feasible path. The proposed algorithm significantly improves computing time and convergence speed compared with its classical counterpart. Experiments on simulated maps confirm that PDQL, when used for path planning of mobile robots in an unknown environment, outperforms state-of-the-art algorithms on two metrics: path length and turning angle. The simulation results show the effectiveness and practicality of the proposed approach for mobile robot path planning.
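The idea of combining Q-learning with an artificial potential field can be illustrated with potential-based reward shaping on a toy grid. This is a sketch under assumed parameters (negative Manhattan distance as the potential, a sparse goal reward), not the paper's exact reward design.

```python
import random

def train_pdql(size=5, goal=(4, 4), episodes=600, alpha=0.5, gamma=0.9, eps=0.3):
    """Tabular Q-learning on a small grid; a potential-based shaping term
    gamma*phi(s') - phi(s) is added to the sparse goal reward, so steps toward
    the goal earn positive reward without changing the optimal policy."""
    random.seed(0)
    actions = [(1, 0), (-1, 0), (0, 1), (0, -1)]
    def phi(s):  # potential: negative Manhattan distance to the goal
        return -(abs(goal[0] - s[0]) + abs(goal[1] - s[1]))
    q = {}
    for _ in range(episodes):
        s = (0, 0)
        for _ in range(50):
            if s == goal:
                break
            if random.random() < eps:                       # explore
                a = random.randrange(4)
            else:                                           # exploit
                a = max(range(4), key=lambda i: q.get((s, i), 0.0))
            dx, dy = actions[a]
            s2 = (min(max(s[0] + dx, 0), size - 1),
                  min(max(s[1] + dy, 0), size - 1))
            r = (10.0 if s2 == goal else 0.0) + gamma * phi(s2) - phi(s)
            best_next = 0.0 if s2 == goal else max(q.get((s2, i), 0.0)
                                                   for i in range(4))
            old = q.get((s, a), 0.0)
            q[(s, a)] = old + alpha * (r + gamma * best_next - old)
            s = s2
    return q

def greedy_path(q, size=5, start=(0, 0), goal=(4, 4), limit=30):
    """Roll out the greedy policy from the learned Q-table."""
    actions = [(1, 0), (-1, 0), (0, 1), (0, -1)]
    s, path = start, [start]
    while s != goal and len(path) < limit:
        a = max(range(4), key=lambda i: q.get((s, i), 0.0))
        dx, dy = actions[a]
        s = (min(max(s[0] + dx, 0), size - 1),
             min(max(s[1] + dy, 0), size - 1))
        path.append(s)
    return path
```

The shaping term rewards any step that reduces the distance-based potential, which is what gives the potential-field variant its faster convergence relative to plain sparse-reward Q-learning.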
15. Neural Tracking Control of a Four-Wheeled Mobile Robot with Mecanum Wheels. Appl Sci (Basel) 2022. [DOI: 10.3390/app12115322]
Abstract
This study designed an algorithm for the intelligent control of the motion of a mobile robot with mecanum wheels. After reviewing the kinematic and dynamic models of the robot, we synthesized the neural control algorithm, determining the network weight adaptation according to Lyapunov stability theory. Using a MATLAB/Simulink computing environment, we developed a numerical simulation of the robot's motion path under parametric disturbances acting on the control object. To assess the quality of the desired motion path's implementation, a numerical test of the robot's motion under a PD controller was conducted. The proposed control algorithm was verified on a laboratory stand equipped with a dSpace DS1103 controller board and a Husarion Panther four-wheeled mobile robot with mecanum wheels. The research confirmed that the robot implements the desired motion path more accurately when controlled with the intelligent control system.
16. Counterfactual-Based Action Evaluation Algorithm in Multi-Agent Reinforcement Learning. Appl Sci (Basel) 2022. [DOI: 10.3390/app12073439]
Abstract
Multi-agent reinforcement learning (MARL) algorithms have made great achievements in various scenarios, but many problems remain in solving sequential social dilemmas (SSDs). In SSDs, an agent's actions not only change the instantaneous state of the environment but also affect the latent state, which will, in turn, affect all agents. However, most current reinforcement learning algorithms focus on analyzing the value of the instantaneous environment state while ignoring the latent state, which is very important for establishing cooperation. Therefore, we propose a novel counterfactual reasoning-based multi-agent reinforcement learning algorithm to evaluate the continuous contribution of agent actions to the latent state. We compute this contribution using simulated reasoning and an action evaluation network. Through counterfactual reasoning, we then obtain a single agent's influence on the environment. Using this continuous contribution as an intrinsic reward enables the agent to consider the collective, thereby promoting cooperation. We conduct experiments in SSD environments, and the results show that the collective reward is increased by at least 25%, which demonstrates the excellent performance of our proposed algorithm compared with state-of-the-art algorithms.
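The counterfactual evaluation of a single agent's influence can be sketched abstractly: hold the other agents' actions fixed and replace this agent's action with a baseline, then difference the global rewards. The default action and the reward function here are hypothetical stand-ins for the paper's learned action-evaluation network.

```python
def counterfactual_contribution(global_reward_fn, joint_actions, agent,
                                default_action=0):
    """Agent's contribution as an intrinsic-reward signal: global reward under
    the actual joint action minus the reward had this agent taken a default
    (baseline) action, with all other agents' actions held fixed."""
    actual = global_reward_fn(joint_actions)
    cf = list(joint_actions)          # counterfactual joint action
    cf[agent] = default_action
    return actual - global_reward_fn(cf)
```

For example, with a global reward that simply sums the agents' action values, each agent's contribution reduces to its own action minus the baseline, which matches the intuition behind difference-rewards-style credit assignment.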
17. Decentralized Multi-Robot Collision Avoidance: A Systematic Review from 2015 to 2021. Symmetry (Basel) 2022. [DOI: 10.3390/sym14030610]
Abstract
An exploration task can be performed by a team of mobile robots more efficiently than by human counterparts. Robots can access and give live updates from hard-to-reach areas such as a disaster site or a sewer. However, they face issues that hinder optimal path planning, partly due to the symmetrical shape of the environments. Multiple robots are expected to explore more area in less time while solving robot localization and collision-avoidance issues. When deploying a multi-robot system, it must be ensured that the hardware parts do not collide with each other or the surroundings, especially in symmetric environments. Two types of collision-avoidance methods are used: centralized and decentralized. The decentralized approach has mainly been used in recent times, as it is computationally less expensive. This article conducts a systematic literature review of different collision-avoidance strategies and analyzes the performance of innovative collision-avoidance techniques. Methods such as Reinforcement Learning (RL), Model Predictive Control (MPC), and Altruistic Coordination, followed by the selected studies, are also discussed. A total of 17 studies are included in this review, extracted from seven databases. Two experimental designs are studied: empty/open space and confined indoor space. Our analysis found that most studies focused on empty/open-space scenarios and verified the proposed model only through simulation. ORCA is the primary method against which the state-of-the-art techniques are evaluated. This article compares the different methods used for multi-robot collision avoidance, discusses whether each method focuses on safety or path planning, and sheds light on the limitations of the included studies and possible future directions.
18. An Experimental Safety Response Mechanism for an Autonomous Moving Robot in a Smart Manufacturing Environment Using Q-Learning Algorithm and Speech Recognition. Sensors (Basel) 2022; 22:941. [PMID: 35161688] [PMCID: PMC8838134] [DOI: 10.3390/s22030941]
Abstract
The industrial manufacturing sector is undergoing a tremendous revolution, moving from traditional production processes to intelligent techniques. Under this revolution, known as Industry 4.0 (I40), a robot is no longer static equipment but an active part of the factory workforce alongside human operators. Safety becomes crucial for humans and robots to ensure a smooth production run in such environments. The loss of moving robots during a plant evacuation can be avoided by giving them an adequate safety induction. Operators receive frequent safety inductions on how to react in emergencies, but very little is done for robots. Our research proposes an experimental safety response mechanism for a small manufacturing plant, through which an autonomous robot learns the obstacle-free trajectory to the closest safety exit in an emergency. We implement a reinforcement learning (RL) algorithm, Q-learning, to enable the path-learning abilities of the robot. After obtaining the robot's optimal path options with Q-learning, we encode the outcome as a rule-based system for the safety response. We also program a speech recognition system so that operators can react in time, with a voice command, to an emergency that requires stopping all plant activities, even when they are far away from the emergency stop (ESTOP) buttons. An emergency signal can be issued to the factory either by an ESTOP or by a voice command sent directly to the factory's central controller. We tested this functionality on real hardware, a Siemens S7-1200 programmable logic controller (PLC), and simulated a small manufacturing environment to test our safety procedure. Our results show that the safety response mechanism successfully generates obstacle-free paths to the closest safety exits from all factory locations. Our research benefits any manufacturing SME intending to make initial use of autonomous mobile robots (AMRs) in its factory. It also helps manufacturing SMEs using legacy devices, such as traditional PLCs, by offering them intelligent strategies for incorporating current state-of-the-art technologies such as speech recognition to improve their performance, and thereby empowers SMEs to adopt advanced and innovative technological concepts within their operations.
|
19
|
Indoor Emergency Path Planning Based on the Q-Learning Optimization Algorithm. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2022. [DOI: 10.3390/ijgi11010066] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The internal structure of buildings is becoming increasingly complex. Providing a scientific and reasonable evacuation route for trapped persons in a complex indoor environment is important for reducing casualties and property losses. In emergency and disaster relief settings, indoor path planning carries great uncertainty and higher safety requirements. Q-learning is a value-based reinforcement learning algorithm that can complete path planning tasks through autonomous learning without establishing mathematical models or environmental maps. We therefore propose an indoor emergency path planning method based on an optimized Q-learning algorithm. First, a grid environment model is established. The Q-learning algorithm is optimized by discounting the exploration factor: the exploration factor in the ε-greedy strategy is dynamically adjusted before random actions are selected, which accelerates convergence in a large-scale grid environment. An indoor emergency path planning experiment based on the optimized Q-learning algorithm was carried out using both simulated data and real indoor environment data. The proposed optimized Q-learning algorithm essentially converges after 500 learning rounds, nearly 2000 rounds fewer than the classic Q-learning algorithm, while the SARSA algorithm shows no obvious convergence trend within 5000 learning rounds. The results show that the proposed algorithm is superior to the SARSA algorithm and the classic Q-learning algorithm in terms of solving time and convergence speed when planning the shortest path in a grid environment; its convergence is approximately five times faster than that of the classic Q-learning algorithm. The proposed optimized algorithm can successfully plan the shortest path around obstacle areas in a grid environment in a short time.
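The core mechanism described here, tabular Q-learning on a grid with an exploration factor that is decayed dynamically, can be sketched as follows. This is a generic illustration, not the authors' implementation: the reward values, decay schedule, and the `plan_grid_path` helper are assumptions chosen for the example.

```python
import numpy as np

def plan_grid_path(grid, start, goal, episodes=500,
                   alpha=0.1, gamma=0.9,
                   eps_start=1.0, eps_min=0.05, eps_decay=0.995):
    """Tabular Q-learning on a 2D occupancy grid (1 = obstacle, 0 = free),
    with an exponentially decayed epsilon-greedy exploration factor."""
    rows, cols = grid.shape
    actions = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
    q = np.zeros((rows, cols, len(actions)))
    rng = np.random.default_rng(0)
    eps = eps_start
    for _ in range(episodes):
        state = start
        for _ in range(4 * rows * cols):          # step cap per episode
            if rng.random() < eps:                # explore
                a = int(rng.integers(len(actions)))
            else:                                 # exploit
                a = int(np.argmax(q[state]))
            dr, dc = actions[a]
            nr, nc = state[0] + dr, state[1] + dc
            if not (0 <= nr < rows and 0 <= nc < cols) or grid[nr, nc]:
                reward, nxt = -10.0, state        # blocked move: stay put
            elif (nr, nc) == goal:
                reward, nxt = 100.0, (nr, nc)
            else:
                reward, nxt = -1.0, (nr, nc)      # per-step cost
            # standard one-step Q-learning update
            q[state][a] += alpha * (reward + gamma * np.max(q[nxt]) - q[state][a])
            state = nxt
            if state == goal:
                break
        eps = max(eps_min, eps * eps_decay)       # decay exploration factor
    # greedy rollout of the learned policy
    path, state = [start], start
    while state != goal and len(path) < rows * cols:
        dr, dc = actions[int(np.argmax(q[state]))]
        state = (state[0] + dr, state[1] + dc)
        path.append(state)
    return path
```

Decaying ε rather than holding it fixed is what the abstract credits for the faster convergence: early episodes explore broadly, later ones exploit the partially learned value table.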
|
20
|
Towards the Achievement of Path Planning with Multi-robot Systems in Dynamic Environments. J INTELL ROBOT SYST 2021. [DOI: 10.1007/s10846-021-01555-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
21
|
Bonny T, Kashkash M. Highly optimized Q‐learning‐based bees approach for mobile robot path planning in static and dynamic environments. J FIELD ROBOT 2021. [DOI: 10.1002/rob.22052] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Talal Bonny
- Department of Computer Engineering University of Sharjah Sharjah United Arab Emirates
- Mariam Kashkash
- Department of Computer Engineering University of Sharjah Sharjah United Arab Emirates
|
22
|
Moon J. Plugin Framework-Based Neuro-Symbolic Grounded Task Planning for Multi-Agent System. SENSORS (BASEL, SWITZERLAND) 2021; 21:7896. [PMID: 34883897 PMCID: PMC8659725 DOI: 10.3390/s21237896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 11/18/2021] [Accepted: 11/23/2021] [Indexed: 11/16/2022]
Abstract
As the roles of robots continue to expand, there is increasing demand for research on automated task planning for multi-agent systems that can independently execute tasks in wide and dynamic environments. This study introduces a plugin framework in which multiple robots can take part in task planning across a broad range of areas by combining symbolic and connectionist approaches. The symbolic approach, which understands and learns human knowledge, is useful for task planning in a wide and static environment, while the network-based connectionist approach has the advantage of responding to an ever-changing dynamic environment. A Planning Domain Definition Language-based planning algorithm, a symbolic approach, and a cooperative-competitive reinforcement learning algorithm, a connectionist approach, were utilized in this study. The proposed architecture is verified through simulation, and an experiment with 10 unmanned surface vehicles further verifies that the given tasks are successfully executed in a wide and dynamic environment.
Affiliation(s)
- Jiyoun Moon
- Department of Electronics Engineering, Chosun University, Gwangju 61452, Korea
|
23
|
An Improved Dueling Deep Double-Q Network Based on Prioritized Experience Replay for Path Planning of Unmanned Surface Vehicles. JOURNAL OF MARINE SCIENCE AND ENGINEERING 2021. [DOI: 10.3390/jmse9111267] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Unmanned Surface Vehicles (USVs) have broad application prospects, and autonomous path planning, their crucial enabling technology, has developed into an active research direction in the USV field. This paper proposes an Improved Dueling Deep Double-Q Network based on Prioritized Experience Replay (IPD3QN) to address the slow and unstable convergence of the traditional Deep Q-Network (DQN) algorithm in autonomous USV path planning. First, we use a deep double Q-network to decouple the selection of the next action from the calculation of its target Q-value, eliminating overestimation. Prioritized experience replay is adopted to draw samples from the replay buffer, increasing the utilization rate of informative samples and accelerating the training of the neural network. The network is then optimized by introducing a dueling architecture. Finally, soft updates are used to improve the stability of the algorithm, and a dynamic ϵ-greedy method is used to find the optimal strategy. The experiments are first conducted on the OpenAI Gym test platform to pre-validate the algorithm on two classical control problems, CartPole and MountainCar, and the impact of the hyperparameters on model performance is analyzed in detail. The algorithm is then validated in a maze environment. Comparative simulation experiments show that IPD3QN significantly improves learning performance in terms of convergence speed and convergence stability compared with DQN, D3QN, PD2QN, PDQN, and PD3QN, and that with IPD3QN a USV can plan an optimal path according to the actual navigation environment.
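The decoupling that IPD3QN inherits from double DQN, and the proportional priorities used by prioritized experience replay, each reduce to a few lines. The sketch below is a generic illustration of those two ideas, not the paper's code; the function names and constants are this example's own assumptions.

```python
import numpy as np

def dqn_target(q_target_next, reward, gamma, done):
    """Vanilla DQN target: the target network both selects and evaluates
    the next action, which tends to overestimate Q-values."""
    return reward + gamma * (1.0 - done) * np.max(q_target_next, axis=1)

def double_dqn_target(q_online_next, q_target_next, reward, gamma, done):
    """Double DQN target: the online network selects the action and the
    target network evaluates it, decoupling selection from evaluation."""
    best = np.argmax(q_online_next, axis=1)            # selection (online)
    evaluated = q_target_next[np.arange(len(best)), best]  # evaluation (target)
    return reward + gamma * (1.0 - done) * evaluated

def priority(td_error, eps=1e-3, alpha=0.6):
    """Proportional prioritized-replay priority from a TD error; eps keeps
    zero-error transitions sampleable, alpha tempers the prioritization."""
    return (np.abs(td_error) + eps) ** alpha
```

When the online and target networks disagree about the best next action, the double-DQN target evaluates the online network's choice with the (typically lower) target-network estimate, which is exactly the overestimation correction the abstract describes.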
|
24
|
Abstract
Nowadays, electrical machines and drive systems play an essential role in many applications, and various failures eventually occur during long-term continuous operation. Owing to the increased influence of such devices on industry, industrial branches, and everyday human life, condition monitoring and timely fault diagnostics have gained considerable importance. This review article studies different diagnostic techniques that can be used for algorithm training and the realization of predictive maintenance. The benefits and drawbacks of intelligent diagnostic techniques are highlighted, the most widespread faults of electrical machines are discussed, and techniques for parameter monitoring are introduced.
|
25
|
Wen S, Wen Z, Zhang D, Zhang H, Wang T. A multi-robot path-planning algorithm for autonomous navigation using meta-reinforcement learning based on transfer learning. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107605] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
26
|
Reinforcement-Learning-Based Route Generation for Heavy-Traffic Autonomous Mobile Robot Systems. SENSORS 2021; 21:s21144809. [PMID: 34300548 PMCID: PMC8309928 DOI: 10.3390/s21144809] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Revised: 07/11/2021] [Accepted: 07/12/2021] [Indexed: 11/19/2022]
Abstract
Autonomous mobile robots (AMRs) are increasingly used in modern intralogistics systems as complexity and performance requirements become more stringent. One way to increase performance is to improve the operation and cooperation of multiple robots in their shared environment. The paper addresses these problems with a method for off-line route planning and on-line route execution. In the proposed approach, pre-computation of routes for frequent pick-up and drop-off locations limits the movements of AMRs to avoid conflict situations between them. The paper proposes a reinforcement learning approach where an agent builds the routes on a given layout while being rewarded according to different criteria based on the desired characteristics of the system. The results show that the proposed approach performs better in terms of throughput and reliability than the commonly used shortest-path-based approach for a large number of AMRs operating in the system. The use of the proposed approach is recommended when the need for high throughput requires the operation of a relatively large number of AMRs in relation to the size of the space in which the robots operate.
|
27
|
Research on Motion Planning Based on Flocking Control and Reinforcement Learning for Multi-Robot Systems. MACHINES 2021. [DOI: 10.3390/machines9040077] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Robots have poor adaptive ability with respect to formation control and obstacle avoidance in unknown complex environments. To address this problem, we propose a new motion planning method based on flocking control and reinforcement learning. Flocking control is used to implement orderly multi-robot motion; to avoid the local-minimum traps of the potential fields used in flocking control, the flocking controller is optimized and a wall-following behavior strategy is designed. Reinforcement learning is adopted to implement robotic behavioral decisions and to enhance the analytical and predictive abilities of the robot during motion planning in an unknown environment. A visual simulation platform is developed on which researchers can test algorithms for multi-robot motion control, such as obstacle avoidance, formation control, path planning, and reinforcement learning strategies. As shown by the simulation experiments, the proposed motion planning method can enhance the ability of multi-robot systems to self-learn and self-adapt in a fully unknown environment with complex obstacles.
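The flocking-control half of such a method is typically built from Reynolds' three rules: separation, alignment, and cohesion. The sketch below is a generic single-integrator illustration of one such update, not the paper's controller; the weights, neighbourhood radius, and speed cap are illustrative assumptions.

```python
import numpy as np

def flocking_step(pos, vel, r=2.0, w_sep=1.5, w_ali=1.0, w_coh=1.0,
                  dt=0.1, v_max=1.0):
    """One Reynolds-style flocking update for n robots.
    pos, vel: (n, 2) arrays; neighbours are robots within radius r."""
    n = len(pos)
    acc = np.zeros_like(pos)
    for i in range(n):
        d = pos - pos[i]
        dist = np.linalg.norm(d, axis=1)
        nbr = (dist > 0) & (dist < r)
        if not nbr.any():
            continue
        sep = -np.sum(d[nbr] / dist[nbr, None] ** 2, axis=0)  # repel close robots
        ali = vel[nbr].mean(axis=0) - vel[i]                  # match neighbour velocity
        coh = pos[nbr].mean(axis=0) - pos[i]                  # move toward centroid
        acc[i] = w_sep * sep + w_ali * ali + w_coh * coh
    vel = vel + dt * acc
    speed = np.linalg.norm(vel, axis=1, keepdims=True)
    vel = np.where(speed > v_max, vel * v_max / speed, vel)   # cap speed
    return pos + dt * vel, vel
```

The inverse-square separation term is what creates the potential-field-like repulsion — and also the local-minimum traps the abstract mentions, which the paper's wall-following strategy is designed to escape.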
|
28
|
Abstract
Most path planning algorithms currently used in multi-robot systems are based on offline planning. The Timed Enhanced A* (TEA*) algorithm makes real-time planning possible, rather than planning in advance, by using a temporal estimate of the robots' positions at any given time. This article presents the implementation of a control system for multi-robot applications that operate in environments where communication faults can occur and where entire sections of the environment may have no connection to the communication network. The system uses TEA* to plan the robots' paths and a supervision system to control communications; the supervision system monitors communication with the robots and checks whether their movements are synchronized. The implemented system creates and executes robot paths that are both safe and retain the temporal efficiency of the TEA* algorithm. Using the SimTwo2020 simulation software, which is capable of simulating movement dynamics, and the Lazarus development environment, it was possible to simulate the execution of several different missions with the implemented system and analyze the results.
|
29
|
Faryadi S, Mohammadpour Velni J. A reinforcement learning‐based approach for modeling and coverage of an unknown field using a team of autonomous ground vehicles. INT J INTELL SYST 2020. [DOI: 10.1002/int.22331] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Saba Faryadi
- School of Electrical & Computer Engineering University of Georgia Athens Georgia USA
|
30
|
Prianto E, Kim M, Park JH, Bae JH, Kim JS. Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor-Critic with Hindsight Experience Replay. SENSORS 2020; 20:s20205911. [PMID: 33086774 PMCID: PMC7590214 DOI: 10.3390/s20205911] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Revised: 10/14/2020] [Accepted: 10/17/2020] [Indexed: 11/16/2022]
Abstract
Since path planning for multi-arm manipulators is a complicated high-dimensional problem, generating paths effectively and quickly for arbitrarily given start and goal locations of the end effector is not easy. In deep reinforcement learning-based path planning in particular, high dimensionality makes it difficult for existing methods to explore efficiently, which is crucial for successful training. The recently proposed soft actor-critic (SAC) is well known for its good exploration ability, owing to the entropy term in its objective function. Motivated by this, this paper proposes a SAC-based path planning algorithm. Hindsight experience replay (HER) is employed for sample efficiency, and configuration-space augmentation is used to deal with the complicated configuration space of the multiple arms. Both simulation and experimental results are given to show the effectiveness of the proposed algorithm, and comparisons demonstrate that it outperforms existing methods.
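The HER component mentioned here has a compact core: transitions from failed rollouts are relabelled with goals that were actually achieved later in the episode, so sparse-reward tasks still produce learning signal. The sketch below illustrates the common "future" goal-selection strategy in a generic form; the transition layout and the `reward_fn` interface are assumptions of this example, not the paper's code.

```python
import numpy as np

def her_relabel(episode, reward_fn, k=4, rng=None):
    """'Future'-strategy hindsight relabelling.
    episode: list of (state, action, reward, next_state, goal) tuples.
    For each transition, k goals are resampled from states the agent
    actually reached later in the same episode."""
    if rng is None:
        rng = np.random.default_rng()
    relabelled = []
    for t, (s, a, _r, s_next, _g) in enumerate(episode):
        # sample k time indices from the future of this transition
        future = rng.integers(t, len(episode), size=k)
        for f in future:
            new_goal = episode[f][3]            # achieved state at step f
            new_r = reward_fn(s_next, new_goal)  # recompute sparse reward
            relabelled.append((s, a, new_r, s_next, new_goal))
    return relabelled
```

Because the resampled goal is, by construction, a state the episode really reached, some relabelled transitions are guaranteed to carry success rewards even when the original goal was never achieved.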
Affiliation(s)
- Evan Prianto
- Research Center for Electrical and Information Technology, Department of Electrical and Information Engineering, Seoul National University of Science and Technology, Seoul 01811, Korea; (E.P.); (M.K.)
- MyeongSeop Kim
- Research Center for Electrical and Information Technology, Department of Electrical and Information Engineering, Seoul National University of Science and Technology, Seoul 01811, Korea; (E.P.); (M.K.)
- Jae-Han Park
- Applied Robot R&D Department, Korea Institute of Industrial Technology (KITECH), Ansan 15588, Korea; (J.-H.P.); (J.-H.B.)
- Ji-Hun Bae
- Applied Robot R&D Department, Korea Institute of Industrial Technology (KITECH), Ansan 15588, Korea; (J.-H.P.); (J.-H.B.)
- Jung-Su Kim
- Research Center for Electrical and Information Technology, Department of Electrical and Information Engineering, Seoul National University of Science and Technology, Seoul 01811, Korea; (E.P.); (M.K.)
- Correspondence: ; Tel.: +82-2-970-6547
|
31
|
An Overview of Reinforcement Learning Methods for Variable Speed Limit Control. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10144917] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Variable Speed Limit (VSL) control systems are widely studied as solutions for improving safety and throughput on urban motorways. Machine learning techniques, specifically Reinforcement Learning (RL) methods, are a promising way to set up VSL, since they can learn and react to different traffic situations without knowing an explicit model of the motorway dynamics. However, the efficiency of combined RL-VSL depends strongly on the class of RL algorithm used and on the description of the managed motorway section in which the RL-VSL agent sets the appropriate speed limits. No overview of RL algorithm applications in the VSL domain currently exists, so a comprehensive survey of the state of the art of RL-VSL is presented. Best practices are summarized, and new viewpoints and future research directions, including an overview of current open research questions, are presented.
|
32
|
A Fuzzy Analytic Hierarchy Process and Cooperative Game Theory Combined Multiple Mobile Robot Navigation Algorithm. SENSORS 2020; 20:s20102827. [PMID: 32429339 PMCID: PMC7288072 DOI: 10.3390/s20102827] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Revised: 05/13/2020] [Accepted: 05/15/2020] [Indexed: 11/17/2022]
Abstract
This study presents a multi-robot navigation strategy based on a multi-objective decision-making algorithm, the Fuzzy Analytic Hierarchy Process (FAHP). FAHP analytically selects an optimal position as a sub-goal among points on the sensing boundary of a mobile robot considering the following three objectives: the travel distance to the target, collision safety with obstacles, and the rotation of the robot to face the target. Alternative solutions are evaluated by quantifying the relative importance of the objectives. As the FAHP algorithm is insufficient for multi-robot navigation, cooperative game theory is added to improve it. The performance of the proposed multi-robot navigation algorithm is tested with up to 12 mobile robots in several simulation conditions, altering factors such as the number of operating robots and the warehouse layout.
|
33
|
Abstract
In recent years, the presence of mobile robots in diverse scenarios has considerably increased, to solve a variety of tasks [...]
|
34
|
Motion Planning of Robot Manipulators for a Smoother Path Using a Twin Delayed Deep Deterministic Policy Gradient with Hindsight Experience Replay. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10020575] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
To enhance the performance of robot systems in the manufacturing industry, it is essential to develop motion and task planning algorithms. In particular, it is important for the motion plan to be generated automatically in order to deal with various working environments. Although the PRM (Probabilistic Roadmap) method provides feasible paths when the start and goal positions of a robot manipulator are given, the paths may not be smooth enough, which can lead to inefficient performance of the robot system. This paper proposes a motion planning algorithm for robot manipulators using the twin delayed deep deterministic policy gradient (TD3), a reinforcement learning algorithm tailored to MDPs with continuous actions. Since path planning for a robot manipulator is an MDP (Markov Decision Process) with sparse rewards, hindsight experience replay (HER) is employed in TD3 to enhance sample efficiency. The proposed algorithm is applied to 2-DOF and 3-DOF manipulators, and the designed paths are shown to be smoother and shorter than those designed by PRM.
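Two of TD3's key ingredients, the clipped double-Q target and target policy smoothing, can be written down directly. The sketch below is a generic illustration under assumed array shapes and default hyperparameters, not the paper's implementation.

```python
import numpy as np

def td3_target(reward, done, next_q1, next_q2, gamma=0.99):
    """Clipped double-Q target: taking the minimum of two target critics
    counters the overestimation bias of a single critic."""
    return reward + gamma * (1.0 - done) * np.minimum(next_q1, next_q2)

def smoothed_target_action(mu_next, noise_std=0.2, noise_clip=0.5,
                           act_limit=1.0, rng=None):
    """Target policy smoothing: add clipped Gaussian noise to the target
    action so the critic cannot be exploited at sharp Q-value peaks."""
    if rng is None:
        rng = np.random.default_rng()
    noise = np.clip(rng.normal(0.0, noise_std, mu_next.shape),
                    -noise_clip, noise_clip)
    return np.clip(mu_next + noise, -act_limit, act_limit)
```

The third ingredient, delayed policy updates, is a training-loop detail: the actor and target networks are updated only once every few critic updates, which stabilizes learning on continuous-action MDPs such as manipulator path planning.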
|