1
Dong S, Li C, Yang S, An B, Li W, Gao Y. Egoism, utilitarianism and egalitarianism in multi-agent reinforcement learning. Neural Netw 2024;178:106544. PMID: 39053197. DOI: 10.1016/j.neunet.2024.106544.
Abstract
In multi-agent partially observable sequential decision problems with general-sum rewards, it is necessary to account simultaneously for egoism (individual rewards), utilitarianism (social welfare), and egalitarianism (fairness). However, balancing these criteria poses a challenge for current multi-agent reinforcement learning methods. Specifically, fully decentralized methods, lacking global information about all agents' rewards, observations, and actions, fail to learn a balanced policy, while agents in centralized-training (with decentralized execution) methods are reluctant to share private information for fear of exploitation by others. To address these issues, this paper proposes a Decentralized and Federated (D&F) paradigm in which decentralized agents train egoistic policies using only local information to pursue self-interest, while the federation controller primarily considers utilitarianism and egalitarianism. The parameters of the decentralized and federated policies are mutually optimized under discrepancy constraints, akin to a server-client pattern, which ensures a balance between egoism, utilitarianism, and egalitarianism. Furthermore, theoretical analysis shows that the federated model, as well as the discrepancy between decentralized egoistic policies and federated utilitarian policies, achieves an O(1/T) convergence rate. Extensive experiments show that the D&F approach outperforms multiple baselines in terms of both utilitarianism and egalitarianism.
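The server-client pattern described in this abstract can be illustrated with a proximal discrepancy penalty on toy linear policy parameters. This is a minimal sketch of the general idea with invented gradients and weights, not the authors' D&F algorithm:

```python
import numpy as np

def local_step(theta_i, grad_ego, theta_fed, lr=0.1, mu=0.5):
    """Decentralized update: follow the egoistic gradient while a proximal
    discrepancy term pulls the local policy toward the federated one."""
    return theta_i - lr * (grad_ego + mu * (theta_i - theta_fed))

def federated_step(thetas):
    """Federation controller: aggregate client policies (utilitarian average)."""
    return np.mean(thetas, axis=0)

# Two toy agents with opposing egoistic gradients.
theta_fed = np.zeros(2)
thetas = [np.array([1.0, 0.0]), np.array([-1.0, 0.0])]
grads = [np.array([-0.2, 0.0]), np.array([0.2, 0.0])]
for _ in range(50):
    thetas = [local_step(t, g, theta_fed) for t, g in zip(thetas, grads)]
    theta_fed = federated_step(thetas)
```

The proximal weight `mu` bounds how far each egoistic policy may drift from the federated consensus: here the two agents settle near ±0.4 while the federated policy stays at the utilitarian midpoint.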
Affiliation(s)
- Shaokang Dong, State Key Laboratory for Novel Software Technology, Nanjing University, China
- Chao Li, State Key Laboratory for Novel Software Technology, Nanjing University, China
- Shangdong Yang, School of Computer Science, Nanjing University of Posts and Telecommunications, China
- Bo An, School of Computer Science and Engineering, Nanyang Technological University, Singapore
- Wenbin Li, State Key Laboratory for Novel Software Technology, Nanjing University, China; Shenzhen Research Institute of Nanjing University, China
- Yang Gao, State Key Laboratory for Novel Software Technology, Nanjing University, China
2
Croll HC, Ikuma K, Ong SK, Sarkar S. Unified control of diverse actions in a wastewater treatment activated sludge system using reinforcement learning for multi-objective optimization. Water Res 2024;263:122179. PMID: 39096812. DOI: 10.1016/j.watres.2024.122179.
Abstract
The operation of modern wastewater treatment facilities is a balancing act in which a multitude of variables are controlled to achieve a wide range of objectives, many of which conflict. This is especially true within secondary activated sludge systems, where significant research and industry effort has been devoted to advancing control optimization strategies, both domain-driven and data-driven. Among data-driven control strategies, reinforcement learning (RL) stands out for its ability to achieve better-than-human performance in complex environments. While RL has been applied to activated sludge process optimization in the existing literature, these applications are typically limited in scope, never controlling more than three actions. Expanding the scope of RL control has the potential to increase optimization gains while concurrently reducing the number of control systems that must be tuned and maintained by operations staff. This study examined several facets of implementing multi-action, multi-objective RL agents, namely how many actions a single agent could successfully control and how much environment data was necessary to train such agents. We observed improved control optimization with increasing action scope, though control of waste activated sludge remains a challenge. Furthermore, agents were able to maintain a high level of performance under decreased observation scope, up to a point. When compared to baseline control of the Benchmark Simulation Model No. 1 (BSM1), an RL agent controlling seven individual actions improved the average BSM1 performance metric by 8.3%, equivalent to an annual cost savings of $40,200 after accounting for the cost of additional sensors.
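The multi-objective scalarization at the heart of such an agent's reward can be sketched with a toy function; the weights and magnitudes below are hypothetical illustrations, not BSM1's actual performance metric:

```python
# Hypothetical BSM1-style scalarization: effluent quality and energy use
# are folded into one reward so a single RL agent can trade them off
# across all of its control actions. Weights are illustrative only.
def reward(effluent_quality_index, aeration_kwh, pumping_kwh,
           w_eqi=1.0, w_energy=0.1):
    energy = aeration_kwh + pumping_kwh
    return -(w_eqi * effluent_quality_index + w_energy * energy)

baseline = reward(6000.0, 3500.0, 250.0)
tuned = reward(5800.0, 3300.0, 240.0)   # better effluent and less energy
```

A single scalar reward is what lets one agent, rather than several independently tuned controllers, coordinate many actions against conflicting objectives.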
Affiliation(s)
- Henry C Croll, Department of Civil, Construction, and Environmental Engineering, Iowa State University, Ames, IA 50011, USA
- Kaoru Ikuma, Department of Civil, Construction, and Environmental Engineering, Iowa State University, Ames, IA 50011, USA
- Say Kee Ong, Department of Civil, Construction, and Environmental Engineering, Iowa State University, Ames, IA 50011, USA
- Soumik Sarkar, Department of Mechanical Engineering, Iowa State University, Ames, IA 50011, USA
3
Cao Y, Xu B, Li B, Fu H. Advanced Design of Soft Robots with Artificial Intelligence. Nano-Micro Lett 2024;16:214. PMID: 38869734. DOI: 10.1007/s40820-024-01423-3.
Affiliation(s)
- Ying Cao, Nanotechnology Center, School of Fashion and Textiles, The Hong Kong Polytechnic University, Hong Kong 999077, People's Republic of China
- Bingang Xu, Nanotechnology Center, School of Fashion and Textiles, The Hong Kong Polytechnic University, Hong Kong 999077, People's Republic of China
- Bin Li, Bioinspired Engineering and Biomechanics Center, Xi'an Jiaotong University, Xi'an 710049, People's Republic of China
- Hong Fu, Department of Mathematics and Information Technology, The Education University of Hong Kong, Hong Kong 999077, People's Republic of China
4
Zhou W, Wu L, Gao Y, Chen X. A Dynamic Window Method Based on Reinforcement Learning for SSVEP Recognition. IEEE Trans Neural Syst Rehabil Eng 2024;32:2114-2123. PMID: 38829754. DOI: 10.1109/tnsre.2024.3408273.
Abstract
Steady-state visual evoked potential (SSVEP) is one of the most widely used brain-computer interface (BCI) paradigms. Conventional methods analyze SSVEPs at a fixed window length. Compared with these methods, dynamic window methods can achieve a higher information transfer rate (ITR) by selecting an appropriate window length. These methods evaluate the credibility of the current result by linear discriminant analysis (LDA) or Bayesian estimation and extend the window length until a credible result is obtained. However, the hypotheses introduced by LDA and Bayesian estimation may not align with real-world SSVEP recordings, which can lead to an inappropriate window length. To address this issue, we propose a novel dynamic window method based on reinforcement learning (RL). The proposed method optimizes the decision of whether to extend the window length based on the impact of that decision on the ITR, without additional hypotheses; the decision model automatically learns a strategy that maximizes the ITR through trial and error. In addition, whereas traditional methods manually extract features, the proposed method uses neural networks to automatically extract features for the dynamic selection of window length. The proposed method can therefore more accurately decide whether to extend the window and select an appropriate length. To verify performance, we compared the method with other dynamic window methods on two public SSVEP datasets. The experimental results demonstrate that the proposed method achieves the highest performance.
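The accuracy-versus-time trade-off the RL agent optimizes can be made concrete with the standard Wolpaw ITR formula; the 40-target count and the accuracy figures below are illustrative values, not results from this paper:

```python
import math

def itr_bits_per_min(n_targets, accuracy, window_s):
    """Wolpaw information transfer rate for an n_targets-class speller."""
    p, n = accuracy, n_targets
    if p <= 1.0 / n:
        return 0.0
    bits = math.log2(n)
    if p < 1.0:
        bits += p * math.log2(p) + (1 - p) * math.log2((1 - p) / (n - 1))
    return bits * 60.0 / window_s

# A longer window usually raises accuracy but costs time; a dynamic
# window method decides per trial whether the gain is worth the delay.
short = itr_bits_per_min(40, 0.80, 1.0)
extended = itr_bits_per_min(40, 0.95, 2.0)
```

In this example doubling the window for a 15-point accuracy gain actually lowers the ITR, which is exactly the kind of unfavorable extension the learned stopping policy must avoid.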
5
Huang J, Guo X. A Novel Method of UAV-Assisted Trajectory Localization for Forestry Environments. Sensors (Basel) 2024;24:3398. PMID: 38894189. PMCID: PMC11174491. DOI: 10.3390/s24113398.
Abstract
Global positioning systems often fall short in dense forest environments, driving demand for innovative localization methods. Existing methods suffer from two limitations: (1) traditional localization frameworks require several fixed anchors to estimate target locations, which is difficult to satisfy in complex and uncertain forestry environments; and (2) the uncertain environment severely degrades the quality of signal measurements and thus localization accuracy. To address these limitations, this paper proposes a new UAV-assisted trajectory localization method for forestry environments. Based on a multi-agent DRL technique, the topology of the UAVs is optimized in real time for high-accuracy target localization. Then, using RSS measurements from the UAVs to the target, the least squares algorithm estimates the location, which is more flexible and reliable than existing localization systems. Furthermore, a shared replay memory is incorporated into the proposed multi-agent DRL system, which effectively enhances learning performance and efficiency. Simulation results show that the proposed method yields a flexible, high-accuracy localization system, exhibits robustness against high-dimensional heterogeneous data, and is well suited to forestry environments.
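The RSS-to-distance inversion and least-squares step can be sketched in a noiseless toy setting. The log-distance path-loss parameters and UAV positions below are invented for illustration, and the lateration is the standard linearization of the range circles, not necessarily the authors' exact estimator:

```python
import numpy as np

def rss_to_distance(rss, p0=-40.0, n=2.0):
    """Invert the log-distance path-loss model: rss = p0 - 10 n log10(d)."""
    return 10 ** ((p0 - rss) / (10 * n))

def lateration(anchors, dists):
    """Linearized least squares: subtract the last anchor's circle equation
    from the others to obtain a linear system in the target position."""
    A = 2 * (anchors[-1] - anchors[:-1])
    b = (dists[:-1] ** 2 - dists[-1] ** 2
         - np.sum(anchors[:-1] ** 2, axis=1) + np.sum(anchors[-1] ** 2))
    return np.linalg.lstsq(A, b, rcond=None)[0]

# Four simulated UAV anchors and a ground target at (3, 4), no noise.
anchors = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0], [10.0, 10.0]])
target = np.array([3.0, 4.0])
true_d = np.linalg.norm(anchors - target, axis=1)
rss = -40.0 - 20.0 * np.log10(true_d)
est = lateration(anchors, rss_to_distance(rss))
```

Because the UAVs move, the anchor matrix is rebuilt every step, which is what lets the DRL layer reshape the topology for better-conditioned least-squares geometry.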
Affiliation(s)
- Xiansheng Guo, Department of Electronic Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
6
Amin S, Uddin MI, Alarood AA, Mashwani WK, Alzahrani AO, Alzahrani HA. An adaptable and personalized framework for top-N course recommendations in online learning. Sci Rep 2024;14:10382. PMID: 38710728. DOI: 10.1038/s41598-024-56497-1.
Abstract
In recent years, Massive Open Online Course (MOOC) platforms have proliferated remarkably on a global scale, and learners can now meet their learning demands with their help. However, learners may not absorb course material well when faced with an overload of information, owing to limited expertise and cognitive capacity. Personalized Recommender Systems (RSs), a cutting-edge technology, can help address this issue, greatly improving resource acquisition through personalized availability for people of all ages. Intelligent learning methods such as machine learning and Reinforcement Learning (RL) can be applied to RS challenges. However, machine learning needs supervised data, and classical RL is not suitable for multi-task recommendations on online learning platforms. To address these challenges, the proposed framework integrates Deep Reinforcement Learning (DRL) with a multi-agent approach. This adaptive system personalizes the learning experience by considering key factors such as learner sentiment, learning style, preferences, competency, and adaptive difficulty levels. We formulate the interactive RS problem using a DRL-based actor-critic model named DRR, treating recommendation as a sequential decision-making process. DRR enables the system to provide top-N course recommendations and personalized learning paths, enriching the student's experience. Extensive experiments on a MOOC dataset, the 100K Coursera course reviews, validate the proposed DRR model, demonstrating its superiority over baseline models in major evaluation metrics for long-term recommendations. The outcomes of this research contribute to the field of e-learning technology, guiding the design and implementation of course RSs to facilitate personalized, relevant recommendations for online learners.
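The top-N step of an actor-critic recommender of this kind is often implemented by ranking item embeddings by inner product with the actor's output vector. The 2-d course embeddings and preference vector below are invented for illustration, not the DRR model's learned representations:

```python
import numpy as np

# Hypothetical course embeddings over two latent topics.
courses = np.array([
    [1.0, 0.0],   # course 0: pure "programming"
    [0.0, 1.0],   # course 1: pure "statistics"
    [0.9, 0.1],   # course 2: mostly programming
    [0.5, 0.5],   # course 3: mixed
])
actor_action = np.array([1.0, 0.2])   # actor output: leans toward programming

# Rank all courses by inner-product score and keep the top N.
scores = courses @ actor_action
top_n = np.argsort(scores)[::-1][:2]
```

The critic then scores the state-action pair so the actor's preference vector, and hence the whole ranking, improves over sequential interactions.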
Affiliation(s)
- Samina Amin, Institute of Computing, Kohat University of Science and Technology (KUST), Kohat 26000, Pakistan
- M Irfan Uddin, Institute of Computing, Kohat University of Science and Technology (KUST), Kohat 26000, Pakistan
- Wali Khan Mashwani, Institute of Numerical Sciences, Kohat University of Science and Technology (KUST), Kohat 26000, Pakistan
- Ahmed Omar Alzahrani, Faculty of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia
7
Liu P, Guo Y, Liu P, Ding H, Cao J, Zhou J, Feng Z. What can we learn from the AV crashes? An association rule analysis for identifying the contributing risky factors. Accid Anal Prev 2024;199:107492. PMID: 38428241. DOI: 10.1016/j.aap.2024.107492.
Abstract
The objective of this study is to explore the risk factors contributing to Autonomous Vehicle (AV) crashes and their interdependencies. AV crash data between 2015 and 2023 were collected from the autonomous vehicle collision reports published by the California Department of Motor Vehicles (DMV). AV crashes were categorized into four types based on vehicle damage. AV crash features, including crash location and time, driving mode, vehicle movements, crash type and vehicle damage, and traffic conditions, were used as potential risk factors. Association Rule Mining (ARM) methods were utilized to identify sets of contributing risk factors that often occur together in AV crashes. Several association rules suggest that AV crashes result from complex interactions between road factors, vehicle factors, and environmental conditions. No-damage and minor crashes are more likely affected by road features and traffic conditions. In contrast, vehicle movements are more sensitive indicators of severe AV crashes, and improper vehicle operations could increase the probability of severe crashes. In addition, the results suggest that adverse weather conditions could increase crash damage: AV interactions with roadside infrastructure or vulnerable road users on wet road surfaces at night could lead to significant loss of life and property. Furthermore, the safety effects of driving mode on AV crash damage are revealed; in some contexts, the autonomous driving mode can mitigate crash damage compared with the conventional driving mode. The findings can inform policy measures and engineering countermeasures that improve the safety and efficiency of AVs on the road, ultimately improving road transportation's overall safety and reliability.
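The core ARM metrics (support, confidence, lift) are easy to compute on a toy crash table; the five records and factor labels below are hypothetical, not the DMV data:

```python
# Toy crash records: each record is a set of categorical factors.
crashes = [
    {"night", "wet_road", "severe"},
    {"night", "wet_road", "severe"},
    {"day", "dry_road", "minor"},
    {"night", "dry_road", "minor"},
    {"day", "wet_road", "severe"},
]

def support(itemset):
    """Fraction of records containing every item in the set."""
    return sum(itemset <= c for c in crashes) / len(crashes)

def rule_metrics(antecedent, consequent):
    """Support, confidence and lift for the rule antecedent -> consequent."""
    s_ac = support(antecedent | consequent)
    conf = s_ac / support(antecedent)
    lift = conf / support(consequent)
    return s_ac, conf, lift

s, conf, lift = rule_metrics({"night", "wet_road"}, {"severe"})
```

A lift above 1 means the antecedent factors co-occur with the consequent more often than chance, which is how rules like "wet road at night -> severe crash" are surfaced.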
Affiliation(s)
- Pei Liu, School of Transportation, Southeast University, Nanjing 211189, China
- Yanyong Guo, School of Transportation, Southeast University, Nanjing 211189, China
- Pan Liu, School of Transportation, Southeast University, Nanjing 211189, China
- Hongliang Ding, Institute of Smart City and Intelligent Transportation, Institute of Urban Rail Transportation, Southwest Jiaotong University, Chengdu 611730, China
- Jiandong Cao, China Academy of Transportation Sciences, #1, Building 10, Hepingli East Street, Chaoyang District, Beijing 100029, China
- Jibiao Zhou, Ningbo High-level Highway Construction Management Center, No. 396 Songjiangzhong Road, Ningbo, Zhejiang 315211, China
- Zhongxiang Feng, School of Automobile and Traffic Engineering, Hefei University of Technology, Hefei 230009, Anhui, China
8
Ding S, Du W, Ding L, Zhang J, Guo L, An B. Robust Multi-Agent Communication With Graph Information Bottleneck Optimization. IEEE Trans Pattern Anal Mach Intell 2024;46:3096-3107. PMID: 38019627. DOI: 10.1109/tpami.2023.3337534.
Abstract
Recent research on multi-agent reinforcement learning (MARL) has shown that action coordination of multi-agents can be significantly enhanced by introducing communication learning mechanisms. Meanwhile, graph neural network (GNN) provides a promising paradigm for communication learning of MARL. Under this paradigm, agents and communication channels can be regarded as nodes and edges in the graph, and agents can aggregate information from neighboring agents through GNN. However, this GNN-based communication paradigm is susceptible to adversarial attacks and noise perturbations, and how to achieve robust communication learning under perturbations has been largely neglected. To this end, this paper explores this problem and introduces a robust communication learning mechanism with graph information bottleneck optimization, which can optimally realize the robustness and effectiveness of communication learning. We introduce two information-theoretic regularizers to learn the minimal sufficient message representation for multi-agent communication. The regularizers aim at maximizing the mutual information (MI) between the message representation and action selection while minimizing the MI between the agent feature and message representation. Besides, we present a MARL framework that can integrate the proposed communication mechanism with existing value decomposition methods. Experimental results demonstrate that the proposed method is more robust and efficient than state-of-the-art GNN-based MARL methods.
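In the usual information-bottleneck notation, the two regularizers paraphrased above amount to learning message parameters that keep the message predictive of actions while compressing away agent-specific detail. Writing X for the agent feature, M for the message representation, A for the action selection, and β for a trade-off weight, the stated objective can be sketched as:

```latex
\max_{\phi} \; I(M; A) \;-\; \beta \, I(X; M)
```

Both mutual-information terms are intractable in general, so objectives of this form are typically optimized through variational upper and lower bounds; the exact bounds used here are the authors', not shown in this sketch.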
9
Gabler V, Wollherr D. Decentralized multi-agent reinforcement learning based on best-response policies. Front Robot AI 2024;11:1229026. PMID: 38690119. PMCID: PMC11059992. DOI: 10.3389/frobt.2024.1229026.
Abstract
Introduction: Multi-agent systems are an interdisciplinary research field concerned with multiple decision-making individuals interacting with a usually partially observable environment. Given recent advances in single-agent reinforcement learning (RL), multi-agent RL (MARL) has gained tremendous interest in recent years. Most research studies apply a fully centralized learning scheme to ease the transfer from the single-agent domain to multi-agent systems. Methods: In contrast, we claim that a decentralized learning scheme is preferable for real-world applications, as it allows deploying a learning algorithm on an individual robot rather than on a complete fleet of robots. We therefore outline a novel actor-critic (AC) approach tailored to cooperative MARL problems in sparsely rewarded domains. Our approach decouples the MARL problem into a set of distributed agents that model the other agents as responsive entities. In particular, we propose using two separate critics per agent to distinguish between the joint task reward and agent-based costs, as commonly applied in multi-robot planning. On the one hand, the agent-based critic intends to decrease agent-specific costs. On the other hand, each agent intends to optimize the joint team reward based on the joint task critic. As this critic still depends on the joint action of all agents, we outline two suitable behavior models based on Stackelberg games: a game against nature and a dyadic game against each agent. Following these behavior models, our algorithm allows fully decentralized execution and training. Results and Discussion: We evaluate the presented method with the proposed behavior models in a sparsely rewarded simulated multi-agent environment. Although our approach already outperforms state-of-the-art learners, we conclude by outlining possible extensions of our algorithm for future research to build upon.
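The two-critic idea, a joint task critic and an agent-based cost critic feeding a single policy update, can be sketched in a toy single-state setting. The fixed Q-values and the expected softmax policy-gradient step below are illustrative only, not the authors' Stackelberg formulation:

```python
import numpy as np

# Toy single-state agent with 3 actions and a softmax policy.
theta = np.zeros(3)
task_q = np.array([0.0, 1.0, 0.0])   # joint task critic (team reward)
cost_q = np.array([0.0, 0.0, 1.0])   # agent-based critic (individual cost)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

for _ in range(200):
    pi = softmax(theta)
    adv = task_q - cost_q   # maximize task reward while decreasing own cost
    # exact (expected) policy gradient for a softmax policy
    theta += 0.1 * ((np.diag(pi) - np.outer(pi, pi)) @ adv)

pi = softmax(theta)
```

The policy concentrates on the action that scores well with the task critic without incurring agent-specific cost, which is the balance the paper's per-agent critic pair is designed to strike.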
Affiliation(s)
- Volker Gabler, Chair of Automatic Control Engineering, TUM School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
10
Du C, Lu Y, Meng H, Park J. Evolution of cooperation on reinforcement-learning driven-adaptive networks. Chaos 2024;34:041101. PMID: 38558043. DOI: 10.1063/5.0201968.
Abstract
Complex networks are widespread in real-world environments across diverse domains, and real-world networks tend to form spontaneously through interactions between individual agents. Inspired by this, we design an evolutionary game model in which agents participate in a prisoner's dilemma game (PDG) with their neighboring agents. Agents can autonomously modify their connections with neighbors using reinforcement learning to avoid unfavorable environments. Interestingly, our findings reveal some remarkable results. Reinforcement learning-based adaptive networks improve cooperation compared with existing PDGs played on homogeneous networks. At the same time, the network's topology evolves from a homogeneous to a heterogeneous state. This change occurs as players gain experience from past games and become more astute in deciding whether to keep playing PDGs with their current neighbors or to disconnect from the least profitable neighbors, instead seeking more favorable environments by establishing connections with second-order neighbors offering higher rewards. By calculating the degree distribution and modularity of the adaptive network in the steady state, we confirm that it follows a power law and has a clear community structure, indicating that the adaptive network resembles real-world networks. Our study reports a new phenomenon in evolutionary game theory on networks and proposes a new way to generate scale-free networks: through the evolution of homogeneous networks rather than the typical mechanisms of network growth and preferential attachment. Our results provide new insight into network structure, the emergence of cooperation, and the behavior of actors in nature and society.
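The keep-or-rewire decision can be sketched as a tiny Q-learning bandit for one focal agent; the standard PD payoffs (sucker S = 0, mutual cooperation R = 3) and the bandit framing are illustrative, not the paper's full network model:

```python
import random

random.seed(1)

# A single focal agent repeatedly decides whether to KEEP playing a
# prisoner's dilemma with a defecting neighbour (sucker payoff S = 0)
# or REWIRE to a cooperative second-order neighbour (payoff R = 3).
KEEP, REWIRE = 0, 1
q = [0.0, 0.0]
alpha, eps = 0.1, 0.1   # learning rate and exploration rate

for _ in range(300):
    if random.random() < eps:
        a = random.randrange(2)                      # explore
    else:
        a = max((KEEP, REWIRE), key=lambda i: q[i])  # exploit
    payoff = 0.0 if a == KEEP else 3.0
    q[a] += alpha * (payoff - q[a])
```

Once many agents run this kind of rule in parallel, profitable agents accumulate links while unprofitable ones lose them, which is the mechanism behind the heterogeneous, scale-free steady state described above.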
Affiliation(s)
- Chunpeng Du, School of Mathematics, Kunming University, Kunming 650214, China
- Yikang Lu, School of Statistics and Mathematics, Yunnan University of Finance and Economics, Kunming, Yunnan 650221, China
- Haoran Meng, Technical Center, Shanghai Tobacco Group Co. Ltd., Shanghai 200120, China
- Junpyo Park, Department of Applied Mathematics, College of Applied Sciences, Kyung Hee University, Yongin 17104, Republic of Korea
11
Lussange J, Vrizzi S, Palminteri S, Gutkin B. Mesoscale effects of trader learning behaviors in financial markets: A multi-agent reinforcement learning study. PLoS One 2024;19:e0301141. PMID: 38557590. PMCID: PMC10984546. DOI: 10.1371/journal.pone.0301141.
Abstract
Recent advances in machine learning have yielded novel research perspectives in behavioural economics and financial market microstructure studies. In this paper we study the impact of individual trader learning characteristics on markets, using a stock market simulator designed with a multi-agent architecture. Each agent, representing an autonomous investor, trades stocks through reinforcement learning, using a centralized double-auction limit order book. This approach allows us to study, bottom-up, the impact of individual trader traits on the whole stock market at the mesoscale. We test three trader trait aspects: increased agent learning rates, herding behaviour, and random trading. As hypothesized, we find that larger learning rates significantly increase the number of crashes. We also find that herding behaviour undermines market stability, while random trading tends to preserve it.
Affiliation(s)
- Johann Lussange, Laboratoire des Neurosciences Cognitives, Département des Études Cognitives, INSERM U960, Paris, France
- Stefano Vrizzi, Laboratoire des Neurosciences Cognitives, Département des Études Cognitives, INSERM U960, Paris, France
- Stefano Palminteri, Laboratoire des Neurosciences Cognitives, Département des Études Cognitives, INSERM U960, Paris, France; Center for Cognition and Decision Making, Department of Psychology, NU University Higher School of Economics, Moscow, Russia
- Boris Gutkin, Laboratoire des Neurosciences Cognitives, Département des Études Cognitives, INSERM U960, Paris, France; Center for Cognition and Decision Making, Department of Psychology, NU University Higher School of Economics, Moscow, Russia
12
Negm A, Ma X, Aggidis G. Deep reinforcement learning challenges and opportunities for urban water systems. Water Res 2024;253:121145. PMID: 38330870. DOI: 10.1016/j.watres.2024.121145.
Abstract
The efficient and sustainable supply and transport of water is a key component of any functioning civilisation, making the role of urban water systems (UWSs) inherently crucial to the wellbeing of their customers. However, managing water is not a simple task. Whether through ageing infrastructure, transient flows, air cavities or low pressures, water can be lost to the many issues facing UWSs. The complexity of these networks grows with high urbanisation trends and climate change, leaving water companies and regulatory bodies in need of new solutions. It is therefore no surprise that many researchers are working to innovate within the water industry to ensure that the future of our water is safe. Deep reinforcement learning (DRL) has the potential to tackle previously intractable complexities, as it relies on deep neural networks for function approximation and representation. This technology has conquered many fields due to its impressive results and could effectively revolutionise UWSs. In this article, we explain the background of DRL and the milestones of the field using a novel taxonomy of DRL algorithms. This is followed by a novel review of DRL applications in UWSs, focusing on water distribution networks and stormwater systems. The review concludes with critical insights into how DRL can benefit different aspects of urban water systems.
Affiliation(s)
- Ahmed Negm, Lancaster University Energy Group, School of Engineering, Lancaster LA1 4YW, UK
- Xiandong Ma, Lancaster University Energy Group, School of Engineering, Lancaster LA1 4YW, UK
- George Aggidis, Lancaster University Energy Group, School of Engineering, Lancaster LA1 4YW, UK
13
Pina R, Silva VD, Hook J, Kondoz A. Residual Q-Networks for Value Function Factorizing in Multiagent Reinforcement Learning. IEEE Trans Neural Netw Learn Syst 2024;35:1534-1544. PMID: 35737605. DOI: 10.1109/tnnls.2022.3183865.
Abstract
Multiagent reinforcement learning (MARL) is useful in many problems that require the cooperation and coordination of multiple agents. Learning optimal policies using reinforcement learning in a multiagent setting can be very difficult as the number of agents increases. Recent solutions such as value decomposition networks (VDNs), QMIX, QTRAN, and QPLEX adhere to the centralized training and decentralized execution (CTDE) scheme and perform factorization of the joint action-value function. However, these methods still suffer from increased environmental complexity and at times fail to converge in a stable manner. We propose a novel concept of residual Q-networks (RQNs) for MARL, which learn to transform the individual Q-value trajectories in a way that preserves the individual-global-max (IGM) criterion but is more robust in factorizing action-value functions. The RQN acts as an auxiliary network that accelerates convergence and becomes obsolete as the agents reach the training objectives. The performance of the proposed method is compared against several state-of-the-art techniques, such as QPLEX, QMIX, QTRAN, and VDN, in a range of multiagent cooperative tasks. The results illustrate that the proposed method, in general, converges faster, with increased stability, and shows robust performance in a wider family of environments. The improvements are more prominent in environments with severe punishments for noncooperative behaviors, and especially in the absence of complete state information during training.
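The IGM criterion these factorizations preserve is easy to verify for the simplest additive (VDN-style) mixing, where the greedy joint action is guaranteed to equal the tuple of per-agent greedy actions. The Q-values below are arbitrary illustrative numbers:

```python
import numpy as np

# Two agents, 3 actions each. Additive (VDN-style) factorization:
# Q_tot(a1, a2) = Q_1(a1) + Q_2(a2).
q1 = np.array([0.2, 1.0, -0.5])
q2 = np.array([0.7, -0.1, 0.3])
q_tot = q1[:, None] + q2[None, :]

# IGM: the joint greedy action equals each agent's own greedy action,
# so decentralized execution recovers the centralized argmax.
joint_greedy = np.unravel_index(q_tot.argmax(), q_tot.shape)
individual_greedy = (q1.argmax(), q2.argmax())
```

Richer mixers such as QMIX and QPLEX relax additivity while still enforcing IGM; the RQN proposed here instead transforms the individual Q-value trajectories so that IGM survives harder factorizations.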
14
Jiang Q, Li J, Sun Y, Huang J, Zou R, Ma W, Guo H, Wang Z, Liu Y. Deep-reinforcement-learning-based water diversion strategy. Environ Sci Ecotechnol 2024;17:100298. PMID: 37554624. PMCID: PMC10405199. DOI: 10.1016/j.ese.2023.100298.
Abstract
Water diversion is a common strategy to enhance water quality in eutrophic lakes by increasing available water resources and accelerating nutrient circulation. Its effectiveness depends on changes in the source water and lake conditions. However, the challenge of optimizing water diversion remains because it is difficult to simultaneously improve lake water quality and minimize the amount of diverted water. Here, we propose a new approach called dynamic water diversion optimization (DWDO), which combines a comprehensive water quality model with a deep reinforcement learning algorithm. We applied DWDO to a region of Lake Dianchi, the largest eutrophic freshwater lake in China and validated it. Our results demonstrate that DWDO significantly reduced total nitrogen and total phosphorus concentrations in the lake by 7% and 6%, respectively, compared to previous operations. Additionally, annual water diversion decreased by an impressive 75%. Through interpretable machine learning, we identified the impact of meteorological indicators and the water quality of both the source water and the lake on optimal water diversion. We found that a single input variable could either increase or decrease water diversion, depending on its specific value, while multiple factors collectively influenced real-time adjustment of water diversion. Moreover, using well-designed hyperparameters, DWDO proved robust under different uncertainties in model parameters. The training time of the model is theoretically shorter than traditional simulation-optimization algorithms, highlighting its potential to support more effective decision-making in water quality management.
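The dual goal of improving lake quality while minimizing diverted water can be expressed as a scalar reward; the weights and concentrations below are invented for illustration, not the DWDO model's actual reward, though the relative improvements mirror the 7% TN, 6% TP, and 75% diversion reductions reported above:

```python
# Hypothetical DWDO-style reward: penalize lake nutrient concentrations
# (total nitrogen and phosphorus, mg/L) and the volume of diverted water,
# so the agent improves quality with as little diversion as possible.
def reward(tn_mg_l, tp_mg_l, diversion_m3, w_tn=1.0, w_tp=10.0, w_div=1e-8):
    return -(w_tn * tn_mg_l + w_tp * tp_mg_l + w_div * diversion_m3)

heavy = reward(tn_mg_l=2.0, tp_mg_l=0.10, diversion_m3=4.0e8)
smart = reward(tn_mg_l=1.86, tp_mg_l=0.094, diversion_m3=1.0e8)
```

Because all three quantities improve simultaneously, the "smart" policy scores strictly better under any choice of positive weights, which is what lets the DRL agent pursue both objectives at once.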
Affiliation(s)
- Qingsong Jiang
- State Environmental Protection Key Laboratory of All Materials Flux in River Ecosystems, College of Environmental Sciences and Engineering, Peking University, Beijing, 100871, PR China
- Jincheng Li
- State Environmental Protection Key Laboratory of All Materials Flux in River Ecosystems, College of Environmental Sciences and Engineering, Peking University, Beijing, 100871, PR China
- Yanxin Sun
- State Environmental Protection Key Laboratory of All Materials Flux in River Ecosystems, College of Environmental Sciences and Engineering, Peking University, Beijing, 100871, PR China
- Jilin Huang
- State Environmental Protection Key Laboratory of All Materials Flux in River Ecosystems, College of Environmental Sciences and Engineering, Peking University, Beijing, 100871, PR China
- Rui Zou
- Rays Computational Intelligence Lab, Beijing Inteliway Environmental Ltd., Beijing, 100085, PR China
- Wenjing Ma
- Rays Computational Intelligence Lab, Beijing Inteliway Environmental Ltd., Beijing, 100085, PR China
- Huaicheng Guo
- State Environmental Protection Key Laboratory of All Materials Flux in River Ecosystems, College of Environmental Sciences and Engineering, Peking University, Beijing, 100871, PR China
- Zhiyun Wang
- Yunnan Key Laboratory of Pollution Process and Management of Plateau Lake-Watershed, Yunnan Research Academy of Eco-environmental Sciences, Kunming, 650034, PR China
- Yong Liu
- State Environmental Protection Key Laboratory of All Materials Flux in River Ecosystems, College of Environmental Sciences and Engineering, Peking University, Beijing, 100871, PR China
15
Cai M, Wang Q, Qi Z, Jin D, Wu X, Xu T, Zhang L. Deep Reinforcement Learning Framework-Based Flow Rate Rejection Control of Soft Magnetic Miniature Robots. IEEE Transactions on Cybernetics 2023; 53:7699-7711. [PMID: 36070281 DOI: 10.1109/tcyb.2022.3199213]
Abstract
Soft magnetic miniature robots (SMMRs) have potential biomedical applications due to their flexible size and their mobility in confined environments. However, navigating a robot to a goal site with precise control and high repeatability in unstructured environments, especially under flow, remains a challenge. In this study, drawing inspiration from the control requirements of drug delivery and release at a goal lesion site in the presence of dynamic biofluids, we propose a flow rate rejection control strategy based on a deep reinforcement learning (DRL) framework to actuate an SMMR to achieve goal-reaching and hovering in fluidic tubes. To this end, an SMMR is first fabricated that can be operated by an external magnetic field to realize the desired functionalities. Subsequently, a simulator is constructed based on neural networks to map the relationship between the applied magnetic field and the robot's locomotion states. With minimal prior knowledge of the environment and dynamics, a gated recurrent unit (GRU)-based DRL algorithm is formulated that considers the designed history state-action and estimated flow rates. In addition, a randomization technique is applied during training to distill a general control policy for the physical SMMR. Numerical simulations and experiments demonstrate the robustness and efficacy of the presented control framework. Finally, in-depth analyses and discussions indicate the potential of DRL for soft magnetic robots in biomedical applications.
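The "history state-action" input mentioned above can be pictured as a sliding window that stacks the last few state-action pairs into one feature vector for a recurrent policy. The sketch below is purely illustrative (not the paper's GRU network); the class name, horizon, and dimensions are assumptions:

```python
from collections import deque

class HistoryStateAction:
    """Maintain a sliding window of the last `horizon` state-action pairs and
    flatten them into a single input vector for a recurrent policy network."""

    def __init__(self, horizon, state_dim, action_dim):
        pad = [0.0] * (state_dim + action_dim)   # zero-padding before any data arrives
        self.window = deque([pad] * horizon, maxlen=horizon)

    def push(self, state, action):
        # Appending past maxlen evicts the oldest pair automatically.
        self.window.append(list(state) + list(action))

    def features(self):
        # Flattened window, oldest pair first.
        return [x for pair in self.window for x in pair]

hist = HistoryStateAction(horizon=3, state_dim=2, action_dim=1)
hist.push([0.1, 0.2], [1.0])
feat = hist.features()  # length = 3 * (2 + 1) = 9, newest pair last
```

A GRU would consume the window pair-by-pair instead of flattened; the buffering logic is the same.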
16
Diaz MA, Vos M, Dillen A, Tassignon B, Flynn L, Geeroms J, Meeusen R, Verstraten T, Babic J, Beckerle P, De Pauw K. Human-in-the-Loop Optimization of Wearable Robotic Devices to Improve Human-Robot Interaction: A Systematic Review. IEEE Transactions on Cybernetics 2023; 53:7483-7496. [PMID: 37015459 DOI: 10.1109/tcyb.2022.3224895]
Abstract
This article presents a systematic review of wearable robotic devices that use human-in-the-loop optimization (HILO) strategies to improve human-robot interaction. A total of 46 HILO studies were identified and divided into upper- and lower-limb robotic devices. The main aspects of HILO were identified, reviewed, and classified into four areas: 1) human-machine systems; 2) optimization methods; 3) control strategies; and 4) experimental protocols. A variety of objective functions (physiological, biomechanical, and subjective), optimization strategies, and optimized control-parameter configurations used in different control strategies are presented and analyzed. An overview of experimental protocols is provided, including the metrics, tasks, and conditions tested. Moreover, the relevance given to training or adaptation periods is explored. We outline an HILO framework that encompasses current wearable robots, optimization strategies, objective functions, control strategies, and experimental protocols. We conclude by highlighting current research gaps and defining future directions to improve the development of advanced HILO strategies in upper- and lower-limb wearable robots.
17
Croll HC, Ikuma K, Ong SK, Sarkar S. Systematic Performance Evaluation of Reinforcement Learning Algorithms Applied to Wastewater Treatment Control Optimization. Environmental Science & Technology 2023; 57:18382-18390. [PMID: 37405782 DOI: 10.1021/acs.est.3c00353]
Abstract
Treatment of wastewater using activated sludge relies on several complex, nonlinear processes. While activated sludge systems can provide high levels of treatment, including nutrient removal, operating these systems is often challenging and energy intensive. Significant research investment has been made in recent years into improving control optimization of such systems, through both domain knowledge and, more recently, machine learning. This study leverages a novel interface between a common process modeling software and a Python reinforcement learning environment to evaluate four common reinforcement learning algorithms for their ability to minimize treatment energy use while maintaining effluent compliance within the Benchmark Simulation Model No. 1 (BSM1) simulation. Three of the algorithms tested, deep Q-learning, proximal policy optimization, and synchronous advantage actor-critic, generally performed poorly over the scenarios tested in this study. In contrast, the twin delayed deep deterministic policy gradient (TD3) algorithm consistently produced a high level of control optimization while maintaining the treatment requirements. Under the best selection of state observation features, TD3 control optimization reduced aeration and pumping energy requirements by 14.3% compared to the BSM1 benchmark control, outperforming the advanced domain-based strategy of ammonia-based aeration control, although future work is necessary to improve the robustness of the RL implementation.
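For reference, the core of the TD3 algorithm credited above, a clipped double-Q bootstrap target plus target-policy smoothing, reduces to a few lines. This is a generic sketch of the standard algorithm, not the paper's BSM1 controller; all names and numbers are placeholders:

```python
import random

def td3_target(r, gamma, q1_next, q2_next, done):
    """Clipped double-Q target: y = r + gamma * min(Q1', Q2'),
    taking the minimum of the two target critics to curb overestimation."""
    if done:
        return r
    return r + gamma * min(q1_next, q2_next)

def smoothed_action(mu, sigma=0.2, clip=0.5, lo=-1.0, hi=1.0):
    """Target-policy smoothing: add clipped Gaussian noise to the target action."""
    eps = max(-clip, min(clip, random.gauss(0.0, sigma)))
    return max(lo, min(hi, mu + eps))

# y = 1.0 + 0.99 * min(10.0, 12.0) = 10.9
y = td3_target(r=1.0, gamma=0.99, q1_next=10.0, q2_next=12.0, done=False)
```

The third TD3 ingredient, delayed actor updates, is a scheduling choice (update the policy every d critic steps) rather than a formula.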
Affiliation(s)
- Henry C Croll
- Department of Civil, Construction, and Environmental Engineering, Iowa State University, Ames, Iowa 50011, United States
- Kaoru Ikuma
- Department of Civil, Construction, and Environmental Engineering, Iowa State University, Ames, Iowa 50011, United States
- Say Kee Ong
- Department of Civil, Construction, and Environmental Engineering, Iowa State University, Ames, Iowa 50011, United States
- Soumik Sarkar
- Department of Mechanical Engineering, Iowa State University, Ames, Iowa 50011, United States
18
Guo W, Lv C, Guo M, Zhao Q, Yin X, Zhang L. Innovative applications of artificial intelligence in zoonotic disease management. Science in One Health 2023; 2:100045. [PMID: 39077042 PMCID: PMC11262289 DOI: 10.1016/j.soh.2023.100045]
Abstract
Zoonotic diseases, transmitted between humans and animals, pose a substantial threat to global public health. In recent years, artificial intelligence (AI) has emerged as a transformative tool in the fight against diseases. This comprehensive review discusses the innovative applications of AI in the management of zoonotic diseases, including disease prediction, early diagnosis, drug development, and future prospects. AI-driven predictive models leverage extensive datasets to predict disease outbreaks and transmission patterns, thereby facilitating proactive public health responses. Early diagnosis benefits from AI-powered diagnostic tools that expedite pathogen identification and containment. Furthermore, AI technologies have accelerated drug discovery by identifying potential drug targets and optimizing candidate drugs. This review addresses these advancements, while also examining the promising future of AI in zoonotic disease control. We emphasize the pivotal role of AI in revolutionizing our approach to managing zoonotic diseases and highlight its potential to safeguard the health of both humans and animals on a global scale.
Affiliation(s)
- Wenqiang Guo
- Department of Animal Nutrition and Feed Science, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Chenrui Lv
- Department of Animal Nutrition and Feed Science, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Meng Guo
- College of Veterinary Medicine, Henan Agricultural University, Zhengzhou 450046, China
- Qiwei Zhao
- Department of Animal Nutrition and Feed Science, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Xinyi Yin
- Department of Animal Nutrition and Feed Science, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Li Zhang
- Department of Animal Nutrition and Feed Science, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
19
Liu K, Zhang H, Zhang Y, Sun C. False Data-Injection Attack Detection in Cyber-Physical Systems With Unknown Parameters: A Deep Reinforcement Learning Approach. IEEE Transactions on Cybernetics 2023; 53:7115-7125. [PMID: 37015355 DOI: 10.1109/tcyb.2022.3225236]
Abstract
This article studies the detection of discontinuous false data-injection (FDI) attacks on cyber-physical systems (CPSs). Because the stochastic properties of the process noise and measurement noise are unknown, deep reinforcement learning is applied to the design of an FDI attack detector. First, the discontinuous attack detection problem is modeled as a partially observable Markov decision process (POMDP), and a neural network is used to explore the POMDP. In the network, sliding observation windows composed of offline fragments of historical data are used as the input. An approach to designing the POMDP reward is provided to preserve detection precision even when some state-recognition errors occur. Second, sufficient conditions on attack frequency and duration that guarantee the applicability of the detector and the expected estimation performance are given. Finally, simulation examples illustrate the effectiveness of the attack detector.
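The sliding-window input can be illustrated with a toy detector: chop the measurement history into overlapping windows and flag those whose mean residual from a nominal value is large, as an injected bias would cause. This is a hypothetical threshold detector for illustration only, not the paper's neural POMDP detector; the nominal value and threshold are assumptions:

```python
def sliding_windows(measurements, width):
    """Overlapping fragments of historical sensor data, served as detector input."""
    return [measurements[i:i + width] for i in range(len(measurements) - width + 1)]

def flag_window(window, nominal, threshold):
    """Flag a window whose mean residual from the nominal value is too large,
    which is how a constant injected bias surfaces in the measurements."""
    residual = abs(sum(window) / len(window) - nominal)
    return residual > threshold

data = [0.0, 0.1, -0.1, 0.05, 2.0, 2.1, 1.9, 2.05]  # bias injected halfway through
flags = [flag_window(w, nominal=0.0, threshold=0.5) for w in sliding_windows(data, 4)]
```

A learned detector replaces the fixed threshold with a policy that maps each window to an "attack / no attack" decision.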
20
Wang X, Yang Z, Bai X, Ji M, Li H, Ran D. A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader-Follower Tracking Problem. Sensors (Basel) 2023; 23:8814. [PMID: 37960514 PMCID: PMC10650083 DOI: 10.3390/s23218814]
Abstract
A single UAV has limited capabilities for complex missions, so suitable solutions are needed to improve the mission success rate as well as the UAVs' survivability. A cooperative multi-UAV formation offers great advantages in this regard; however, for large and complex systems, traditional control methods fail when faced with unstable and changing environments. To address the poor self-adaptability of traditional control methods for a multi-UAV cluster, and their heavy requirements on environmental state information, this paper proposes a consistent round-up strategy based on PPO path optimization to track targets. In this strategy, the leader is trained using PPO for obstacle avoidance and target tracking, while the followers establish a communication network with the leader to obtain environmental information. In this way, the tracking control law can be designed, based on the consistency protocol and the Apollonius circle, to realize round-up of the target together with obstacle avoidance. The experimental results show that the proposed strategy can achieve round-up of the target UAV and guide the pursuing multi-UAV group around obstacles even without initial detection of the target. In multiple simulated scenarios, the success rate of the pursuing multi-UAV cluster in rounding up the target remains above 80%.
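The Apollonius (Apollonian) circle used in such round-up laws is the locus of points that pursuer and evader can reach simultaneously; for pursuer position p, evader position e, and speed ratio k = v_p / v_e (k != 1) it has a closed form. This sketches the standard geometric construction, not the paper's control law:

```python
def apollonius_circle(p, e, k):
    """Locus of points X with |X - p| = k * |X - e|, k != 1.
    p: pursuer position, e: evader position, k: speed ratio v_p / v_e.
    Center c = (p - k^2 e) / (1 - k^2), radius r = k * |p - e| / |1 - k^2|."""
    px, py = p
    ex, ey = e
    k2 = k * k
    denom = 1.0 - k2
    cx = (px - k2 * ex) / denom
    cy = (py - k2 * ey) / denom
    d = ((px - ex) ** 2 + (py - ey) ** 2) ** 0.5
    return (cx, cy), k * d / abs(denom)

# Slow pursuer (half the evader's speed) one unit away from the evader:
center, radius = apollonius_circle(p=(0.0, 0.0), e=(1.0, 0.0), k=0.5)
```

Spacing several pursuers so their Apollonius circles enclose the evader is the usual geometric encirclement condition.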
Affiliation(s)
- Xiao Wang
- College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
- Zhaohui Yang
- College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
- Xueqian Bai
- College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
- Mingjiang Ji
- National Innovation Institute of Defense Technology, Academy of Military Sciences, Beijing 100071, China
- Hao Li
- The Second Academy of CASIC, Beijing 100854, China
- Dechao Ran
- National Innovation Institute of Defense Technology, Academy of Military Sciences, Beijing 100071, China
21
Zhang R, Zong Q, Zhang X, Dou L, Tian B. Game of Drones: Multi-UAV Pursuit-Evasion Game With Online Motion Planning by Deep Reinforcement Learning. IEEE Transactions on Neural Networks and Learning Systems 2023; 34:7900-7909. [PMID: 35157597 DOI: 10.1109/tnnls.2022.3146976]
Abstract
As some of the smallest flying objects, unmanned aerial vehicles (UAVs) are often deployed as a "swarm" to execute missions. In this article, we investigate the multiquadcopter-versus-target pursuit-evasion game in an environment with obstacles. For high-quality simulation of the urban environment, we propose the pursuit-evasion scenario (PES) framework to create the environment with a physics engine, which enables quadcopter agents to take actions and interact with the environment. On this basis, we construct a multiagent coronal bidirectionally coordinated with target prediction network (CBC-TP Net) with a vectorized extension of the multiagent deep deterministic policy gradient (MADDPG) formulation to ensure the effectiveness of a damaged "swarm" system in the pursuit-evasion mission. Unlike traditional reinforcement learning, we innovatively design a target prediction network (TP Net) within the common framework to imitate the way humans think: situation prediction always precedes decision-making. Pursuit-evasion experiments verify the state-of-the-art performance of the proposed strategy in both normal and antidamaged situations.
22
Li S, Tang Z, Yang L, Li M, Shang Z. Application of deep reinforcement learning for spike sorting under multi-class imbalance. Comput Biol Med 2023; 164:107253. [PMID: 37536094 DOI: 10.1016/j.compbiomed.2023.107253]
Abstract
Spike sorting is the basis for analyzing spike firing patterns encoded in high-dimensional information spaces. Because high-density microelectrode arrays record multiple neurons simultaneously, the collected data often suffer from two problems: a few overlapping spikes and different neuronal firing rates, both of which belong to the multi-class imbalance problem. Since deep reinforcement learning (DRL) can assign targeted attention to categories through reward functions, we propose ImbSorter to implement spike sorting under multi-class imbalance. We describe spike sorting as a Markov sequential decision and construct a dynamic reward function (DRF) that improves the agent's sensitivity to minor classes based on the inter-class imbalance ratios. The agent is eventually guided by the optimal strategy to classify spikes. We consider the Wave_Clus dataset, which contains overlapping spikes and diverse noise levels, and the macaque dataset, which has a multi-scale imbalance. ImbSorter is compared with classical DRL architectures, traditional machine learning algorithms, and advanced overlapping spike sorting techniques on these two datasets. ImbSorter obtained improved results in Macro_F1 and shows a promising ability to resist overlapping and noise interference, with high stability and strong performance in processing spikes with different degrees of skewed distribution.
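A reward shaped by inter-class imbalance ratios can be sketched as per-class weights inversely proportional to class frequency, so rare classes earn larger rewards (and larger penalties when missed). This is a minimal illustration of the principle, not the paper's DRF; the normalization against the largest class is an assumption:

```python
def imbalance_rewards(class_counts):
    """Per-class reward weights from inter-class imbalance ratios:
    the largest class gets weight 1.0, rarer classes proportionally more."""
    n_max = max(class_counts.values())
    return {c: n_max / n for c, n in class_counts.items()}

def step_reward(pred, label, weights):
    """+w for correctly predicting class `label`, -w for a miss."""
    w = weights[label]
    return w if pred == label else -w

# 100 spikes of class 0 vs 10 of class 1: class 1 is weighted 10x.
weights = imbalance_rewards({0: 100, 1: 10})
```

Under such a scheme the agent cannot maximize return by simply predicting the majority class, which is the failure mode a DRF is designed to avoid.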
Affiliation(s)
- Suchen Li
- School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou, 450001, China; Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, Zhengzhou, 450001, China
- Zhuo Tang
- School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou, 450001, China; Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, Zhengzhou, 450001, China
- Lifang Yang
- School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou, 450001, China; Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, Zhengzhou, 450001, China
- Mengmeng Li
- School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou, 450001, China; Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, Zhengzhou, 450001, China
- Zhigang Shang
- School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou, 450001, China; Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, Zhengzhou, 450001, China
23
Zhang J, Zhou X, Zhou J, Qiu S, Liang G, Cai S, Bao G. A High-Efficient Reinforcement Learning Approach for Dexterous Manipulation. Biomimetics (Basel) 2023; 8:264. [PMID: 37366859 DOI: 10.3390/biomimetics8020264]
Abstract
Robotic hands have the potential to perform complex tasks in unstructured environments owing to their bionic design, inspired by the most agile biological hand. However, the modeling, planning, and control of dexterous hands remain unresolved, open challenges, resulting in the simple movements and relatively clumsy motions of current robotic end effectors. This paper proposes a dynamic model based on a generative adversarial architecture to learn the state mode of the dexterous hand, reducing the model's prediction error over long horizons. An adaptive trajectory planning kernel is also developed to generate High-Value Area Trajectory (HVAT) data according to the control task and the dynamic model, with adaptive trajectory adjustment achieved by changing the Levenberg-Marquardt (LM) coefficient and the linear searching coefficient. Furthermore, an improved Soft Actor-Critic (SAC) algorithm is designed by combining maximum-entropy value iteration with HVAT value iteration. An experimental platform and a simulation program were built to verify the proposed method on two manipulation tasks. The experimental results indicate that the proposed dexterous-hand reinforcement learning algorithm has better training efficiency and requires fewer training samples to achieve satisfactory learning and control performance.
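The maximum-entropy ingredient of SAC enters through the soft state value, in which the policy's log-probability is subtracted from the Q-value so that exploration is rewarded alongside return. The sketch below is the standard SAC quantity for a discrete policy, not the paper's HVAT-augmented variant; the toy numbers are placeholders:

```python
import math

def soft_state_value(qs, probs, alpha):
    """Soft (entropy-regularized) state value used in SAC-style targets:
    V(s) = sum_a pi(a|s) * (Q(s, a) - alpha * log pi(a|s)).
    alpha is the temperature trading off return against entropy."""
    return sum(p * (q - alpha * math.log(p)) for q, p in zip(qs, probs) if p > 0)

# Uniform two-action policy with equal Q-values: V = 1.0 + entropy = 1.0 + ln 2
v = soft_state_value(qs=[1.0, 1.0], probs=[0.5, 0.5], alpha=1.0)
```

Setting alpha = 0 recovers the ordinary expected value, i.e. standard actor-critic without the entropy bonus.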
Affiliation(s)
- Jianhua Zhang
- College of Mechanical Engineering, Beijing University of Science and Technology, Beijing 100083, China
- Xuanyi Zhou
- College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310023, China
- Jinyu Zhou
- College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310023, China
- Shiming Qiu
- College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310023, China
- Guoyuan Liang
- Guangdong Provincial Key Laboratory of Robotics and Intelligent System, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- Shibo Cai
- College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310023, China
- Guanjun Bao
- College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310023, China
24
Han H, Wang J, Kuang L, Han X, Xue H. Improved Robot Path Planning Method Based on Deep Reinforcement Learning. Sensors (Basel) 2023; 23:5622. [PMID: 37420785 DOI: 10.3390/s23125622]
Abstract
With the advancement of robotics, the field of path planning is currently experiencing a period of prosperity. Researchers strive to address this nonlinear problem and have achieved remarkable results through the implementation of the Deep Reinforcement Learning (DRL) algorithm DQN (Deep Q-Network). However, persistent challenges remain, including the curse of dimensionality, the difficulty of model convergence, and sparse rewards. To tackle these problems, this paper proposes an enhanced DDQN (Double DQN) path planning approach, in which the information after dimensionality reduction is fed into a two-branch network that incorporates expert knowledge, and an optimized reward function guides the training process. The data generated during the training phase are first discretized into corresponding low-dimensional spaces. An "expert experience" module is introduced into the epsilon-greedy algorithm to accelerate the model's early-stage training. To tackle navigation and obstacle avoidance separately, a dual-branch network structure is presented. We further optimize the reward function, enabling intelligent agents to receive prompt feedback from the environment after performing each action. Experiments conducted in both virtual and real-world environments demonstrate that the enhanced algorithm accelerates model convergence, improves training stability, and generates a smooth, shorter, and collision-free path.
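The Double DQN update underlying this approach decouples action selection (online network) from action evaluation (target network), which curbs the overestimation bias of vanilla DQN. This is a generic sketch of the standard DDQN target, not the authors' dual-branch model; the Q-values are toy numbers:

```python
def ddqn_target(r, gamma, q_online_next, q_target_next, done):
    """Double DQN target: y = r + gamma * Q_target(s', argmax_a Q_online(s', a)).
    The online net chooses the action; the frozen target net scores it."""
    if done:
        return r
    a_star = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return r + gamma * q_target_next[a_star]

# Online net prefers action 1, but the target net values it at only 2.0:
y = ddqn_target(r=0.0, gamma=0.9, q_online_next=[1.0, 3.0], q_target_next=[5.0, 2.0], done=False)
```

Vanilla DQN would instead bootstrap from max(q_target_next) = 5.0 here, illustrating the overestimation DDQN avoids.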
Affiliation(s)
- Huiyan Han
- School of Computer Science and Technology, North University of China, Taiyuan 030051, China
- Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan 030051, China
- Shanxi Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan 030051, China
- Jiaqi Wang
- School of Computer Science and Technology, North University of China, Taiyuan 030051, China
- Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan 030051, China
- Shanxi Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan 030051, China
- Liqun Kuang
- School of Computer Science and Technology, North University of China, Taiyuan 030051, China
- Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan 030051, China
- Shanxi Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan 030051, China
- Xie Han
- School of Computer Science and Technology, North University of China, Taiyuan 030051, China
- Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan 030051, China
- Shanxi Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan 030051, China
- Hongxin Xue
- School of Computer Science and Technology, North University of China, Taiyuan 030051, China
- Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan 030051, China
- Shanxi Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan 030051, China
25
Yadav P, Mishra A, Kim S. A Comprehensive Survey on Multi-Agent Reinforcement Learning for Connected and Automated Vehicles. Sensors (Basel) 2023; 23:4710. [PMID: 37430623 DOI: 10.3390/s23104710]
Abstract
Connected and automated vehicles (CAVs) require multiple tasks in their seamless maneuverings. Some essential tasks that require simultaneous management and actions are motion planning, traffic prediction, traffic intersection management, etc. A few of them are complex in nature. Multi-agent reinforcement learning (MARL) can solve complex problems involving simultaneous controls. Recently, many researchers applied MARL in such applications. However, there is a lack of extensive surveys on the ongoing research to identify the current problems, proposed methods, and future research directions in MARL for CAVs. This paper provides a comprehensive survey on MARL for CAVs. A classification-based paper analysis is performed to identify the current developments and highlight the various existing research directions. Finally, the challenges in current works are discussed, and some potential areas are given for exploration to overcome those challenges. Future readers will benefit from this survey and can apply the ideas and findings in their research to solve complex problems.
Affiliation(s)
- Pamul Yadav
- School of Integrated Technology, Yonsei University, Incheon 21983, Republic of Korea
- Ashutosh Mishra
- School of Integrated Technology, Yonsei University, Incheon 21983, Republic of Korea
- Shiho Kim
- School of Integrated Technology, Yonsei University, Incheon 21983, Republic of Korea
26
Kwa HL, Kit JL, Horsevad N, Philippot J, Savari M, Bouffanais R. Adaptivity: a path towards general swarm intelligence? Front Robot AI 2023; 10:1163185. [PMID: 37228356 PMCID: PMC10203170 DOI: 10.3389/frobt.2023.1163185]
Abstract
The field of multi-robot systems (MRS) has recently been gaining popularity among research groups, practitioners, and a wide range of industries. Compared to single-robot systems, multi-robot systems can perform tasks more efficiently or accomplish objectives that are simply not feasible with a single unit. This makes them ideal candidates for carrying out distributed tasks in large environments, e.g., object retrieval, mapping, or surveillance. However, the traditional approach to multi-robot systems, using global planning and centralized operation, is in general ill-suited to fulfilling tasks in unstructured and dynamic environments. Swarming multi-robot systems have been proposed to deal with such steep challenges, primarily owing to their adaptivity: the system's ability to learn or change its behavior in response to new and/or evolving operating conditions. Given its importance, in this perspective we focus on the critical role of adaptivity in effective multi-robot system swarming and use it as the basis for defining, and potentially quantifying, swarm intelligence. In addition, we highlight the importance of establishing a suite of benchmark tests to measure a swarm's level of adaptivity. We believe that pursuing increased levels of swarm intelligence through a focus on adaptivity will further elevate the field of swarm robotics.
Affiliation(s)
- Hian Lee Kwa
- Thales Research and Technology, Singapore, Singapore
- Jabez Leong Kit
- Engineering Product Design, Singapore University of Technology and Design, Singapore, Singapore
- Nikolaj Horsevad
- Mechanical Engineering, University of Ottawa, Ottawa, ON, Canada
- Julien Philippot
- Mechanical Engineering, University of Ottawa, Ottawa, ON, Canada
- Mohammad Savari
- Mechanical Engineering, University of Ottawa, Ottawa, ON, Canada
27
Liu S, Feng Y, Wu K, Cheng G, Huang J, Liu Z. Graph-Attention-Based Causal Discovery With Trust Region-Navigated Clipping Policy Optimization. IEEE Transactions on Cybernetics 2023; 53:2311-2324. [PMID: 34665751 DOI: 10.1109/tcyb.2021.3116762]
Abstract
In many domains of the empirical sciences, discovering the causal structure within variables remains an indispensable task. Recently, to tackle the unoriented edges or latent-assumption violations suffered by conventional methods, researchers formulated a reinforcement learning (RL) procedure for causal discovery and equipped it with a REINFORCE algorithm to search for the best-rewarded directed acyclic graph. The two keys to the overall performance of the procedure are the robustness of the RL method and the efficient encoding of variables. However, on the one hand, REINFORCE is prone to local convergence and unstable performance during training. Neither trust region policy optimization, being computationally expensive, nor proximal policy optimization (PPO), suffering from aggregate constraint deviation, is a decent alternative for combinatorial optimization problems with considerable individual subactions. We propose a trust region-navigated clipping policy optimization method for causal discovery that guarantees both better search efficiency and steadiness in policy optimization, in comparison with REINFORCE, PPO, and our prioritized sampling-guided REINFORCE implementation. On the other hand, to boost the efficient encoding of variables, we propose a refined graph attention encoder called SDGAT that can grasp more feature information without prior neighborhood information. With these improvements, the proposed method outperforms the former RL method on both synthetic and benchmark datasets in terms of output results and optimization robustness.
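For contrast with the proposed trust-region-navigated clipping, the baseline PPO per-sample surrogate that the abstract criticizes clips the policy ratio into a fixed band. This is a standard textbook sketch, not the paper's method; the ratios and advantages are toy numbers:

```python
def ppo_clip_surrogate(ratio, advantage, eps=0.2):
    """PPO's per-sample clipped objective: min(r * A, clip(r, 1-eps, 1+eps) * A).
    The fixed eps clip range is exactly what a trust-region-navigated
    scheme would adapt per update instead of keeping constant."""
    clipped = max(1.0 - eps, min(1.0 + eps, ratio))
    return min(ratio * advantage, clipped * advantage)

gain = ppo_clip_surrogate(ratio=1.5, advantage=1.0)   # capped at 1.2 * 1.0
loss = ppo_clip_surrogate(ratio=0.5, advantage=-1.0)  # pessimistic branch: -0.8
```

Because the clip acts on each subaction's ratio independently, the summed deviation over many subactions can still be large, which is the "aggregate constraint deviation" the abstract refers to.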
28
Orr J, Dutta A. Multi-Agent Deep Reinforcement Learning for Multi-Robot Applications: A Survey. SENSORS (BASEL, SWITZERLAND) 2023; 23:3625. [PMID: 37050685 PMCID: PMC10098527 DOI: 10.3390/s23073625] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 03/22/2023] [Accepted: 03/28/2023] [Indexed: 06/19/2023]
Abstract
Deep reinforcement learning has produced many success stories in recent years, in fields including mathematics, games, health care, and robotics. In this paper, we are especially interested in multi-agent deep reinforcement learning, where multiple agents present in the environment learn not only from their own experiences but also from each other, and in its applications to multi-robot systems. In many real-world scenarios, one robot might not be enough to complete a given task on its own, so we may need to deploy multiple robots that work together towards the common global objective of finishing the task. Although multi-agent deep reinforcement learning and its applications in multi-robot systems are of tremendous significance from theoretical and applied standpoints, the latest survey in this domain dates to 2004 and covers only traditional learning applications, as deep reinforcement learning had not yet been invented. We classify the reviewed papers in our survey primarily based on their multi-robot applications. Our survey also discusses a few challenges that current research in this domain faces and provides a potential list of future applications involving multi-robot systems that can benefit from advances in multi-agent deep reinforcement learning.
29
Cooperative multi-agent target searching: a deep reinforcement learning approach based on parallel hindsight experience replay. COMPLEX INTELL SYST 2023. [DOI: 10.1007/s40747-023-00985-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2023]
Abstract
Multi-agent multi-target search strategies can be utilized in complex scenarios such as post-disaster search and rescue by unmanned aerial vehicles. To move beyond fixed targets and trajectories, current multi-agent multi-target search strategies are mainly based on deep reinforcement learning (DRL). However, agents trained by DRL tend to be brittle due to their sensitivity to the training environment, which makes the strategies they learn frequently fall into local optima, resulting in poor system robustness. Additionally, sparse rewards in DRL lead to problems such as difficulty in system convergence and low utilization efficiency of the sampled data. To address the weakened robustness of agents and the sparse rewards in the multi-target search environment, we propose a MiniMax Multi-agent Deep Deterministic Policy Gradient based on Parallel Hindsight Experience Replay (PHER-M3DDPG) algorithm, which adopts the framework of centralized training and decentralized execution in continuous action space. To enhance system robustness, the PHER-M3DDPG algorithm employs a minimax learning architecture, which adaptively adjusts the learning strategy of agents by introducing adversarial disturbances. In addition, to solve the sparse-rewards problem, the PHER-M3DDPG algorithm adopts a parallel hindsight experience replay mechanism that increases the efficiency of data utilization through virtual learning targets and batch processing of the sampled data. Simulation results show that the PHER-M3DDPG algorithm outperforms existing algorithms in terms of convergence speed and task completion time in a multi-target search environment.
30
Guan Y, Ren Y, Sun Q, Li SE, Ma H, Duan J, Dai Y, Cheng B. Integrated Decision and Control: Toward Interpretable and Computationally Efficient Driving Intelligence. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:859-873. [PMID: 35439160 DOI: 10.1109/tcyb.2022.3163816] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Decision and control are core functionalities of high-level automated vehicles. Current mainstream methods, such as functional decomposition and end-to-end reinforcement learning (RL), suffer from high time complexity or poor interpretability and adaptability on real-world autonomous driving tasks. In this article, we present an interpretable and computationally efficient framework called integrated decision and control (IDC) for automated vehicles, which decomposes the driving task into static path planning and dynamic optimal tracking that are structured hierarchically. First, the static path planning generates several candidate paths considering only static traffic elements. Then, the dynamic optimal tracking is designed to track the optimal path while considering the dynamic obstacles. To that end, we formulate a constrained optimal control problem (OCP) for each candidate path, optimize them separately, and follow the one with the best tracking performance. To unload the heavy online computation, we propose a model-based RL algorithm that can serve as an approximate constrained-OCP solver. Specifically, the OCPs for all paths are considered together to construct a single complete RL problem, which is then solved offline in the form of value and policy networks for real-time online path selecting and tracking, respectively. We verify our framework in both simulations and the real world. Results show that, compared with baseline methods, IDC has an order of magnitude higher online computing efficiency, as well as better driving performance, including traffic efficiency and safety. In addition, it yields great interpretability and adaptability across different driving scenarios and tasks.
31
Bai C, Wang L, Wang Y, Wang Z, Zhao R, Bai C, Liu P. Addressing Hindsight Bias in Multigoal Reinforcement Learning. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:392-405. [PMID: 34495860 DOI: 10.1109/tcyb.2021.3107202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Multigoal reinforcement learning (RL) extends typical RL with goal-conditioned value functions and policies. One efficient multigoal RL algorithm is hindsight experience replay (HER). By treating a hindsight goal from failed experiences as the original goal, HER enables the agent to receive rewards frequently. However, a key assumption of HER is that the hindsight goals do not change the likelihood of the sampled transitions and trajectories used in training, which, according to our analysis, is not the case. More specifically, we show that using hindsight goals changes this likelihood and results in a biased learning objective for multigoal RL. We analyze the hindsight bias due to this use of hindsight goals and propose bias-corrected HER (BHER), an efficient algorithm that corrects the hindsight bias in training. We further show that BHER outperforms several state-of-the-art multigoal RL approaches on challenging robotics tasks.
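For context, the relabelling step that HER performs (and whose effect on the sampling distribution this paper analyzes) can be sketched minimally as follows; the dictionary fields and the sparse reward are illustrative assumptions, not the paper's code:

```python
def her_relabel(trajectory, reward_fn):
    """Relabel a failed trajectory, taking the final achieved state as the
    hindsight goal and recomputing each step's reward against that goal."""
    hindsight_goal = trajectory[-1]["achieved"]
    relabelled = []
    for step in trajectory:
        relabelled.append({
            "obs": step["obs"],
            "action": step["action"],
            "goal": hindsight_goal,  # the original goal is replaced
            "reward": reward_fn(step["achieved"], hindsight_goal),
        })
    return relabelled
```

With a sparse goal-reaching reward (0 on success, -1 otherwise), the final step of every relabelled trajectory succeeds by construction, which is precisely why the relabelled data no longer follow the original sampling distribution.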
32
Explaining deep reinforcement learning decisions in complex multiagent settings: towards enabling automation in air traffic flow management. APPL INTELL 2023; 53:4063-4098. [PMID: 35694685 PMCID: PMC9169601 DOI: 10.1007/s10489-022-03605-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/03/2022] [Indexed: 02/04/2023]
Abstract
With the objective of enhancing human performance and maximizing engagement during the performance of tasks, we aim to advance automation for decision making in complex and large-scale multi-agent settings. Towards these goals, this paper presents a deep multi-agent reinforcement learning method for resolving demand-capacity imbalances in real-world Air Traffic Management settings with thousands of agents. The agents comprising the system are able to jointly decide on the measures to be applied to resolve imbalances, while providing explanations for their decisions; this information is rendered and explored via appropriate visual analytics tools. The paper presents how the major challenges of scalability and complexity are addressed, and provides results from evaluation tests that show the ability of the models to provide high-quality solutions and high-fidelity explanations.
33
A fractional filter based on reinforcement learning for effective tracking under impulsive noise. Neurocomputing 2023. [DOI: 10.1016/j.neucom.2022.10.038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
34
Huang H, Hu Z, Lu Z, Wen X. Network-Scale Traffic Signal Control via Multiagent Reinforcement Learning With Deep Spatiotemporal Attentive Network. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:262-274. [PMID: 34343099 DOI: 10.1109/tcyb.2021.3087228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The continuous development of intelligent traffic control systems has a profound influence on urban traffic planning and traffic management. Indeed, as big data and artificial intelligence continue to evolve, the traffic control strategy based on deep reinforcement learning (RL) has been proven to be a promising method to improve the efficiency of intersections and save people's travel time. However, the existing algorithms ignore the temporal and spatial characteristics of intersections. In this article, we propose a multiagent RL based on the deep spatiotemporal attentive neural network (MARL-DSTAN) to determine the traffic signal timing in a large-scale road network. In this model, the state information captures the spatial dependency of the entire road network by leveraging the graph convolutional network (GCN) and integrates the information based on the importance of intersections via the attention mechanism. Meanwhile, to accumulate more valuable samples and enhance the learning efficiency, the recurrent neural network (RNN) is introduced in the exploration stage to constrain the action search space instead of fully random exploration. MARL-DSTAN decomposes the large-scale area into multiple base environments, and the agents in each base environment use the idea of "centralized training and decentralized execution" to learn to accelerate the algorithm convergence. The simulation results show that our algorithm significantly outperforms the fixed timing scheme and several other state-of-the-art baseline RL algorithms.
35
Learning multi-agent coordination through connectivity-driven communication. Mach Learn 2022. [DOI: 10.1007/s10994-022-06286-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
In artificial multi-agent systems, the ability to learn collaborative policies is predicated upon the agents' communication skills: they must be able to encode the information received from the environment and learn how to share it with other agents as required by the task at hand. We present a deep reinforcement learning approach, Connectivity Driven Communication (CDC), that facilitates the emergence of multi-agent collaborative behaviour through experience alone. The agents are modelled as nodes of a weighted graph whose state-dependent edges encode pair-wise messages that can be exchanged. We introduce a graph-dependent attention mechanism that controls how the agents' incoming messages are weighted. This mechanism takes full account of the current state of the system as represented by the graph, and builds upon a diffusion process that captures how information flows on the graph. The graph topology is not assumed to be known a priori, but depends dynamically on the agents' observations and is learnt concurrently with the attention mechanism and policy in an end-to-end fashion. Our empirical results show that CDC is able to learn effective collaborative policies and can outperform competing learning algorithms on cooperative navigation tasks.
36
Twin attentive deep reinforcement learning for multi-agent defensive convoy. INT J MACH LEARN CYB 2022. [DOI: 10.1007/s13042-022-01759-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
37
Li J, Ma Y, Gao R, Cao Z, Lim A, Song W, Zhang J. Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:13572-13585. [PMID: 34554923 DOI: 10.1109/tcyb.2021.3111082] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Existing deep reinforcement learning (DRL)-based methods for solving the capacitated vehicle routing problem (CVRP) intrinsically cope with a homogeneous vehicle fleet, in which the fleet is assumed as repetitions of a single vehicle. Hence, their key to construct a solution solely lies in the selection of the next node (customer) to visit excluding the selection of vehicle. However, vehicles in real-world scenarios are likely to be heterogeneous with different characteristics that affect their capacity (or travel speed), rendering existing DRL methods less effective. In this article, we tackle heterogeneous CVRP (HCVRP), where vehicles are mainly characterized by different capacities. We consider both min-max and min-sum objectives for HCVRP, which aim to minimize the longest or total travel time of the vehicle(s) in the fleet. To solve those problems, we propose a DRL method based on the attention mechanism with a vehicle selection decoder accounting for the heterogeneous fleet constraint and a node selection decoder accounting for the route construction, which learns to construct a solution by automatically selecting both a vehicle and a node for this vehicle at each step. Experimental results based on randomly generated instances show that, with desirable generalization to various problem sizes, our method outperforms the state-of-the-art DRL method and most of the conventional heuristics, and also delivers competitive performance against the state-of-the-art heuristic method, that is, slack induction by string removal. In addition, the results of extended experiments demonstrate that our method is also able to solve CVRPLib instances with satisfactory performance.
38
Ji Z, Chen C, He J, Zhu S, Guan X. Edge Sensing and Control Co-Design for Industrial Cyber-Physical Systems: Observability Guaranteed Method. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:13350-13362. [PMID: 34343098 DOI: 10.1109/tcyb.2021.3079149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The new generation of industrial cyber-physical systems (ICPSs), supported by edge computing technology, facilitates the deep integration of sensing and control. System observability is the key factor characterizing the internal relationship between the two. In most existing works, observability is taken as an assumption for subsequent sensing and control design. In fact, however, as the network scale gradually expands, this assumption becomes more difficult to satisfy directly in the sensing design. To address this problem, we propose the observability guaranteed method (OGM) for edge sensing and control co-design. Specifically, the nonconvex observability condition is transformed into a convex range of key parameters of the sensing strategy based on graph signal processing (GSP) technology. Then, we establish the relationship between these parameters and control performance. In OGM, beyond the usual design from sensing to control, we reversely adjust the sensing design to meet control demands while satisfying observability. Finally, our algorithm is applied to the hot rolling laminar cooling process in a semiphysical evaluation, and the results verify its effectiveness.
39
Bahrpeyma F, Reichelt D. A review of the applications of multi-agent reinforcement learning in smart factories. Front Robot AI 2022; 9:1027340. [DOI: 10.3389/frobt.2022.1027340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 11/08/2022] [Indexed: 12/04/2022] Open
Abstract
The smart factory is at the heart of Industry 4.0 and is the new paradigm for establishing advanced manufacturing systems and realizing modern manufacturing objectives such as mass customization, automation, efficiency, and self-organization all at once. Such manufacturing systems, however, are characterized by dynamic and complex environments where a large number of decisions should be made for smart components such as production machines and the material handling system in a real-time and optimal manner. AI offers key intelligent control approaches in order to realize efficiency, agility, and automation all at once. One of the most challenging problems faced in this regard is uncertainty, meaning that due to the dynamic nature of the smart manufacturing environments, sudden seen or unseen events occur that should be handled in real-time. Due to the complexity and high-dimensionality of smart factories, it is not possible to predict all the possible events or prepare appropriate scenarios to respond. Reinforcement learning is an AI technique that provides the intelligent control processes needed to deal with such uncertainties. Due to the distributed nature of smart factories and the presence of multiple decision-making components, multi-agent reinforcement learning (MARL) should be incorporated instead of single-agent reinforcement learning (SARL), which, due to the complexities involved in the development process, has attracted less attention. In this research, we will review the literature on the applications of MARL to tasks within a smart factory and then demonstrate a mapping connecting smart factory attributes to the equivalent MARL features, based on which we suggest MARL to be one of the most effective approaches for implementing the control mechanism for smart factories.
40
Zhu Y, Pang JH, Gao T, Tian FB. Learning to school in dense configurations with multi-agent deep reinforcement learning. BIOINSPIRATION & BIOMIMETICS 2022; 18:015003. [PMID: 36322983 DOI: 10.1088/1748-3190/ac9fb5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Accepted: 11/01/2022] [Indexed: 06/16/2023]
Abstract
Fish are observed to school in different configurations. However, how and why fish maintain a stable schooling formation remains unclear. This work presents a numerical study of the dense schooling of two free swimmers using a hybrid method combining multi-agent deep reinforcement learning with the immersed boundary-lattice Boltzmann method. Active control policies are developed by synchronously training the leader to swim at a given speed and orientation and the follower to hold close proximity to the leader. After training, the swimmers can resist the strong hydrodynamic force to remain in stable formations and meanwhile swim along the desired path, using only their tail-beat flapping. The tail movements of the swimmers in the stable formations are irregular and asymmetrical, indicating that the swimmers are carefully adjusting their body kinematics to balance the hydrodynamic force. In addition, a significant decrease in the mean amplitude and the cost of transport is found for the followers, indicating that these swimmers can maintain the swimming speed with less effort. The results also show that the side-by-side formation is hydrodynamically more stable but energetically less efficient than other configurations, while the full-body staggered formation is energetically more efficient as a whole.
Affiliation(s)
- Yi Zhu
- Ocean Intelligence Technology Center, Shenzhen Institute of Guangdong Ocean University, Shenzhen, Guangdong 518055, People's Republic of China
- Jian-Hua Pang
- Ocean Intelligence Technology Center, Shenzhen Institute of Guangdong Ocean University, Shenzhen, Guangdong 518055, People's Republic of China
- College of Ocean Engineering, Guangdong Ocean University, Zhanjiang, Guangdong 524088, People's Republic of China
- Tong Gao
- Department of Mechanical Engineering, Michigan State University, East Lansing, MI 48864, United States of America
- Fang-Bao Tian
- School of Engineering and Information Technology, University of New South Wales, Canberra, ACT 2600, Australia
41
Sun Q, Yao Y, Yi P, Hu Y, Yang Z, Yang G, Zhou X. Learning controlled and targeted communication with the centralized critic for the multi-agent system. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04225-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
42
Wong A, Bäck T, Kononova AV, Plaat A. Deep multiagent reinforcement learning: challenges and directions. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10299-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
This paper surveys the field of deep multiagent reinforcement learning (RL). The combination of deep neural networks with RL has gained increased traction in recent years and is slowly shifting the focus from single-agent to multiagent environments. Dealing with multiple agents is inherently more complex as (a) the future rewards depend on multiple players' joint actions and (b) the computational complexity increases. We present the most common multiagent problem representations and their main challenges, and identify five research areas that address one or more of these challenges: centralised training and decentralised execution, opponent modelling, communication, efficient coordination, and reward shaping. We find that many computational studies rely on unrealistic assumptions or are not generalisable to other settings; they struggle to overcome the curse of dimensionality or nonstationarity. Approaches from psychology and sociology capture promising relevant behaviours, such as communication and coordination, to help agents achieve better performance in multiagent settings. We suggest that, for multiagent RL to be successful, future research should address these challenges with an interdisciplinary approach to open up new possibilities in multiagent RL.
43
A review of cooperative multi-agent deep reinforcement learning. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04105-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
44
Bahamid A, Mohd Ibrahim A. A review on crowd analysis of evacuation and abnormality detection based on machine learning systems. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07758-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
45
Cheng Y, Huang L, Wang X. Authentic Boundary Proximal Policy Optimization. IEEE TRANSACTIONS ON CYBERNETICS 2022; 52:9428-9438. [PMID: 33705327 DOI: 10.1109/tcyb.2021.3051456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
In recent years, the proximal policy optimization (PPO) algorithm has received considerable attention because of its excellent performance on many challenging tasks. However, there is still much room for theoretical explanation of the mechanism of PPO's clipping operation, which is a key means of improving its performance. In addition, while PPO is inspired by the learning theory of trust region policy optimization (TRPO), the theoretical connection between PPO's clipping operation and TRPO's trust region constraint has not been well studied. In this article, we first analyze the effect of PPO's clipping operation on the objective function of conservative policy iteration, and rigorously establish the theoretical relationship between PPO and TRPO. Then, a novel first-order policy gradient algorithm called authentic boundary PPO (ABPPO) is proposed, based on an authentic boundary setting rule. To better keep the difference between the new and old policies within the clipping range, we borrow the idea of ABPPO and propose two novel improved PPO algorithms: rollback mechanism-based ABPPO (RMABPPO) and penalized point policy difference-based ABPPO (P3DABPPO), based on the ideas of rollback clipping and penalized point policy difference, respectively. Experiments on continuous robotic control tasks implemented in MuJoCo show that our proposed improved PPO algorithms can effectively improve learning stability and accelerate learning speed compared with the original PPO.
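For reference, the clipping operation this paper analyzes is, in standard PPO (not the ABPPO variants proposed here), a per-sample surrogate that limits how far the probability ratio can move the objective; a minimal sketch, with illustrative names:

```python
import numpy as np

def ppo_clip_objective(ratio, advantage, eps=0.2):
    """Standard PPO clipped surrogate per sample:
    L = min(r * A, clip(r, 1 - eps, 1 + eps) * A),
    where r is the new/old policy probability ratio and A the advantage."""
    clipped_ratio = np.clip(ratio, 1.0 - eps, 1.0 + eps)
    # Taking the min makes the objective pessimistic: large ratio moves
    # cannot increase the surrogate beyond the clipped value.
    return np.minimum(ratio * advantage, clipped_ratio * advantage)
```

Note that clipping the objective does not by itself bound the policy difference, which is the gap between PPO's clipping and TRPO's trust region constraint that the article studies.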
46
Xie S, Zhang H, Yu H, Li Y, Zhang Z, Luo X. ET-HF: A novel information sharing model to improve multi-agent cooperation. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109916] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
47
Controlling Fleets of Autonomous Mobile Robots with Reinforcement Learning: A Brief Survey. ROBOTICS 2022. [DOI: 10.3390/robotics11050085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Controlling a fleet of autonomous mobile robots (AMRs) is a complex optimization problem. Many approaches have been proposed for solving it, ranging from heuristics, which usually do not find an optimum, to mathematical models, which are limited by their high computational effort. Machine Learning (ML) methods offer another potential route to solving such complex problems. The focus of this brief survey is on Reinforcement Learning (RL) as a particular type of ML. Thanks to its reward-based optimization, RL offers a good basis for the control of AMR fleets. In this survey, different control approaches are investigated and the aspects of AMR fleet control with respect to RL are evaluated. As a result, six fundamental key problems should be put on the current research agenda to enable broader application in industry: (1) overcoming the “sim-to-real gap”, (2) increasing the robustness of algorithms, (3) improving data efficiency, (4) integrating different fields of application, (5) enabling heterogeneous fleets with different types of AMR and (6) handling deadlocks.
48
Artificial Intelligence in Adaptive and Intelligent Educational System: A Review. FUTURE INTERNET 2022. [DOI: 10.3390/fi14090245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
There has been much discussion among academics on how pupils may be taught online while still maintaining a high degree of learning efficiency, partly because of the worldwide COVID-19 pandemic of the previous two years. Students may have trouble focusing due to a lack of teacher–student interaction, yet online learning has some advantages that are unavailable in traditional classrooms. The architecture of online courses for students is integrated into a system called the Adaptive and Intelligent Education System (AIES). In AIESs, reinforcement learning is often used in conjunction with the development of teaching strategies, and such a reinforcement-learning-based system is known as RLATES. As a prerequisite to conducting research in this field, this paper consolidates and analyses existing research, design approaches, and model categories for adaptive and intelligent educational systems, with the hope of serving as a reference that helps scholars in the field access the relevant information quickly and easily.
49
Shi Y, Mu C, Hao Y, Ma S, Xu N, Chong Z. Day‐ahead optimal dispatching of hybrid power system based on deep reinforcement learning. COGNITIVE COMPUTATION AND SYSTEMS 2022. [DOI: 10.1049/ccs2.12068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Affiliation(s)
- Yakun Shi
- School of Electrical and Information Engineering Tianjin University Tianjin China
- Chaoxu Mu
- School of Electrical and Information Engineering Tianjin University Tianjin China
- Yi Hao
- Electric Power Research Institute State Grid Tianjin Electric Power Company Tianjin China
- Shiqian Ma
- Electric Power Research Institute State Grid Tianjin Electric Power Company Tianjin China
- Na Xu
- School of Electrical and Information Engineering Tianjin University Tianjin China
- Zhiqiang Chong
- Electric Power Research Institute State Grid Tianjin Electric Power Company Tianjin China
50
Jing F, Zhang H, Gao M, Xue B, Cao K. RIS-Assisted Multi-Antenna AmBC Signal Detection Using Deep Reinforcement Learning. SENSORS (BASEL, SWITZERLAND) 2022; 22:6137. [PMID: 36015896 PMCID: PMC9414307 DOI: 10.3390/s22166137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 08/09/2022] [Accepted: 08/10/2022] [Indexed: 06/15/2023]
Abstract
Signal detection is one of the most critical and challenging issues in ambient backscatter communication (AmBC) systems. In this paper, a multi-antenna AmBC signal detection method is proposed based on reconfigurable intelligent surface (RIS) and deep reinforcement learning. Firstly, an efficient multi-antenna AmBC system is developed based on RIS, which can achieve information transmission and energy collection simultaneously. Secondly, a smart twin delayed deep deterministic (TD3) AmBC signal detection method is presented, based on deep reinforcement learning. Extensive quantitative and qualitative experiments are performed, which show that the proposed method is more compelling than the outstanding comparison methods.
Affiliation(s)
- Feng Jing
- School of Telecommunication Engineering, Xidian University, Xi’an 710126, China
- School of Information and Communication, National University of Defense Technology, Xi’an 430035, China
- Shaanxi Key Laboratory of Intelligence Coordination Networks, Xi’an 710048, China
- Hailin Zhang
- School of Telecommunication Engineering, Xidian University, Xi’an 710126, China
- Shaanxi Key Laboratory of Intelligence Coordination Networks, Xi’an 710048, China
- Mei Gao
- School of Telecommunication Engineering, Xidian University, Xi’an 710126, China
- School of Information and Communication, National University of Defense Technology, Xi’an 430035, China
- Shaanxi Key Laboratory of Intelligence Coordination Networks, Xi’an 710048, China
- Bin Xue
- School of Information and Communication, National University of Defense Technology, Xi’an 430035, China
- Shaanxi Key Laboratory of Intelligence Coordination Networks, Xi’an 710048, China
- Faculty of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an 710049, China
- Kunrui Cao
- School of Information and Communication, National University of Defense Technology, Xi’an 430035, China