1. Yao Q, Wang Y, Xiong X, Wang P, Li Y. Adversarial Decision-Making for Moving Target Defense: A Multi-Agent Markov Game and Reinforcement Learning Approach. Entropy (Basel, Switzerland) 2023; 25:e25040605. [PMID: 37190393; PMCID: PMC10137508; DOI: 10.3390/e25040605]
Abstract
Reinforcement learning has shown great ability and has defeated human players in real-time strategy games. In recent years, reinforcement learning has also been used in cyberspace to carry out automated, intelligent attacks. Traditional defense methods are not sufficient to deal with this threat, so it is necessary to design defense agents that counter intelligent attacks. The interaction between the attack agent and the defense agent can be modeled as a multi-agent Markov game. In this paper, an adversarial decision-making approach that combines the Bayesian Strong Stackelberg and WoLF algorithms is proposed to obtain the equilibrium point of multi-agent Markov games. With this method, the defense agent can obtain an adversarial decision-making strategy and continuously adjust it in cyberspace. As verified in experiments, the defense agent should attach importance to short-term rewards during a real-time game between the attack agent and the defense agent. The proposed approach obtains the largest rewards for the defense agent compared with the classic Nash-Q and URS-Q algorithms. In addition, the proposed approach adjusts the action selection probability dynamically, so that the decision entropy of the optimal action gradually decreases.
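Since the abstract names WoLF ("Win or Learn Fast") as the strategy-adjustment mechanism, a minimal sketch of the classic WoLF policy-hill-climbing update may help make the idea concrete. This is not the paper's implementation (which couples WoLF with a Bayesian Strong Stackelberg solver); the tabular setting and all hyperparameters are illustrative assumptions.

```python
import random
from collections import defaultdict

class WoLFPHCAgent:
    """Sketch of WoLF policy hill-climbing: use a small policy step when
    winning and a larger one when losing, judged against an average policy."""
    def __init__(self, n_actions, alpha=0.1, gamma=0.9,
                 delta_win=0.01, delta_lose=0.04):
        self.n = n_actions
        self.alpha, self.gamma = alpha, gamma
        self.d_win, self.d_lose = delta_win, delta_lose
        self.Q = defaultdict(lambda: [0.0] * n_actions)
        self.pi = defaultdict(lambda: [1.0 / n_actions] * n_actions)
        self.avg_pi = defaultdict(lambda: [1.0 / n_actions] * n_actions)
        self.visits = defaultdict(int)

    def act(self, s):
        return random.choices(range(self.n), weights=self.pi[s])[0]

    def update(self, s, a, r, s_next):
        q = self.Q[s]
        q[a] += self.alpha * (r + self.gamma * max(self.Q[s_next]) - q[a])
        self.visits[s] += 1
        for i in range(self.n):  # incremental estimate of the average policy
            self.avg_pi[s][i] += (self.pi[s][i] - self.avg_pi[s][i]) / self.visits[s]
        winning = sum(p * v for p, v in zip(self.pi[s], q)) > \
                  sum(p * v for p, v in zip(self.avg_pi[s], q))
        delta = self.d_win if winning else self.d_lose  # learn fast when losing
        best = max(range(self.n), key=q.__getitem__)
        for i in range(self.n):
            self.pi[s][i] += delta if i == best else -delta / (self.n - 1)
        total = sum(max(p, 0.0) for p in self.pi[s])    # project back onto simplex
        self.pi[s] = [max(p, 0.0) / total for p in self.pi[s]]
```

The variable learning rate is what lets the defender keep adjusting its mixed strategy against a nonstationary attacker, which is the role the abstract assigns to WoLF.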
Affiliation(s)
- Qian Yao, Yongjie Wang, Xinli Xiong, Peng Wang, and Yang Li: College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China; Anhui Province Key Laboratory of Cyberspace Security Situation Awareness and Evaluation, Hefei 230037, China
2. Shi H, Li J, Mao J, Hwang KS. Lateral Transfer Learning for Multiagent Reinforcement Learning. IEEE Transactions on Cybernetics 2023; 53:1699-1711. [PMID: 34506297; DOI: 10.1109/tcyb.2021.3108237]
Abstract
Some researchers have introduced transfer learning mechanisms into multiagent reinforcement learning (MARL). However, existing work on cross-task transfer for multiagent systems is designed only for homogeneous agents or similar domains. This work proposes an all-purpose cross-task transfer method, called multiagent lateral transfer (MALT), which helps MARL alleviate the training burden. We discuss several challenges in developing an all-purpose multiagent cross-task transfer learning method and provide a feasible way of reusing knowledge for MARL. In the developed method, inspired by the progressive network, we take features rather than policies or experiences as the transfer object. To achieve more efficient transfer, we assign pretrained policy networks to agents based on clustering, and an attention module is introduced to enhance the transfer framework. The proposed method places no strict requirements on the source and target tasks. Compared with existing work, our method can transfer knowledge among heterogeneous agents and also avoids negative transfer when tasks are fully different. To the best of our knowledge, this article is the first work devoted to all-purpose cross-task transfer for MARL. Several experiments in various scenarios compare the performance of the proposed method with baselines. The results demonstrate that the method is sufficiently flexible for most settings, including cooperative, competitive, homogeneous, and heterogeneous configurations.
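A minimal sketch of the lateral-transfer idea described above, assuming a progressive-network-style design: frozen source columns supply features, and an attention module weights them before fusion with the target column. The layer sizes, gating form, and class names are illustrative guesses, not the authors' architecture.

```python
import torch
import torch.nn as nn

class LateralTransferPolicy(nn.Module):
    """Target policy that fuses its own features with attention-weighted
    features from frozen, pretrained source encoders."""
    def __init__(self, obs_dim, n_actions, source_encoders, hidden=64):
        super().__init__()
        self.sources = nn.ModuleList(source_encoders)  # pretrained, kept frozen
        for enc in self.sources:
            for p in enc.parameters():
                p.requires_grad_(False)
        self.own = nn.Sequential(nn.Linear(obs_dim, hidden), nn.ReLU())
        self.attn = nn.Linear(hidden, len(source_encoders))  # one score per source
        self.head = nn.Linear(2 * hidden, n_actions)

    def forward(self, obs):
        own_feat = self.own(obs)                                  # (B, H)
        src = torch.stack([enc(obs) for enc in self.sources], 1)  # (B, S, H)
        w = torch.softmax(self.attn(own_feat), dim=-1)            # (B, S)
        transferred = (w.unsqueeze(-1) * src).sum(1)              # (B, H)
        return self.head(torch.cat([own_feat, transferred], -1))

# toy usage: two frozen source encoders feeding a new 4-action target task
enc = lambda: nn.Sequential(nn.Linear(8, 64), nn.ReLU())
policy = LateralTransferPolicy(8, 4, [enc(), enc()])
logits = policy(torch.randn(5, 8))  # (5, 4) action logits
```

Because the attention weights are learned on the target task, an unhelpful source column can be down-weighted toward zero, which is one plausible way the framework avoids negative transfer.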
3. Modeling opponent learning in multiagent repeated games. Applied Intelligence 2022. [DOI: 10.1007/s10489-022-04249-x]
Abstract
Multiagent reinforcement learning (MARL) has been used extensively in game environments. One of the main challenges in MARL is that the environment of the agent system is dynamic and the other agents are also updating their strategies. Therefore, modeling the opponents' learning process and adopting specific strategies to shape that learning is an effective way to obtain better training results. Previous studies such as DRON, LOLA, and SOS approximated the opponent's learning process and demonstrated effective applications. However, these studies modeled only transient changes in opponent strategies and lacked stability in improving equilibrium efficiency. In this article, we design the MOL (modeling opponent learning) method based on the Stackelberg game. We use best-response theory to approximate the opponents' preferences for different actions and to explore stable equilibria with higher rewards. We find that MOL achieves better results in several games with classical structures (the Prisoner's Dilemma, the Stackelberg Leader game, and Stag Hunt with three players), and in randomly generated bimatrix games. MOL performs well in competitive games played against different opponents and converges to stable points that score above the Nash equilibrium in repeated game environments. The results may provide a reference for the definition of equilibrium in multiagent reinforcement learning systems and contribute to the design of learning objectives in MARL that avoid locally disadvantageous equilibria and improve general efficiency.
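To make the Stackelberg/best-response machinery concrete, here is a minimal sketch of leader commitment against a best-responding follower in a bimatrix game. The payoff matrices are illustrative, not data from the paper, and MOL itself learns these opponent preferences rather than reading them from a known matrix.

```python
import numpy as np

# Illustrative Stackelberg Leader game: entry [i, j] is the payoff when the
# leader plays i and the follower plays j.
leader_payoff = np.array([[3.0, 1.0],
                          [4.0, 2.0]])
follower_payoff = np.array([[2.0, 0.0],
                            [1.0, 3.0]])

def follower_best_response(a_leader):
    # the follower observes the leader's action and maximizes its own row
    return int(np.argmax(follower_payoff[a_leader]))

def stackelberg_leader_action():
    # the leader anticipates the best response and commits to the action
    # that yields the highest leader payoff after the follower replies
    values = [leader_payoff[a, follower_best_response(a)]
              for a in range(leader_payoff.shape[0])]
    return int(np.argmax(values)), max(values)

action, value = stackelberg_leader_action()
print(f"leader commits to action {action} with payoff {value}")  # action 0, payoff 3.0
```

The point of the commitment structure is that the leader can secure a payoff above what simultaneous-move Nash play would give, which is the kind of above-Nash stable point the abstract reports.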
4. Min–Max Q-learning for multi-player pursuit-evasion games. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2021.12.025]
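This entry carries no abstract, but the title points to minimax Q-learning in the Littman tradition, whose core step is computing the security value max over the protagonist's mixed strategy of the worst-case expected Q-value, via a linear program. A minimal sketch of that step follows; it is a generic textbook construction, not necessarily the formulation used in this paper.

```python
import numpy as np
from scipy.optimize import linprog

def minimax_value(Q_s):
    """Solve max_pi min_o sum_a pi[a] * Q_s[a, o] as a linear program.
    Q_s is the |A| x |O| payoff table for one state (pursuer vs. evader)."""
    n_a, n_o = Q_s.shape
    # decision variables: pi[0..n_a-1] and the game value v; minimize -v
    c = np.zeros(n_a + 1)
    c[-1] = -1.0
    A_ub = np.hstack([-Q_s.T, np.ones((n_o, 1))])    # v <= pi @ Q_s[:, o] for all o
    b_ub = np.zeros(n_o)
    A_eq = np.ones((1, n_a + 1))
    A_eq[0, -1] = 0.0                                # probabilities sum to 1
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=[1.0],
                  bounds=[(0, 1)] * n_a + [(None, None)])
    return res.x[:n_a], res.x[-1]                    # mixed policy, game value

# matching pennies as a smoke test: optimal play is 50/50 with value 0
pi, v = minimax_value(np.array([[1.0, -1.0], [-1.0, 1.0]]))
print(pi, v)  # ~[0.5, 0.5], ~0.0
```

In the full algorithm this value backs up the table as Q(s, a, o) <- (1 - alpha) Q(s, a, o) + alpha (r + gamma V(s')), with one LP per visited state.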
5. Multi-Agent Reinforcement Learning Approach for Residential Microgrid Energy Scheduling. Energies 2019. [DOI: 10.3390/en13010123]
Abstract
The residential microgrid is widely considered a new paradigm of the home energy management system. The complexity of Microgrid Energy Scheduling (MES) is increasing with the integration of Electric Vehicles (EVs) and Renewable Generations (RGs). Moreover, it is challenging to determine optimal scheduling strategies that guarantee the efficiency of the microgrid market and balance all market participants' benefits. In this paper, a Multi-Agent Reinforcement Learning (MARL) approach for residential MES is proposed to promote the autonomy and fairness of microgrid market operation. First, a multi-agent-based residential microgrid model including Vehicle-to-Grid (V2G) and RGs is constructed, and an auction-based microgrid market is built. Then, in contrast to Single-Agent Reinforcement Learning (SARL), MARL can achieve distributed autonomous learning for each agent and realize an equilibrium among all agents' benefits; we therefore formulate an equilibrium-based MARL framework according to each participant's market orientation. Finally, to guarantee the fairness and privacy of the MARL process, we propose an improved optimal Equilibrium Selection-MARL (ES-MARL) algorithm based on two mechanisms: private negotiation and maximum average reward. Simulation results demonstrate that the overall performance and efficiency of the proposed MARL approach are superior to those of SARL. In addition, it is verified that the improved ES-MARL achieves a higher average profit that balances all agents.
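A minimal sketch of the "maximum average reward" selection mechanism the abstract names, reduced to the pure-strategy Nash equilibria of a toy two-player game: enumerate the equilibria, then keep the one with the highest mean payoff. The payoffs and the restriction to pure strategies are illustrative simplifications, not the paper's market model.

```python
import numpy as np
from itertools import product

# Illustrative stag-hunt-style payoffs: R1 for the row agent, R2 for the column agent.
R1 = np.array([[4.0, 0.0],
               [3.0, 2.0]])
R2 = np.array([[4.0, 3.0],
               [0.0, 2.0]])

def pure_nash_equilibria(R1, R2):
    """A joint action is a pure Nash equilibrium when neither agent can
    gain by unilaterally deviating."""
    eqs = []
    for a1, a2 in product(range(R1.shape[0]), range(R1.shape[1])):
        if R1[a1, a2] >= R1[:, a2].max() and R2[a1, a2] >= R2[a1, :].max():
            eqs.append((a1, a2))
    return eqs

eqs = pure_nash_equilibria(R1, R2)
best = max(eqs, key=lambda e: (R1[e] + R2[e]) / 2)  # maximum average reward
print("equilibria:", eqs, "selected:", best)         # picks (0, 0): average 4.0 over 2.0
```

When a game has several equilibria, ranking them by average reward gives all agents a shared, fairness-oriented tie-breaking rule, which matches the fairness goal stated in the abstract.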
6. Da Silva FL, Glatt R, Costa AHR. MOO-MDP: An Object-Oriented Representation for Cooperative Multiagent Reinforcement Learning. IEEE Transactions on Cybernetics 2019; 49:567-579. [PMID: 29990289; DOI: 10.1109/tcyb.2017.2781130]
Abstract
Reinforcement learning (RL) is a widely known technique for enabling autonomous learning. Even though RL methods have achieved success in increasingly large and complex problems, scaling solutions remains a challenge. One way to simplify (and consequently accelerate) learning is to exploit regularities in a domain, which allows generalization and reduction of the learning space. While object-oriented Markov decision processes (OO-MDPs) provide such generalization opportunities, we argue that the learning process may be further simplified by dividing the workload of tasks among multiple agents, solving problems as multiagent systems (MASs). In this paper, we propose a novel combination of OO-MDPs and MASs, called the multiagent OO-MDP (MOO-MDP). Our proposal accrues the benefits of both OO-MDPs and MASs, better addressing scalability issues. We formalize the general MOO-MDP model and present an algorithm to solve deterministic cooperative MOO-MDPs. We show that our algorithm learns optimal policies while reducing the learning space by exploiting state abstractions. We experimentally compare our results with earlier approaches in three domains and evaluate the advantages of our approach in terms of sample efficiency and memory requirements.
7. Niu L, Ren F, Zhang M, Bai Q. A Concurrent Multiple Negotiation Protocol Based on Colored Petri Nets. IEEE Transactions on Cybernetics 2017; 47:3692-3705. [PMID: 27337734; DOI: 10.1109/tcyb.2016.2577635]
Abstract
Concurrent multiple negotiation (CMN) provides a mechanism for an agent to conduct more than one negotiation simultaneously. Different interdependency relationships may exist among these negotiations, and these relationships can affect the negotiations' outcomes. The outcomes of these concurrent negotiations together determine whether the agent achieves its overall negotiation goal. Handling a CMN while accounting for the interdependency relationships among multiple negotiations is a challenging research problem. This paper: 1) comprehensively highlights research problems at the concurrent negotiation level; 2) provides a graph-based CMN model that takes the interdependency relationships into consideration; and 3) proposes a colored Petri net-based negotiation protocol for conducting CMNs. With the proposed protocol, a CMN can be processed efficiently and concurrently, and negotiation agreements can be reached efficiently. Experimental results indicate the effectiveness and efficiency of the proposed protocol in terms of negotiation success rate, negotiation time, and negotiation outcome.
8. Zhang Z, Zhao D, Gao J, Wang D, Dai Y. FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks. IEEE Transactions on Cybernetics 2017; 47:1367-1379. [PMID: 27101627; DOI: 10.1109/tcyb.2016.2544866]
Abstract
In this paper, we propose a multiagent reinforcement learning algorithm for fully cooperative tasks, called frequency of the maximum reward Q-learning (FMRQ). FMRQ aims to reach one of the optimal Nash equilibria so as to optimize the performance index in multiagent systems. The frequency of obtaining the highest global immediate reward, rather than the immediate reward itself, is used as the reinforcement signal. With FMRQ, each agent does not need to observe the other agents' actions and only shares its state and reward at each step. We validate FMRQ through case studies of repeated games: four two-player two-action cases and one three-player two-action case. It is demonstrated that FMRQ can converge to one of the optimal Nash equilibria in these cases. Moreover, comparison experiments are conducted on tasks with multiple states and finite steps: one is box-pushing and the other is a distributed sensor network problem. Experimental results show that the proposed algorithm outperforms the alternatives.
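A minimal sketch of the frequency-of-maximum-reward signal described above, under the simplifying assumption of a tabular agent that tracks, per state-action pair, how often that pair coincided with the best-known global reward. The hyperparameters and exact bookkeeping are illustrative, not the paper's specification.

```python
import random
from collections import defaultdict

class FMRQAgent:
    """Sketch: the reinforcement signal is the empirical frequency with which
    an action yielded the highest global immediate reward, not the reward."""
    def __init__(self, actions, alpha=0.1, epsilon=0.1):
        self.actions = actions
        self.alpha, self.epsilon = alpha, epsilon
        self.q = defaultdict(float)       # q[(state, action)]
        self.plays = defaultdict(int)     # times (state, action) was tried
        self.max_hits = defaultdict(int)  # times it produced the max reward

    def act(self, state):
        if random.random() < self.epsilon:
            return random.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, r_max_seen):
        # r_max_seen is the highest global immediate reward observed so far,
        # which each agent can maintain from the shared reward alone
        self.plays[(state, action)] += 1
        if reward >= r_max_seen:
            self.max_hits[(state, action)] += 1
        freq = self.max_hits[(state, action)] / self.plays[(state, action)]
        self.q[(state, action)] += self.alpha * (freq - self.q[(state, action)])
```

Because the signal only requires the shared global reward, this style of update is consistent with the abstract's claim that agents never observe each other's actions.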
9. Zha W, Chen J, Peng Z, Gu D. Construction of Barrier in a Fishing Game With Point Capture. IEEE Transactions on Cybernetics 2017; 47:1409-1422. [PMID: 27071205; DOI: 10.1109/tcyb.2016.2546381]
Abstract
This paper addresses a particular pursuit-evasion game, called the "fishing game," in which a faster evader attempts to pass through the gap between two pursuers. We are concerned with the conditions under which the evader or the pursuers can win the game. This is a game of kind, whose essential construct, the barrier, separates the state space into disjoint parts associated with each player's winning region. We present an explicit-policy method to construct the barrier. This method divides the fishing game into two subgames, related respectively to the included angle and the relative distances between the evader and the pursuers, and then analyzes the possibility of capture or escape in each subgame to derive the analytical form of the barrier. Furthermore, we fuse the games of kind and degree by solving each player's minimum-time optimal control strategy when the initial state lies in its winning region. Along with the optimal strategies, the players' trajectories are delineated and upper bounds on their winning times are derived.
10. Zhou L, Yang P, Chen C, Gao Y. Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer. IEEE Transactions on Cybernetics 2017; 47:1238-1250. [PMID: 27046917; DOI: 10.1109/tcyb.2016.2543238]
Abstract
Reinforcement learning has significant applications in multiagent systems, especially in unknown dynamic environments. However, most multiagent reinforcement learning (MARL) algorithms suffer from problems such as exponential computational complexity in the joint state-action space, which makes it difficult to scale up to realistic multiagent problems. In this paper, a novel algorithm named negotiation-based MARL with sparse interactions (NegoSI) is presented. In contrast to traditional sparse-interaction-based MARL algorithms, NegoSI adopts the equilibrium concept and makes it possible for agents to select the nonstrict equilibrium-dominating strategy profile (nonstrict EDSP) or meta equilibrium for their joint actions. The presented NegoSI algorithm consists of four parts: 1) the equilibrium-based framework for sparse interactions; 2) the negotiation for the equilibrium set; 3) the minimum-variance method for selecting one joint action; and 4) the knowledge transfer of local Q-values. In this integrated algorithm, three techniques, i.e., unshared value functions, equilibrium solutions, and sparse interactions, are adopted to achieve privacy protection, better coordination, and lower computational complexity, respectively. To evaluate the performance of the presented NegoSI algorithm, two groups of experiments are carried out with respect to three criteria: 1) steps per episode; 2) rewards per episode; and 3) average runtime. The first group of experiments, conducted on six grid-world games, shows the fast convergence and high scalability of the presented algorithm. In the second group of experiments, NegoSI is applied to an intelligent warehouse problem, and simulation results demonstrate its effectiveness compared with other state-of-the-art MARL algorithms.
11. Wu Y, Su H, Shi P, Shu Z, Wu ZG. Consensus of Multiagent Systems Using Aperiodic Sampled-Data Control. IEEE Transactions on Cybernetics 2016; 46:2132-2143. [PMID: 26316291; DOI: 10.1109/tcyb.2015.2466115]
Abstract
This paper is concerned with the consensus of multiagent systems with nonlinear dynamics under aperiodic sampled-data controllers, which are more flexible than classical periodic sampled-data controllers. Using the input-delay approach, the resulting sampled-data system is reformulated as a continuous system with a time-varying delay in the control input. A continuous Lyapunov functional, which captures the information of the sampling pattern, together with the free-weighting matrix method, is then used to establish a sufficient condition for consensusability. For the more general case in which the sampled-data controllers are subject to constant input delays, a novel discontinuous Lyapunov functional is introduced on the basis of the vector extension of Wirtinger's inequality. This functional leads to simplified and efficient stability conditions for computation and optimization. Further results on estimating the upper bound of the maximal allowable sampling interval are given as well. A numerical example is provided to show the effectiveness and merits of the proposed protocol.
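A minimal simulation sketch of aperiodic sampled-data consensus for single-integrator agents. The paper treats nonlinear dynamics and derives Lyapunov-based conditions; this toy only illustrates the zero-order-hold control held constant between irregular sampling instants. The graph, gain, and sampling-interval range are arbitrary choices.

```python
import numpy as np

# Undirected 4-cycle communication graph and its Laplacian.
A = np.array([[0, 1, 0, 1],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [1, 0, 1, 0]], dtype=float)
L = np.diag(A.sum(axis=1)) - A

x = np.array([3.0, -1.0, 4.0, 0.5])  # initial agent states
u = np.zeros_like(x)
k, dt = 0.5, 1e-3                    # control gain, integration step
t, t_next = 0.0, 0.0
rng = np.random.default_rng(0)

while t < 10.0:
    if t >= t_next:                            # aperiodic sampling instant t_k
        u = -k * (L @ x)                       # control held until the next sample
        t_next = t + rng.uniform(0.05, 0.3)    # irregular inter-sample interval
    x += dt * u                                # single-integrator agent dynamics
    t += dt

print("final states:", x)  # all near the initial average, 1.625
```

Because the graph is undirected and the control sums to zero across agents, the state average is invariant, so the agents agree on the mean of their initial values as long as the sampling intervals stay short enough for stability, which is exactly the quantity the paper's bounds characterize.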
12. Ye M, Hu G. Solving Potential Games With Dynamical Constraint. IEEE Transactions on Cybernetics 2016; 46:1156-1164. [PMID: 25974960; DOI: 10.1109/tcyb.2015.2425411]
Abstract
We solve N-player potential games with dynamical constraints in this paper. Potential games with stable dynamics are considered first, followed by one type of potential game without inherently stable dynamics. Unlike most existing Nash-seeking methods, we provide an extremum-seeking-based method that does not require explicit information on the game dynamics or the payoff functions; only measurements of the payoff functions are needed for the game strategy synthesis. Lie bracket approximation is used to analyze the proposed Nash-seeking scheme. A semi-globally practically uniformly asymptotically stable result is presented for potential games with stable dynamics, and an ultimate-boundedness result is provided for potential games without inherently stable dynamics. For first-order perturbed integrator-type dynamics, we employ an extended-state observer to handle the disturbance so that better convergence is achievable. Stability of the closed-loop system is proven and the ultimate bound is quantified. Numerical examples are presented to verify the effectiveness of the proposed methods.
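A minimal sketch of sinusoidal extremum seeking for Nash seeking in a two-player potential game, using only payoff measurements as the abstract describes. The quadratic payoffs, dither frequencies, and gains are illustrative; the paper's Lie-bracket analysis and observer-based extension are not reproduced here.

```python
import numpy as np

# A two-player potential game: equal cross-derivatives (-0.5) make it a
# potential game, with Nash equilibrium at approximately (2.4, -1.6).
def payoff1(a1, a2): return -(a1 - 2.0) ** 2 - 0.5 * a1 * a2
def payoff2(a1, a2): return -(a2 + 1.0) ** 2 - 0.5 * a1 * a2

theta = np.array([0.0, 0.0])   # players' action estimates
amp, k = 0.1, 0.5              # dither amplitude, adaptation gain
w = np.array([7.0, 11.0])      # distinct dither frequencies per player
dt = 1e-3

for step in range(int(200 / dt)):
    t = step * dt
    a = theta + amp * np.sin(w * t)          # actually played actions
    J = np.array([payoff1(*a), payoff2(*a)])  # each player measures only its payoff
    # demodulation: correlating the measured payoff with the player's own
    # dither approximates its payoff gradient, yielding gradient ascent
    theta += dt * k * J * np.sin(w * t)

print("approximate Nash strategies:", theta)  # close to (2.4, -1.6)
```

On average, each player climbs its own payoff gradient without ever knowing the payoff function's form, which is the model-free property the abstract emphasizes.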
13. Yu C, Zhang M, Ren F, Tan G. Multiagent Learning of Coordination in Loosely Coupled Multiagent Systems. IEEE Transactions on Cybernetics 2015; 45:2853-2867. [PMID: 25594993; DOI: 10.1109/tcyb.2014.2387277]
Abstract
Multiagent learning (MAL) is a promising technique for agents to learn efficient coordinated behaviors in multiagent systems (MASs). In MAL, multiple concurrent distributed learning processes can make the learning environment nonstationary for each individual learner. Developing an efficient learning approach to coordinate agents' behaviors in this dynamic environment is a difficult problem, especially when agents do not know the domain structure and have only local observability of the environment. In this paper, a coordinated MAL approach is proposed that enables agents to learn efficient coordinated behaviors by exploiting agent independence in loosely coupled MASs. The main feature of the proposed approach is to explicitly quantify and dynamically adapt agent independence during learning, so that agents can trade off between a single-agent learning process and a coordinated learning process for efficient decision making. The proposed approach is employed to solve two-robot navigation problems in domains of different scales. Experimental results show that agents using the proposed approach learn to act in concert or independently in different areas of the environment, which results in great computational savings and near-optimal performance.