1. Zeng W, Li M. Selective attention to historical comparison or social comparison in the evolutionary iterated prisoner’s dilemma game. Artif Intell Rev 2020. [DOI: 10.1007/s10462-020-09842-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/29/2022]

2. Chan CK, Hao J, Leung HF. Reciprocal Social Strategy in Social Repeated Games and Emergence of Social Norms. Int J Artif Intell Tools 2017. [DOI: 10.1142/s0218213017600077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/18/2022]
Abstract
In an artificial society where agents repeatedly interact with one another, effective coordination among agents is generally a challenge. This is especially true when the participating agents are self-interested, there is no central authority to coordinate them, and direct communication or negotiation is not possible. Recently, the problem was studied in a paper by Hao and Leung, where a new repeated game mechanism for modeling multi-agent interactions as well as a new reinforcement-learning-based agent learning method were proposed. In particular, the game mechanism differs from traditional repeated games in that the agents are anonymous and interact with randomly chosen opponents during each iteration. Their learning mechanism allows agents to coordinate without negotiations. The initial results were promising. However, extended simulation also reveals that the outcomes are not stable in the long run in some cases, as the high level of cooperation is eventually not sustainable. In this work, we revisit the problem and propose a new learning mechanism as follows. First, we propose an enhanced Q-learning-based framework that allows the agents to better capture both the individual and social utilities that they have learned through observations. Second, we propose a new concept of "social attitude" for determining the action of the agents throughout the game. Simulation results reveal that this approach can achieve higher social utility, including close-to-optimal results in some scenarios, and more importantly, the results are sustainable with social norms emerging.
Affiliation(s)
- Chi-Kong Chan: Department of Computing, Hang Seng Management College, Sha Tin, Hong Kong, China
- Jianye Hao: School of Computer Software, Tianjin University, Nankai, Tianjin, China
- Ho-Fung Leung: Department of Computer Science and Engineering, The Chinese University of Hong Kong, Sha Tin, Hong Kong, China

3. Zeng W, Li M, Chen F. Cooperation in the evolutionary iterated prisoner’s dilemma game with risk attitude adaptation. Appl Soft Comput 2016. [DOI: 10.1016/j.asoc.2016.03.025] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Indexed: 11/24/2022]

4. Zeng W, Li M, Chen F, Nan G. Risk consideration and cooperation in the iterated prisoner’s dilemma. Soft Comput 2014. [DOI: 10.1007/s00500-014-1523-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Indexed: 11/30/2022]