1
|
Glynatsi NE, McAvoy A, Hilbe C. Evolution of reciprocity with limited payoff memory. Proc Biol Sci 2024; 291:20232493. [PMID: 38889792 DOI: 10.1098/rspb.2023.2493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 05/08/2024] [Indexed: 06/20/2024] Open
Abstract
Direct reciprocity is a mechanism for the evolution of cooperation in repeated social interactions. According to the literature, individuals naturally learn to adopt conditionally cooperative strategies if they have multiple encounters with their partner. Corresponding models have greatly facilitated our understanding of cooperation, yet they often make strong assumptions on how individuals remember and process payoff information. For example, when strategies are updated through social learning, it is commonly assumed that individuals compare their average payoffs. This would require them to compute (or remember) their payoffs against everyone else in the population. To understand how more realistic constraints influence direct reciprocity, we consider the evolution of conditional behaviours when individuals learn based on more recent experiences. Even in the most extreme case that they only take into account their very last interaction, we find that cooperation can still evolve. However, such individuals adopt less generous strategies, and they cooperate less often than in the classical setup with average payoffs. Interestingly, once individuals remember the payoffs of two or three recent interactions, cooperation rates quickly approach the classical limit. These findings contribute to a literature that explores which kind of cognitive capabilities are required for reciprocal cooperation. While our results suggest that some rudimentary form of payoff memory is necessary, it suffices to remember a few interactions.
Collapse
Affiliation(s)
- Nikoleta E Glynatsi
- Max Planck Research Group on the Dynamics of Social Behavior, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Alex McAvoy
- School of Data Science and Society, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Department of Mathematics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Christian Hilbe
- Max Planck Research Group on the Dynamics of Social Behavior, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| |
Collapse
|
2
|
LaPorte P, Hilbe C, Nowak MA. Adaptive dynamics of memory-one strategies in the repeated donation game. PLoS Comput Biol 2023; 19:e1010987. [PMID: 37384811 PMCID: PMC10337984 DOI: 10.1371/journal.pcbi.1010987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Revised: 07/12/2023] [Accepted: 06/13/2023] [Indexed: 07/01/2023] Open
Abstract
Human interactions can take the form of social dilemmas: collectively, people fare best if all cooperate but each individual is tempted to free ride. Social dilemmas can be resolved when individuals interact repeatedly. Repetition allows them to adopt reciprocal strategies which incentivize cooperation. The most basic model for direct reciprocity is the repeated donation game, a variant of the prisoner's dilemma. Two players interact over many rounds; in each round they decide whether to cooperate or to defect. Strategies take into account the history of the play. Memory-one strategies depend only on the previous round. Even though they are among the most elementary strategies of direct reciprocity, their evolutionary dynamics has been difficult to study analytically. As a result, much previous work has relied on simulations. Here, we derive and analyze their adaptive dynamics. We show that the four-dimensional space of memory-one strategies has an invariant three-dimensional subspace, generated by the memory-one counting strategies. Counting strategies record how many players cooperated in the previous round, without considering who cooperated. We give a partial characterization of adaptive dynamics for memory-one strategies and a full characterization for memory-one counting strategies.
Collapse
Affiliation(s)
- Philip LaPorte
- Department of Mathematics, University of California, Berkeley, Berkeley, California, United States of America
| | - Christian Hilbe
- Max Planck Research Group ‘Dynamics of Social Behavior’, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Martin A. Nowak
- Department of Mathematics, Harvard University, Cambridge, Massachussetts, United States of America
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachussetts, United States of America
| |
Collapse
|
3
|
Tkadlec J, Hilbe C, Nowak MA. Mutation enhances cooperation in direct reciprocity. Proc Natl Acad Sci U S A 2023; 120:e2221080120. [PMID: 37155877 PMCID: PMC10193978 DOI: 10.1073/pnas.2221080120] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 04/07/2023] [Indexed: 05/10/2023] Open
Abstract
Direct reciprocity is a powerful mechanism for the evolution of cooperation based on repeated interactions between the same individuals. But high levels of cooperation evolve only if the benefit-to-cost ratio exceeds a certain threshold that depends on memory length. For the best-explored case of one-round memory, that threshold is two. Here, we report that intermediate mutation rates lead to high levels of cooperation, even if the benefit-to-cost ratio is only marginally above one, and even if individuals only use a minimum of past information. This surprising observation is caused by two effects. First, mutation generates diversity which undermines the evolutionary stability of defectors. Second, mutation leads to diverse communities of cooperators that are more resilient than homogeneous ones. This finding is relevant because many real-world opportunities for cooperation have small benefit-to-cost ratios, which are between one and two, and we describe how direct reciprocity can attain cooperation in such settings. Our result can be interpreted as showing that diversity, rather than uniformity, promotes evolution of cooperation.
Collapse
Affiliation(s)
- Josef Tkadlec
- Department of Mathematics, Harvard University, Cambridge, MA02138
| | - Christian Hilbe
- Max Planck Research Group ‘Dynamics of Social Behavior’, Max Planck Institute for Evolutionary Biology, 24306, Plön, Germany
| | - Martin A. Nowak
- Department of Mathematics, Harvard University, Cambridge, MA02138
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA02138
| |
Collapse
|
4
|
Li J, Zhao X, Li B, Rossetti CSL, Hilbe C, Xia H. Evolution of cooperation through cumulative reciprocity. NATURE COMPUTATIONAL SCIENCE 2022; 2:677-686. [PMID: 38177263 DOI: 10.1038/s43588-022-00334-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Accepted: 09/14/2022] [Indexed: 01/06/2024]
Abstract
Reciprocity is a simple principle for cooperation that explains many of the patterns of how humans seek and receive help from each other. To capture reciprocity, traditional models often assume that individuals use simple strategies with restricted memory. These memory-1 strategies are mathematically convenient, but they miss important aspects of human reciprocity, where defections can have lasting effects. Here we instead propose a strategy of cumulative reciprocity. Cumulative reciprocators count the imbalance of cooperation across their previous interactions with their opponent. They cooperate as long as this imbalance is sufficiently small. Using analytical and computational methods, we show that this strategy can sustain cooperation in the presence of errors, that it enforces fair outcomes and that it evolves in hostile environments. Using an economic experiment, we confirm that cumulative reciprocity is more predictive of human behaviour than several classical strategies. The basic principle of cumulative reciprocity is versatile and can be extended to a range of social dilemmas.
Collapse
Affiliation(s)
- Juan Li
- Institute of Systems Engineering, Dalian University of Technology, Dalian, China
- Center for Big Data and Intelligent Decision-Making, Dalian University of Technology, Dalian, China
| | - Xiaowei Zhao
- Institute of Systems Engineering, Dalian University of Technology, Dalian, China
- School of Software Technology, Dalian University of Technology, Dalian, China
| | - Bing Li
- Institute of Systems Engineering, Dalian University of Technology, Dalian, China
| | | | - Christian Hilbe
- Max Planck Institute for Evolutionary Biology, Plön, Germany.
| | - Haoxiang Xia
- Institute of Systems Engineering, Dalian University of Technology, Dalian, China.
- Center for Big Data and Intelligent Decision-Making, Dalian University of Technology, Dalian, China.
| |
Collapse
|
5
|
Schmid L, Hilbe C, Chatterjee K, Nowak MA. Direct reciprocity between individuals that use different strategy spaces. PLoS Comput Biol 2022; 18:e1010149. [PMID: 35700167 PMCID: PMC9197081 DOI: 10.1371/journal.pcbi.1010149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Accepted: 04/28/2022] [Indexed: 12/04/2022] Open
Abstract
In repeated interactions, players can use strategies that respond to the outcome of previous rounds. Much of the existing literature on direct reciprocity assumes that all competing individuals use the same strategy space. Here, we study both learning and evolutionary dynamics of players that differ in the strategy space they explore. We focus on the infinitely repeated donation game and compare three natural strategy spaces: memory-1 strategies, which consider the last moves of both players, reactive strategies, which respond to the last move of the co-player, and unconditional strategies. These three strategy spaces differ in the memory capacity that is needed. We compute the long term average payoff that is achieved in a pairwise learning process. We find that smaller strategy spaces can dominate larger ones. For weak selection, unconditional players dominate both reactive and memory-1 players. For intermediate selection, reactive players dominate memory-1 players. Only for strong selection and low cost-to-benefit ratio, memory-1 players dominate the others. We observe that the supergame between strategy spaces can be a social dilemma: maximum payoff is achieved if both players explore a larger strategy space, but smaller strategy spaces dominate.
Collapse
Affiliation(s)
| | - Christian Hilbe
- Max Planck Research Group Dynamics of Social Behavior, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | | | - Martin A. Nowak
- Department of Mathematics, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| |
Collapse
|
6
|
Cheng Z, Chen G, Hong Y. Misperception influence on zero-determinant strategies in iterated Prisoner's Dilemma. Sci Rep 2022; 12:5174. [PMID: 35338188 PMCID: PMC8956668 DOI: 10.1038/s41598-022-08750-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Accepted: 02/17/2022] [Indexed: 11/09/2022] Open
Abstract
Zero-determinant (ZD) strategies have attracted wide attention in Iterated Prisoner's Dilemma (IPD) games, since the player equipped with ZD strategies can unilaterally enforce the two players' expected utilities subjected to a linear relation. On the other hand, uncertainties, which may be caused by misperception, occur in IPD inevitably in practical circumstances. To better understand the situation, we consider the influence of misperception on ZD strategies in IPD, where the two players, player X and player Y, have different cognitions, but player X detects the misperception and it is believed to make ZD strategies by player Y. We provide a necessary and sufficient condition for the ZD strategies in IPD with misperception, where there is also a linear relationship between players' utilities in player X's cognition. Then we explore bounds of players' expected utility deviation from a linear relationship in player X's cognition with also improving its own utility.
Collapse
Affiliation(s)
- Zhaoyang Cheng
- Key Laboratory of Systems and Control, Academy of Mathematics and Systems Science, Beijing, 100190, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, 100190, China
| | - Guanpu Chen
- Key Laboratory of Systems and Control, Academy of Mathematics and Systems Science, Beijing, 100190, China
- JD Explore Academy, Beijing, 100176, China
| | - Yiguang Hong
- Key Laboratory of Systems and Control, Academy of Mathematics and Systems Science, Beijing, 100190, China.
- Department of Control Science and Engineering, Tongji University, Shanghai, 201804, China.
| |
Collapse
|
7
|
Zhang C, Liu S, Wang Z, Weissing FJ, Zhang J. The “self-bad, partner-worse” strategy inhibits cooperation in networked populations. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2021.11.041] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
8
|
Cooperation in alternating interactions with memory constraints. Nat Commun 2022; 13:737. [PMID: 35136025 PMCID: PMC8825791 DOI: 10.1038/s41467-022-28336-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Accepted: 01/20/2022] [Indexed: 11/08/2022] Open
Abstract
In repeated social interactions, individuals often employ reciprocal strategies to maintain cooperation. To explore the emergence of reciprocity, many theoretical models assume synchronized decision making. In each round, individuals decide simultaneously whether to cooperate or not. Yet many manifestations of reciprocity in nature are asynchronous. Individuals provide help at one time and receive help at another. Here, we explore such alternating games in which players take turns. We mathematically characterize all Nash equilibria among memory-one strategies. Moreover, we use evolutionary simulations to explore various model extensions, exploring the effect of discounted games, irregular alternation patterns, and higher memory. In all cases, we observe that mutual cooperation still evolves for a wide range of parameter values. However, compared to simultaneous games, alternating games require different strategies to maintain cooperation in noisy environments. Moreover, none of the respective strategies are evolutionarily stable. In many instances of reciprocity, individuals cooperate in turns. Here, the authors analyze the equilibria and the dynamics of such alternating games, and in particular describe all strategies with one-round memory that maintain cooperation.
Collapse
|
9
|
D'Arcangelo C, Andreozzi L, Faillo M. Human players manage to extort more than the mutual cooperation payoff in repeated social dilemmas. Sci Rep 2021; 11:16820. [PMID: 34413364 PMCID: PMC8377025 DOI: 10.1038/s41598-021-96061-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 07/31/2021] [Indexed: 11/13/2022] Open
Abstract
Social dilemmas are mixed-motive games. Although the players have a common interest in maintaining cooperation, each may try to obtain a larger payoff by cooperating less than the other. This phenomenon received increased attention after Press and Dyson discovered a class of strategies for the repeated prisoner's dilemma (extortionate strategies) that secure for themselves a payoff that is never smaller, but can be larger, than the opponent's payoff. We conducted an experiment to test whether humans adopt extortionate strategies when playing a social dilemma. Our results reveal that human subjects do try to extort a larger payoff from their opponents. However, they are only successful when extortionate strategies are part of a Nash equilibrium. In settings where extortionate strategies do not appear in any Nash equilibrium, attempts at extortion only result in a breakdown of cooperation. Our subjects recognized the different incentives implied by the two settings, and they were ready to "extort" the opponent when allowed to do so. This suggests that deviations from mutually cooperative equilibria, which are usually attributed to players' impatience, coordination problems, or lack of information, can instead be driven by subjects trying to reach more favorable outcomes.
Collapse
Affiliation(s)
- Chiara D'Arcangelo
- Dipartimento di Economia, Università degli Studi G. D'Annunzio Chieti-Pescara, 65127, Pescara, Italy.
| | - Luciano Andreozzi
- Dipartimento di Economia e Management, Università di Trento, 38122, Trento, Italy
| | - Marco Faillo
- Dipartimento di Economia e Management, Università di Trento, 38122, Trento, Italy
| |
Collapse
|