1
|
Hübner V, Staab M, Hilbe C, Chatterjee K, Kleshnina M. Efficiency and resilience of cooperation in asymmetric social dilemmas. Proc Natl Acad Sci U S A 2024; 121:e2315558121. [PMID: 38408249 DOI: 10.1073/pnas.2315558121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Accepted: 01/17/2024] [Indexed: 02/28/2024] Open
Abstract
Direct reciprocity is a powerful mechanism for cooperation in social dilemmas. The very logic of reciprocity, however, seems to require that individuals are symmetric, and that everyone has the same means to influence each others' payoffs. Yet in many applications, individuals are asymmetric. Herein, we study the effect of asymmetry in linear public good games. Individuals may differ in their endowments (their ability to contribute to a public good) and in their productivities (how effective their contributions are). Given the individuals' productivities, we ask which allocation of endowments is optimal for cooperation. To this end, we consider two notions of optimality. The first notion focuses on the resilience of cooperation. The respective endowment distribution ensures that full cooperation is feasible even under the most adverse conditions. The second notion focuses on efficiency. The corresponding endowment distribution maximizes group welfare. Using analytical methods, we fully characterize these two endowment distributions. This analysis reveals that both optimality notions favor some endowment inequality: More productive players ought to get higher endowments. Yet the two notions disagree on how unequal endowments are supposed to be. A focus on resilience results in less inequality. With additional simulations, we show that the optimal endowment allocation needs to account for both the resilience and the efficiency of cooperation.
Collapse
Affiliation(s)
- Valentin Hübner
- Institute of Science and Technology Austria, Klosterneuburg 3400, Austria
| | - Manuel Staab
- School of Economics, The University of Queensland, Brisbane, QLD 4067, Australia
| | - Christian Hilbe
- Max Planck Research Group Dynamics of Social Behavior, Max Planck Institute for Evolutionary Biology, Plön 24306, Germany
| | | | - Maria Kleshnina
- Institute for Advanced Study in Toulouse, Toulouse 31000, France
| |
Collapse
|
2
|
Guo H, Shen C, Hu S, Xing J, Tao P, Shi Y, Wang Z. Facilitating cooperation in human-agent hybrid populations through autonomous agents. iScience 2023; 26:108179. [PMID: 37920671 PMCID: PMC10618689 DOI: 10.1016/j.isci.2023.108179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 07/10/2023] [Accepted: 10/09/2023] [Indexed: 11/04/2023] Open
Abstract
Cooperative AI has shown its effectiveness in solving the conundrum of cooperation. Understanding how cooperation emerges in human-agent hybrid populations is a topic of significant interest, particularly in the realm of evolutionary game theory. In this article, we scrutinize how cooperative and defective Autonomous Agents (AAs) influence human cooperation in social dilemma games with a one-shot setting. Focusing on well-mixed populations, we find that cooperative AAs have a limited impact in the prisoner's dilemma games but facilitate cooperation in the stag hunt games. Surprisingly, defective AAs can promote complete dominance of cooperation in the snowdrift games. As the proportion of AAs increases, both cooperative and defective AAs have the potential to cause human cooperation to disappear. We then extend our investigation to consider the pairwise comparison rule and complex networks, elucidating that imitation strength and population structure are critical for the emergence of human cooperation in human-agent hybrid populations.
Collapse
Affiliation(s)
- Hao Guo
- School of Artificial Intelligence, Optics and Electronics (iOPEN), Northwestern Polytechnical University, Xi’an 710072, China
- Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
| | - Chen Shen
- Faculty of Engineering Sciences, Kyushu University, Kasuga-koen, Kasuga-shi, Fukuoka 816-8580, Japan
| | - Shuyue Hu
- Shanghai Artificial Intelligence Laboratory, Shanghai, China
| | - Junliang Xing
- Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
| | - Pin Tao
- Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
| | - Yuanchun Shi
- Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
| | - Zhen Wang
- School of Artificial Intelligence, Optics and Electronics (iOPEN), Northwestern Polytechnical University, Xi’an 710072, China
| |
Collapse
|
3
|
Ecotière C, Billiard S, André JB, Collet P, Ferrière R, Méléard S. Human-environment feedback and the consistency of proenvironmental behavior. PLoS Comput Biol 2023; 19:e1011429. [PMID: 37721943 PMCID: PMC10538744 DOI: 10.1371/journal.pcbi.1011429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 09/28/2023] [Accepted: 08/13/2023] [Indexed: 09/20/2023] Open
Abstract
Addressing global environmental crises such as anthropogenic climate change requires the consistent adoption of proenvironmental behavior by a large part of a population. Here, we develop a mathematical model of a simple behavior-environment feedback loop to ask how the individual assessment of the environmental state combines with social interactions to influence the consistent adoption of proenvironmental behavior, and how this feeds back to the perceived environmental state. In this stochastic individual-based model, individuals can switch between two behaviors, 'active' (or actively proenvironmental) and 'baseline', differing in their perceived cost (higher for the active behavior) and environmental impact (lower for the active behavior). We show that the deterministic dynamics and the stochastic fluctuations of the system can be approximated by ordinary differential equations and a Ornstein-Uhlenbeck type process. By definition, the proenvironmental behavior is adopted consistently when, at population stationary state, its frequency is high and random fluctuations in frequency are small. We find that the combination of social and environmental feedbacks can promote the spread of costly proenvironmental behavior when neither, operating in isolation, would. To be adopted consistently, strong social pressure for proenvironmental action is necessary but not sufficient-social interactions must occur on a faster timescale compared to individual assessment, and the difference in environmental impact must be small. This simple model suggests a scenario to achieve large reductions in environmental impact, which involves incrementally more active and potentially more costly behavior being consistently adopted under increasing social pressure for proenvironmentalism.
Collapse
Affiliation(s)
- Claire Ecotière
- Centre de Mathématiques Appliquées, CNRS, Ecole Polytechnique, IP Paris, Palaiseau, France
| | | | - Jean-Baptiste André
- Institut Jean Nicod, Département d’études cognitives, ENS, EHESS, PSL Research University, CNRS, Paris France
| | - Pierre Collet
- CPHT, CNRS, Ecole polytechnique, IP Paris, Palaiseau, France
| | - Régis Ferrière
- Institut de Biologie (IBENS), ENS-PSL, CNRS, INSERM, Paris, France
- Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona, United States of America
- iGLOBES International Research Laboratory, CNRS, ENS-PSL, University of Arizona, Tucson, Arizona, United States of America
| | - Sylvie Méléard
- Centre de Mathématiques Appliquées, CNRS, Ecole Polytechnique, IP Paris, Palaiseau, France
- Institut Universitaire de France, Paris, France
| |
Collapse
|
4
|
Kleshnina M, Hilbe C, Šimsa Š, Chatterjee K, Nowak MA. The effect of environmental information on evolution of cooperation in stochastic games. Nat Commun 2023; 14:4153. [PMID: 37438341 DOI: 10.1038/s41467-023-39625-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 06/22/2023] [Indexed: 07/14/2023] Open
Abstract
Many human interactions feature the characteristics of social dilemmas where individual actions have consequences for the group and the environment. The feedback between behavior and environment can be studied with the framework of stochastic games. In stochastic games, the state of the environment can change, depending on the choices made by group members. Past work suggests that such feedback can reinforce cooperative behaviors. In particular, cooperation can evolve in stochastic games even if it is infeasible in each separate repeated game. In stochastic games, participants have an interest in conditioning their strategies on the state of the environment. Yet in many applications, precise information about the state could be scarce. Here, we study how the availability of information (or lack thereof) shapes evolution of cooperation. Already for simple examples of two state games we find surprising effects. In some cases, cooperation is only possible if there is precise information about the state of the environment. In other cases, cooperation is most abundant when there is no information about the state of the environment. We systematically analyze all stochastic games of a given complexity class, to determine when receiving information about the environment is better, neutral, or worse for evolution of cooperation.
Collapse
Affiliation(s)
| | - Christian Hilbe
- Max Planck Research Group Dynamics of Social Behavior, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Štěpán Šimsa
- IST Austria, Klosterneuburg, Austria
- Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic
| | | | - Martin A Nowak
- Department of Mathematics, Harvard University, Cambridge, MA, USA
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
| |
Collapse
|
5
|
Sawicki J, Berner R, Loos SAM, Anvari M, Bader R, Barfuss W, Botta N, Brede N, Franović I, Gauthier DJ, Goldt S, Hajizadeh A, Hövel P, Karin O, Lorenz-Spreen P, Miehl C, Mölter J, Olmi S, Schöll E, Seif A, Tass PA, Volpe G, Yanchuk S, Kurths J. Perspectives on adaptive dynamical systems. CHAOS (WOODBURY, N.Y.) 2023; 33:071501. [PMID: 37486668 DOI: 10.1063/5.0147231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Accepted: 05/24/2023] [Indexed: 07/25/2023]
Abstract
Adaptivity is a dynamical feature that is omnipresent in nature, socio-economics, and technology. For example, adaptive couplings appear in various real-world systems, such as the power grid, social, and neural networks, and they form the backbone of closed-loop control strategies and machine learning algorithms. In this article, we provide an interdisciplinary perspective on adaptive systems. We reflect on the notion and terminology of adaptivity in different disciplines and discuss which role adaptivity plays for various fields. We highlight common open challenges and give perspectives on future research directions, looking to inspire interdisciplinary approaches.
Collapse
Affiliation(s)
- Jakub Sawicki
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Akademie Basel, Fachhochschule Nordwestschweiz FHNW, Leonhardsstrasse 6, 4009 Basel, Switzerland
| | - Rico Berner
- Department of Physics, Humboldt-Universität zu Berlin, Newtonstraße 15, 12489 Berlin, Germany
| | - Sarah A M Loos
- DAMTP, University of Cambridge, Wilberforce Road, Cambridge CB3 0WA, United Kingdom
| | - Mehrnaz Anvari
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Fraunhofer Institute for Algorithms and Scientific Computing, Schloss Birlinghoven, 53757 Sankt-Augustin, Germany
| | - Rolf Bader
- Institute of Systematic Musicology, University of Hamburg, Hamburg, Germany
| | - Wolfram Barfuss
- Transdisciplinary Research Area: Sustainable Futures, University of Bonn, 53113 Bonn, Germany
- Center for Development Research (ZEF), University of Bonn, 53113 Bonn, Germany
| | - Nicola Botta
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Department of Computer Science and Engineering, Chalmers University of Technology, 412 96 Göteborg, Sweden
| | - Nuria Brede
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Department of Computer Science, University of Potsdam, An der Bahn 2, 14476 Potsdam, Germany
| | - Igor Franović
- Scientific Computing Laboratory, Center for the Study of Complex Systems, Institute of Physics Belgrade, University of Belgrade, Pregrevica 118, 11080 Belgrade, Serbia
| | - Daniel J Gauthier
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
| | - Sebastian Goldt
- Department of Physics, International School of Advanced Studies (SISSA), Trieste, Italy
| | - Aida Hajizadeh
- Research Group Comparative Neuroscience, Leibniz Institute for Neurobiology, Magdeburg, Germany
| | - Philipp Hövel
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
| | - Omer Karin
- Department of Mathematics, Imperial College London, London SW7 2AZ, United Kingdom
| | - Philipp Lorenz-Spreen
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Lentzeallee 94, 14195 Berlin, Germany
| | - Christoph Miehl
- Akademie Basel, Fachhochschule Nordwestschweiz FHNW, Leonhardsstrasse 6, 4009 Basel, Switzerland
| | - Jan Mölter
- Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Boltzmannstraße 3, 85748 Garching bei München, Germany
| | - Simona Olmi
- Akademie Basel, Fachhochschule Nordwestschweiz FHNW, Leonhardsstrasse 6, 4009 Basel, Switzerland
| | - Eckehard Schöll
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Akademie Basel, Fachhochschule Nordwestschweiz FHNW, Leonhardsstrasse 6, 4009 Basel, Switzerland
| | - Alireza Seif
- Pritzker School of Molecular Engineering, The University of Chicago, Chicago, Illinois 60637, USA
| | - Peter A Tass
- Department of Neurosurgery, Stanford University School of Medicine, Stanford, California 94304, USA
| | - Giovanni Volpe
- Department of Physics, University of Gothenburg, Gothenburg, Sweden
| | - Serhiy Yanchuk
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Department of Physics, Humboldt-Universität zu Berlin, Newtonstraße 15, 12489 Berlin, Germany
| | - Jürgen Kurths
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Department of Physics, Humboldt-Universität zu Berlin, Newtonstraße 15, 12489 Berlin, Germany
| |
Collapse
|
6
|
Chen X, Fu F. Outlearning extortioners: unbending strategies can foster reciprocal fairness and cooperation. PNAS NEXUS 2023; 2:pgad176. [PMID: 37287707 PMCID: PMC10244001 DOI: 10.1093/pnasnexus/pgad176] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 05/14/2023] [Accepted: 05/16/2023] [Indexed: 06/09/2023]
Abstract
Recent theory shows that extortioners taking advantage of the zero-determinant (ZD) strategy can unilaterally claim an unfair share of the payoffs in the Iterated Prisoner's Dilemma. It is thus suggested that against a fixed extortioner, any adapting coplayer should be subdued with full cooperation as their best response. In contrast, recent experiments demonstrate that human players often choose not to accede to extortion out of concern for fairness, actually causing extortioners to suffer more loss than themselves. In light of this, here we reveal fair-minded strategies that are unbending to extortion such that any payoff-maximizing extortioner ultimately will concede in their own interest by offering a fair split in head-to-head matches. We find and characterize multiple general classes of such unbending strategies, including generous ZD strategies and Win-Stay, Lose-Shift (WSLS) as particular examples. When against fixed unbending players, extortioners are forced with consequentially increasing losses whenever intending to demand a more unfair share. Our analysis also pivots to the importance of payoff structure in determining the superiority of ZD strategies and in particular their extortion ability. We show that an extortionate ZD player can be even outperformed by, for example, WSLS, if the total payoff of unilateral cooperation is smaller than that of mutual defection. Unbending strategies can be used to outlearn evolutionary extortioners and catalyze the evolution of Tit-for-Tat-like strategies out of ZD players. Our work has implications for promoting fairness and resisting extortion so as to uphold a just and cooperative society.
Collapse
Affiliation(s)
- Xingru Chen
- School of Science, Beijing University of Posts and Telecommunications, Beijing 100876, China
- Department of Mathematics, Dartmouth College, Hanover, 03755 NH, USA
| | - Feng Fu
- Department of Mathematics, Dartmouth College, Hanover, 03755 NH, USA
- Department of Biomedical Data Science, Geisel School of Medicine at Dartmouth, Lebanon, 03756 NH, USA
| |
Collapse
|
7
|
Liu L, Chen X, Szolnoki A. Coevolutionary dynamics via adaptive feedback in collective-risk social dilemma game. eLife 2023; 12:82954. [PMID: 37204305 DOI: 10.7554/elife.82954] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 04/26/2023] [Indexed: 05/20/2023] Open
Abstract
Human society and natural environment form a complex giant ecosystem, where human activities not only lead to the change in environmental states, but also react to them. By using collective-risk social dilemma game, some studies have already revealed that individual contributions and the risk of future losses are inextricably linked. These works, however, often use an idealistic assumption that the risk is constant and not affected by individual behaviors. Here, we develop a coevolutionary game approach that captures the coupled dynamics of cooperation and risk. In particular, the level of contributions in a population affects the state of risk, while the risk in turn influences individuals' behavioral decision-making. Importantly, we explore two representative feedback forms describing the possible effect of strategy on risk, namely, linear and exponential feedbacks. We find that cooperation can be maintained in the population by keeping at a certain fraction or forming an evolutionary oscillation with risk, independently of the feedback type. However, such evolutionary outcome depends on the initial state. Taken together, a two-way coupling between collective actions and risk is essential to avoid the tragedy of the commons. More importantly, a critical starting portion of cooperators and risk level is what we really need for guiding the evolution toward a desired direction.
Collapse
Affiliation(s)
- Linjie Liu
- College of Science, Northwest A & F University, Yangling, China
- School of Mathematical Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Xiaojie Chen
- School of Mathematical Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Attila Szolnoki
- Institute of Technical Physics and Materials Science, Centre for Energy Research, Budapest, Hungary
| |
Collapse
|
8
|
Harhen NC, Bornstein AM. Overharvesting in human patch foraging reflects rational structure learning and adaptive planning. Proc Natl Acad Sci U S A 2023; 120:e2216524120. [PMID: 36961923 PMCID: PMC10068834 DOI: 10.1073/pnas.2216524120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 02/11/2023] [Indexed: 03/26/2023] Open
Abstract
Patch foraging presents a sequential decision-making problem widely studied across organisms-stay with a current option or leave it in search of a better alternative? Behavioral ecology has identified an optimal strategy for these decisions, but, across species, foragers systematically deviate from it, staying too long with an option or "overharvesting" relative to this optimum. Despite the ubiquity of this behavior, the mechanism underlying it remains unclear and an object of extensive investigation. Here, we address this gap by approaching foraging as both a decision-making and learning problem. Specifically, we propose a model in which foragers 1) rationally infer the structure of their environment and 2) use their uncertainty over the inferred structure representation to adaptively discount future rewards. We find that overharvesting can emerge from this rational statistical inference and uncertainty adaptation process. In a patch-leaving task, we show that human participants adapt their foraging to the richness and dynamics of the environment in ways consistent with our model. These findings suggest that definitions of optimal foraging could be extended by considering how foragers reduce and adapt to uncertainty over representations of their environment.
Collapse
Affiliation(s)
- Nora C. Harhen
- Department of Cognitive Sciences, University of California, Irvine, CA92697
| | - Aaron M. Bornstein
- Department of Cognitive Sciences, University of California, Irvine, CA92697
- Center for the Neurobiology of Learning and Memory, University of California, Irvine, CA92697
| |
Collapse
|
9
|
Barfuss W, Meylahn JM. Intrinsic fluctuations of reinforcement learning promote cooperation. Sci Rep 2023; 13:1309. [PMID: 36693872 PMCID: PMC9873645 DOI: 10.1038/s41598-023-27672-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Accepted: 01/05/2023] [Indexed: 01/26/2023] Open
Abstract
In this work, we ask for and answer what makes classical temporal-difference reinforcement learning with [Formula: see text]-greedy strategies cooperative. Cooperating in social dilemma situations is vital for animals, humans, and machines. While evolutionary theory revealed a range of mechanisms promoting cooperation, the conditions under which agents learn to cooperate are contested. Here, we demonstrate which and how individual elements of the multi-agent learning setting lead to cooperation. We use the iterated Prisoner's dilemma with one-period memory as a testbed. Each of the two learning agents learns a strategy that conditions the following action choices on both agents' action choices of the last round. We find that next to a high caring for future rewards, a low exploration rate, and a small learning rate, it is primarily intrinsic stochastic fluctuations of the reinforcement learning process which double the final rate of cooperation to up to 80%. Thus, inherent noise is not a necessary evil of the iterative learning process. It is a critical asset for the learning of cooperation. However, we also point out the trade-off between a high likelihood of cooperative behavior and achieving this in a reasonable amount of time. Our findings are relevant for purposefully designing cooperative algorithms and regulating undesired collusive effects.
Collapse
Affiliation(s)
- Wolfram Barfuss
- Tübingen AI Center, University of Tübingen, Tübingen, Germany
| | - Janusz M Meylahn
- Department of Applied Mathematics, University of Twente, Enschede, The Netherlands. .,Dutch Institute of Emergent Phenomena, University of Amsterdam, Amsterdam, The Netherlands.
| |
Collapse
|
10
|
Vidiella B, Carrignon S, Bentley RA, O’Brien MJ, Valverde S. A cultural evolutionary theory that explains both gradual and punctuated change. J R Soc Interface 2022; 19:20220570. [PMCID: PMC9667142 DOI: 10.1098/rsif.2022.0570] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Cumulative cultural evolution (CCE) occurs among humans who may be presented with many similar options from which to choose, as well as many social influences and diverse environments. It is unknown what general principles underlie the wide range of CCE dynamics and whether they can all be explained by the same unified paradigm. Here, we present a scalable evolutionary model of discrete choice with social learning, based on a few behavioural science assumptions. This paradigm connects the degree of transparency in social learning to the human tendency to imitate others. Computer simulations and quantitative analysis show the interaction of three primary factors—information transparency, popularity bias and population size—drives the pace of CCE. The model predicts a stable rate of evolutionary change for modest degrees of popularity bias. As popularity bias grows, the transition from gradual to punctuated change occurs, with maladaptive subpopulations arising on their own. When the popularity bias gets too severe, CCE stops. This provides a consistent framework for explaining the rich and complex adaptive dynamics taking place in the real world, such as modern digital media.
Collapse
Affiliation(s)
- Blai Vidiella
- Evolution of Networks Lab, Institute of Evolutionary Biology (UPF-CSIC), Passeig Marítim de la Barceloneta 37, 08003 Barcelona, Spain
| | - Simon Carrignon
- McDonald Institute for Archaeological Research, Downing Street, Cambridge CB2 3ER, UK
| | | | - Michael J. O’Brien
- Department of Communication, History, and Philosophy and Department of Life Sciences, Texas A&M University–San Antonio, Texas 78224, USA
- Department of Anthropology, University of Missouri-Columbia, Missouri 65201, USA
| | - Sergi Valverde
- Evolution of Networks Lab, Institute of Evolutionary Biology (UPF-CSIC), Passeig Marítim de la Barceloneta 37, 08003 Barcelona, Spain
- European Centre for Living Technology (ECLT), Ca’ Bottacin, 3911 Dorsoduro Calle Crosera, 30123 Venezia, Italy
| |
Collapse
|
11
|
Guo H, Wang Z, Song Z, Yuan Y, Deng X, Li X. Effect of state transition triggered by reinforcement learning in evolutionary prisoner’s dilemma game. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.08.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
|
12
|
Farahbakhsh I, Bauch CT, Anand M. Modelling coupled human-environment complexity for the future of the biosphere: strengths, gaps and promising directions. Philos Trans R Soc Lond B Biol Sci 2022; 377:20210382. [PMID: 35757879 PMCID: PMC9234813 DOI: 10.1098/rstb.2021.0382] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Humans and the environment form a single complex system where humans not only influence ecosystems but also react to them. Despite this, there are far fewer coupled human–environment system (CHES) mathematical models than models of uncoupled ecosystems. We argue that these coupled models are essential to understand the impacts of social interventions and their potential to avoid catastrophic environmental events and support sustainable trajectories on multi-decadal timescales. A brief history of CHES modelling is presented, followed by a review spanning recent CHES models of systems including forests and land use, coral reefs and fishing and climate change mitigation. The ability of CHES modelling to capture dynamic two-way feedback confers advantages, such as the ability to represent ecosystem dynamics more realistically at longer timescales, and allowing insights that cannot be generated using ecological models. We discuss examples of such key insights from recent research. However, this strength brings with it challenges of model complexity and tractability, and the need for appropriate data to parameterize and validate CHES models. Finally, we suggest opportunities for CHES models to improve human–environment sustainability in future research spanning topics such as natural disturbances, social structure, social media data, model discovery and early warning signals. This article is part of the theme issue ‘Ecological complexity and the biosphere: the next 30 years’.
Collapse
Affiliation(s)
| | - Chris T Bauch
- Department of Applied Mathematics, University of Waterloo, Waterloo, Canada
| | - Madhur Anand
- School of Environmental Sciences, University of Guelph, Guelph, Canada
| |
Collapse
|
13
|
Chen F, Wu T, Wang L. Evolutionary dynamics of zero-determinant strategies in repeated multiplayer games. J Theor Biol 2022; 549:111209. [PMID: 35779706 DOI: 10.1016/j.jtbi.2022.111209] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Revised: 06/01/2022] [Accepted: 06/23/2022] [Indexed: 12/01/2022]
Abstract
Several studies have confirmed the existence of zero-determinant (ZD) strategies in repeated social dilemmas since Press and Dyson's ingenious discovery of ZD strategies in iterated prisoner's dilemmas. However, less research studies evolutionary performance of multiplayer ZD strategies, especially from a theoretical perspective. Here, we use a state-clustering method to theoretically analyze evolutionary dynamics of two representative ZD strategies: generous ZD strategies and extortionate ZD strategies. We consider two new settings for multiplayer ZD strategies: competitions with all ZD strategies and competitions with all memory-one strategies, apart from the competitions between these strategies and some classical ones. Moreover, we investigate the influence of the level of generosity and extortion on evolutionary dynamics of generous and extortionate ZD strategies, which was commonly ignored in previous studies. Theoretical results show that players with limited generosity are at an advantageous place and extortioners extorting more severely hold their ground more readily. Our results may provide new insights into better understanding evolutionary dynamics of ZD strategies in repeated multiplayer games.
Collapse
Affiliation(s)
- Fang Chen
- Center for Systems and Control, College of Engineering, Peking University, Beijing, China
| | - Te Wu
- Center for Complex Systems, Xidian University, Xi'an, China
| | - Long Wang
- Center for Systems and Control, College of Engineering, Peking University, Beijing, China; Center for Multi-Agent Research, Institute for Artificial Intelligence, Peking University, Beijing, China.
| |
Collapse
|
14
|
Barfuss W, Mann RP. Modeling the effects of environmental and perceptual uncertainty using deterministic reinforcement learning dynamics with partial observability. Phys Rev E 2022; 105:034409. [PMID: 35428165 DOI: 10.1103/physreve.105.034409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 02/24/2022] [Indexed: 11/07/2022]
Abstract
Assessing the systemic effects of uncertainty that arises from agents' partial observation of the true states of the world is critical for understanding a wide range of scenarios, from navigation and foraging behavior to the provision of renewable resources and public infrastructures. Yet previous modeling work on agent learning and decision-making either lacks a systematic way to describe this source of uncertainty or puts the focus on obtaining optimal policies using complex models of the world that would impose an unrealistically high cognitive demand on real agents. In this work we aim to efficiently describe the emergent behavior of biologically plausible and parsimonious learning agents faced with partially observable worlds. Therefore we derive and present deterministic reinforcement learning dynamics where the agents observe the true state of the environment only partially. We showcase the broad applicability of our dynamics across different classes of partially observable agent-environment systems. We find that partial observability creates unintuitive benefits in several specific contexts, pointing the way to further research on a general understanding of such effects. For instance, partially observant agents can learn better outcomes faster, in a more stable way, and even overcome social dilemmas. Furthermore, our method allows the application of dynamical systems theory to partially observable multiagent leaning. In this regard we find the emergence of catastrophic limit cycles, a critical slowing down of the learning processes between reward regimes, and the separation of the learning dynamics into fast and slow directions, all caused by partial observability. Therefore, the presented dynamics have the potential to become a formal, yet practical, lightweight and robust tool for researchers in biology, social science, and machine learning to systematically investigate the effects of interacting partially observant agents.
Collapse
Affiliation(s)
- Wolfram Barfuss
- Institute for Theoretical Physics, University of Tübingen, 72076 Tübingen, Germany.,Department of Statistics, School of Mathematics, University of Leeds, Leeds LS2 9JT, United Kingdom
| | - Richard P Mann
- Department of Statistics, School of Mathematics, University of Leeds, Leeds LS2 9JT, United Kingdom
| |
Collapse
|
15
|
Barfuss W. Dynamical systems as a level of cognitive analysis of multi-agent learning: Algorithmic foundations of temporal-difference learning dynamics. Neural Comput Appl 2022; 34:1653-1671. [PMID: 35221541 PMCID: PMC8827307 DOI: 10.1007/s00521-021-06117-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 05/11/2021] [Indexed: 01/02/2023]
Abstract
A dynamical systems perspective on multi-agent learning, based on the link between evolutionary game theory and reinforcement learning, provides an improved, qualitative understanding of the emerging collective learning dynamics. However, confusion exists with respect to how this dynamical systems account of multi-agent learning should be interpreted. In this article, I propose to embed the dynamical systems description of multi-agent learning into different abstraction levels of cognitive analysis. The purpose of this work is to make the connections between these levels explicit in order to gain improved insight into multi-agent learning. I demonstrate the usefulness of this framework with the general and widespread class of temporal-difference reinforcement learning. I find that its deterministic dynamical systems description follows a minimum free-energy principle and unifies a boundedly rational account of game theory with decision-making under uncertainty. I then propose an on-line sample-batch temporal-difference algorithm which is characterized by the combination of applying a memory-batch and separated state-action value estimation. I find that this algorithm serves as a micro-foundation of the deterministic learning equations by showing that its learning trajectories approach the ones of the deterministic learning equations under large batch sizes. Ultimately, this framework of embedding a dynamical systems description into different abstraction levels gives guidance on how to unleash the full potential of the dynamical systems approach to multi-agent learning.
Collapse
Affiliation(s)
- Wolfram Barfuss
- School of Mathematics, University of Leeds, Leeds, UK.,Tübingen AI Center, University of Tübingen, Tübingen, Germany
| |
Collapse
|
16
|
Coordination and equilibrium selection in games: the role of local effects. Sci Rep 2022; 12:3373. [PMID: 35233046 PMCID: PMC8888577 DOI: 10.1038/s41598-022-07195-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Accepted: 02/11/2022] [Indexed: 01/28/2023] Open
Abstract
We study the role of local effects and finite size effects in reaching coordination and in equilibrium selection in two-player coordination games. We investigate three update rules - the replicator dynamics (RD), the best response (BR), and the unconditional imitation (UI). For the pure coordination game with two equivalent strategies we find a transition from a disordered state to coordination for a critical value of connectivity. The transition is system-size-independent for the BR and RD update rules. For the IU it is system-size-dependent, but coordination can always be reached below the connectivity of a complete graph. We also consider the general coordination game which covers a range of games, such as the stag hunt. For these games there is a payoff-dominant strategy and a risk-dominant strategy with associated states of equilibrium coordination. We analyse equilibrium selection analytically and numerically. For the RD and BR update rules mean-field predictions agree with simulations and the risk-dominant strategy is evolutionary favoured independently of local effects. When players use the unconditional imitation, however, we observe coordination in the payoff-dominant strategy. Surprisingly, the selection of pay-off dominant equilibrium only occurs below a critical value of the network connectivity and disappears in complete graphs. As we show, it is a combination of local effects and update rule that allows for coordination on the payoff-dominant strategy.
Collapse
|
17
|
Chica M, Hernández JM, Santos FC. Cooperation dynamics under pandemic risks and heterogeneous economic interdependence. CHAOS, SOLITONS, AND FRACTALS 2022; 155:111655. [PMID: 34955615 PMCID: PMC8683094 DOI: 10.1016/j.chaos.2021.111655] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 10/29/2021] [Accepted: 11/22/2021] [Indexed: 05/29/2023]
Abstract
The spread of COVID-19 and ensuing containment measures have accentuated the profound interdependence among nations or regions. This has been particularly evident in tourism, one of the sectors most affected by uncoordinated mobility restrictions. The impact of this interdependence on the tendency to adopt less or more restrictive measures is hard to evaluate, more so if diversity in economic exposures to citizens' mobility are considered. Here, we address this problem by developing an analytical and computational game-theoretical model encompassing the conflicts arising from the need to control the economic effects of global risks, such as in the COVID-19 pandemic. The model includes the individual costs derived from severe restrictions imposed by governments, including the resulting economic interdependence among all the parties involved in the game. By using tourism-based data, the model is enriched with actual heterogeneous income losses, such that every player has a different economic cost when applying restrictions. We show that economic interdependence enhances cooperation because of the decline in the expected payoffs by free-riding parties (i.e., those neglecting the application of mobility restrictions). Furthermore, we show (analytically and through numerical simulations) that these cross-exposures can transform the nature of the cooperation dilemma each region or country faces, modifying the position of the fixed points and the size of the basins of attraction that characterize this class of games. Finally, our results suggest that heterogeneity among regions may be used to leverage the impact of intervention policies by ensuring an agreement among the most relevant initial set of cooperators.
Collapse
Affiliation(s)
- Manuel Chica
- Andalusian Research Institute DaSCI "Data Science and Computational Intelligence", University of Granada, Granada 18071, Spain
- School of Electrical Engineering and Computing, The University of Newcastle, Callaghan NSW 2308, Australia
| | - Juan M Hernández
- Department of Quantitative Methods in Economics and Management, Universtiy Institute of Tourism and Sustainable Economic Development (TIDES), University of Las Palmas de Gran Canaria, Las Palmas 35017, Spain
| | - Francisco C Santos
- INESC-ID & Instituto Superior Técnico, Universidade de Lisboa, Porto Salvo 2744-016, Portugal
| |
Collapse
|
18
|
Salahshour M. Evolution of cooperation in costly institutions exhibits Red Queen and Black Queen dynamics in heterogeneous public goods. Commun Biol 2021; 4:1340. [PMID: 34845323 PMCID: PMC8630072 DOI: 10.1038/s42003-021-02865-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Accepted: 11/03/2021] [Indexed: 11/16/2022] Open
Abstract
Public goods are often subject to heterogeneous costs, such as the necessary costs to maintain the public goods infrastructure. However, the extent to which heterogeneity in participation cost can affect groups' ability to provide public goods is unclear. Here, by introducing a mathematical model, I show that when individuals face a costly institution and a free institution to perform a collective action task, the existence of a participation cost promotes cooperation in the costly institution. Despite paying for a participation cost, costly cooperators, who join the costly institution and cooperate, can outperform defectors who predominantly join a free institution. This promotes cooperation in the costly institution and can facilitate the evolution of cooperation in the free institution. For small profitability of the collective action, cooperation in a costly institution but not the free institution evolves. However, individuals are doomed to a winnerless red queen dynamics in which cooperators are unable to suppress defection. For large profitabilities, cooperation in both the costly and the free institution evolves. In this regime, cooperators with different game preferences complement each other to efficiently suppress defection in a black queen dynamic.
Collapse
Affiliation(s)
- Mohammad Salahshour
- Max Planck Institute for Mathematics in the Sciences, Inselstrasse 22, D-04103, Leipzig, Germany.
| |
Collapse
|
19
|
Bak-Coleman JB, Alfano M, Barfuss W, Bergstrom CT, Centeno MA, Couzin ID, Donges JF, Galesic M, Gersick AS, Jacquet J, Kao AB, Moran RE, Romanczuk P, Rubenstein DI, Tombak KJ, Van Bavel JJ, Weber EU. Stewardship of global collective behavior. Proc Natl Acad Sci U S A 2021; 118:e2025764118. [PMID: 34155097 PMCID: PMC8271675 DOI: 10.1073/pnas.2025764118] [Citation(s) in RCA: 72] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Collective behavior provides a framework for understanding how the actions and properties of groups emerge from the way individuals generate and share information. In humans, information flows were initially shaped by natural selection yet are increasingly structured by emerging communication technologies. Our larger, more complex social networks now transfer high-fidelity information over vast distances at low cost. The digital age and the rise of social media have accelerated changes to our social systems, with poorly understood functional consequences. This gap in our knowledge represents a principal challenge to scientific progress, democracy, and actions to address global crises. We argue that the study of collective behavior must rise to a "crisis discipline" just as medicine, conservation, and climate science have, with a focus on providing actionable insight to policymakers and regulators for the stewardship of social systems.
Collapse
Affiliation(s)
- Joseph B Bak-Coleman
- Center for an Informed Public, University of Washington, Seattle, WA 98195;
- eScience Institute, University of Washington, Seattle, WA 98195
| | - Mark Alfano
- Ethics & Philosophy of Technology, Delft University of Technology, 2628 CD Delft, The Netherlands
- Institute of Philosophy, Australian Catholic University, Banyo Queensland 4014, Australia
| | - Wolfram Barfuss
- Earth System Analysis, Potsdam Institute for Climate Impact Research, Member of the Leibniz Association, 14473 Potsdam, Germany
- Tübingen AI Center, University of Tübingen, 72074 Tübingen, Germany
| | - Carl T Bergstrom
- Department of Biology, University of Washington, Seattle, WA 98195
| | - Miguel A Centeno
- Princeton School of Public and International Affairs, Princeton University, Princeton, NJ 08544
| | - Iain D Couzin
- Department of Collective Behaviour, Max Planck Institute of Animal Behavior, 78315 Radolfzell am Bodensee, Germany
- Centre for the Advanced Study of Collective Behaviour, University of Konstanz, 78464 Konstanz, Germany
- Department of Biology, University of Konstanz, 78464 Konstanz, Germany
| | - Jonathan F Donges
- Earth System Analysis, Potsdam Institute for Climate Impact Research, Member of the Leibniz Association, 14473 Potsdam, Germany
- Stockholm Resilience Centre, Stockholm University, 11419 Stockholm, Sweden
| | | | - Andrew S Gersick
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544
| | - Jennifer Jacquet
- Department of Environmental Studies, New York University, New York, NY 10012
| | | | - Rachel E Moran
- Center for an Informed Public, University of Washington, Seattle, WA 98195
| | - Pawel Romanczuk
- Institute for Theoretical Biology, Department of Biology, Humboldt Universität zu Berlin, 10115 Berlin, Germany
| | - Daniel I Rubenstein
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544
| | - Kaia J Tombak
- Department of Anthropology, Hunter College of the City University of New York, New York, NY 10065
| | - Jay J Van Bavel
- Department of Psychology, New York University, New York, NY 10003
- Center for Neural Science, New York University, New York, NY 10003
| | - Elke U Weber
- Department of Psychology, Princeton University, Princeton, NJ 08544
- Andlinger Center for Energy and Environment, School of Engineering and Applied Science, Princeton University, Princeton, NJ 08544
| |
Collapse
|
20
|
Chica M, Hernández JM, Bulchand-Gidumal J. A collective risk dilemma for tourism restrictions under the COVID-19 context. Sci Rep 2021; 11:5043. [PMID: 33658596 PMCID: PMC7930199 DOI: 10.1038/s41598-021-84604-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Accepted: 02/18/2021] [Indexed: 11/09/2022] Open
Abstract
The current COVID-19 pandemic has impacted millions of people and the global economy. Tourism has been one the most affected economic sectors because of the mobility restrictions established by governments and uncoordinated actions from origin and destination regions. The coordination of restrictions and reopening policies could help control the spread of virus and enhance economies, but this is not an easy endeavor since touristic companies, citizens, and local governments have conflicting interests. We propose an evolutionary game model that reflects a collective risk dilemma behind these decisions. To this aim, we represent regions as players, organized in groups; and consider the perceived risk as a strict lock-down and null economic activity. The costs for regions when restricting their mobility are heterogeneous, given that the dependence on tourism of each region is diverse. Our analysis shows that, for both large populations and the EU NUTS2 case study, the existence of heterogeneous costs enhances global agreements. Furthermore, the decision on how to group regions to maximize the regions' agreement of the population is a relevant issue for decision makers to consider. We find out that a layout of groups based on similar costs of cooperation boosts the regions' agreements and avoid the risk of having a total lock-down and a negligible tourism activity. These findings can guide policy makers to facilitate agreements among regions to maximize the tourism recovery.
Collapse
Grants
- A-TIC-284-UGR18 Consejería de Economía, Innovación, Ciencia y Empleo, Junta de Andalucía (Ministry of Economy, Innovation, Science and Employment, Government of Andalucia)
- PGC2018-101216-B-I00 Ministerio de Economía y Competitividad (Ministry of Economy and Competitiveness)
- P18-TP-4475 Consejería de Economía, Innovación, Ciencia y Empleo, Junta de Andalucía (Ministry of Economy, Innovation, Science and Employment, Government of Andalucia)
- COVID-19-04 Universidad de Las Palmas de Gran Canaria (University of Las Palmas de Gran Canaria)
- COVID-19-04 Universidad de Las Palmas de Gran Canaria (University of Las Palmas de Gran Canaria)
Collapse
Affiliation(s)
- Manuel Chica
- Andalusian Research Institute DaSCI "Data Science and Computational Intelligence", University of Granada, 18071, Granada, Spain.
- School of Electrical Engineering and Computing, The University of Newcastle, Callaghan, NSW, 2308, Australia.
| | - Juan M Hernández
- Department of Quantitative Methods in Economics and Management, University of Las Palmas de Gran Canaria, Las Palmas, 35017, Spain
- TIDES Institute for Sustainable Tourism and Economic Development, University of Las Palmas de Gran Canaria, Las Palmas, 35017, Spain
| | - Jacques Bulchand-Gidumal
- TIDES Institute for Sustainable Tourism and Economic Development, University of Las Palmas de Gran Canaria, Las Palmas, 35017, Spain
| |
Collapse
|
21
|
San Miguel M, Toral R. Introduction to the chaos focus issue on the dynamics of social systems. CHAOS (WOODBURY, N.Y.) 2020; 30:120401. [PMID: 33380029 DOI: 10.1063/5.0037137] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Accepted: 11/11/2020] [Indexed: 06/12/2023]
Affiliation(s)
- Maxi San Miguel
- IFISC, Instituto de Física Interdisciplinar y Sistemas Complejos (CSIC-UIB), Campus Universitat de les Illes Balears, 07122 Palma de Mallorca, Spain
| | - Raul Toral
- IFISC, Instituto de Física Interdisciplinar y Sistemas Complejos (CSIC-UIB), Campus Universitat de les Illes Balears, 07122 Palma de Mallorca, Spain
| |
Collapse
|
22
|
Huang F, Cao M, Wang L. Learning enables adaptation in cooperation for multi-player stochastic games. J R Soc Interface 2020; 17:20200639. [PMID: 33202177 DOI: 10.1098/rsif.2020.0639] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Interactions among individuals in natural populations often occur in a dynamically changing environment. Understanding the role of environmental variation in population dynamics has long been a central topic in theoretical ecology and population biology. However, the key question of how individuals, in the middle of challenging social dilemmas (e.g. the 'tragedy of the commons'), modulate their behaviours to adapt to the fluctuation of the environment has not yet been addressed satisfactorily. Using evolutionary game theory, we develop a framework of stochastic games that incorporates the adaptive mechanism of reinforcement learning to investigate whether cooperative behaviours can evolve in the ever-changing group interaction environment. When the action choices of players are just slightly influenced by past reinforcements, we construct an analytical condition to determine whether cooperation can be favoured over defection. Intuitively, this condition reveals why and how the environment can mediate cooperative dilemmas. Under our model architecture, we also compare this learning mechanism with two non-learning decision rules, and we find that learning significantly improves the propensity for cooperation in weak social dilemmas, and, in sharp contrast, hinders cooperation in strong social dilemmas. Our results suggest that in complex social-ecological dilemmas, learning enables the adaptation of individuals to varying environments.
Collapse
Affiliation(s)
- Feng Huang
- Center for Systems and Control, College of Engineering, Peking University, Beijing 100871, People's Republic of China.,Center for Data Science and System Complexity, Faculty of Science and Engineering, University of Groningen, Groningen 9747 AG, The Netherlands
| | - Ming Cao
- Center for Data Science and System Complexity, Faculty of Science and Engineering, University of Groningen, Groningen 9747 AG, The Netherlands
| | - Long Wang
- Center for Systems and Control, College of Engineering, Peking University, Beijing 100871, People's Republic of China
| |
Collapse
|
23
|
Domingos EF, Grujić J, Burguillo JC, Kirchsteiger G, Santos FC, Lenaerts T. Timing Uncertainty in Collective Risk Dilemmas Encourages Group Reciprocation and Polarization. iScience 2020; 23:101752. [PMID: 33294777 PMCID: PMC7701182 DOI: 10.1016/j.isci.2020.101752] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Revised: 10/04/2020] [Accepted: 10/28/2020] [Indexed: 11/21/2022] Open
Abstract
Social dilemmas are often shaped by actions involving uncertain returns only achievable in the future, such as climate action or voluntary vaccination. In this context, uncertainty may produce non-trivial effects. Here, we assess experimentally — through a collective risk dilemma — the effect of timing uncertainty, i.e. how uncertainty about when a target needs to be reached affects the participants' behaviors. We show that timing uncertainty prompts not only early generosity but also polarized outcomes, where participants' total contributions are distributed unevenly. Furthermore, analyzing participants' behavior under timing uncertainty reveals an increase in reciprocal strategies. A data-driven game-theoretical model captures the self-organizing dynamics underpinning these behavioral patterns. Timing uncertainty thus casts a shadow on the future that leads participants to respond early, whereas reciprocal strategies appear to be important for group success. Yet, the same uncertainty also leads to inequity and polarization, requiring the inclusion of new incentives handling these societal issues. Timing uncertainty influences experimental observations in the collective risk game It induces subjects to contribute earlier and in a polarized manner Successful players adopt reciprocal strategies, responding in kind to past actions Coordination gets more difficult under high timing uncertainty
Collapse
Affiliation(s)
- Elias Fernández Domingos
- AI lab, Computer Science Department, Vrije Universiteit Brussel, Pleinlaan 9, 3rd Floor, 1050 Brussels, Belgium.,MLG, Département D'Informatique, Université Libre de Bruxelles, Boulevard Du Triomphe, CP 212, 1050 Brussels, Belgium.,Department of Telematic Engineering, University of Vigo, 36310 Vigo, Spain
| | - Jelena Grujić
- AI lab, Computer Science Department, Vrije Universiteit Brussel, Pleinlaan 9, 3rd Floor, 1050 Brussels, Belgium.,MLG, Département D'Informatique, Université Libre de Bruxelles, Boulevard Du Triomphe, CP 212, 1050 Brussels, Belgium
| | - Juan C Burguillo
- Department of Telematic Engineering, University of Vigo, 36310 Vigo, Spain
| | - Georg Kirchsteiger
- ECARES, Université Libre de Bruxelles, Av. Roosevelt 42, CP 114, 1050 Brussels, Belgium
| | - Francisco C Santos
- MLG, Département D'Informatique, Université Libre de Bruxelles, Boulevard Du Triomphe, CP 212, 1050 Brussels, Belgium.,INESC-ID and Instituto Superior Técnico, Universidade de Lisboa, IST-Taguspark, 2744-016 Porto Salvo, Portugal.,ATP-group, 2744-016 Porto Salvo, Portugal
| | - Tom Lenaerts
- AI lab, Computer Science Department, Vrije Universiteit Brussel, Pleinlaan 9, 3rd Floor, 1050 Brussels, Belgium.,MLG, Département D'Informatique, Université Libre de Bruxelles, Boulevard Du Triomphe, CP 212, 1050 Brussels, Belgium
| |
Collapse
|