1
|
Witt A, Toyokawa W, Lala KN, Gaissmaier W, Wu CM. Humans flexibly integrate social information despite interindividual differences in reward. Proc Natl Acad Sci U S A 2024; 121:e2404928121. [PMID: 39302964 PMCID: PMC11441569 DOI: 10.1073/pnas.2404928121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 08/19/2024] [Indexed: 09/22/2024] Open
Abstract
There has been much progress in understanding human social learning, including recent studies integrating social information into the reinforcement learning framework. Yet previous studies often assume identical payoffs between observer and demonstrator, overlooking the diversity of social information in real-world interactions. We address this gap by introducing a socially correlated bandit task that accommodates payoff differences among participants, allowing for the study of social learning under more realistic conditions. Our Social Generalization (SG) model, tested through evolutionary simulations and two online experiments, outperforms existing models by incorporating social information into the generalization process, but treating it as noisier than individual observations. Our findings suggest that human social learning is more flexible than previously believed, with the SG model indicating a potential resource-rational trade-off where social learning partially replaces individual exploration. This research highlights the flexibility of humans' social learning, allowing us to integrate social information from others with different preferences, skills, or goals.
Collapse
Affiliation(s)
- Alexandra Witt
- Human and Machine Cognition Lab, University of Tübingen, Tübingen72074, Germany
| | - Wataru Toyokawa
- Social Psychology and Decision Sciences, Department of Psychology, University of Konstanz, Konstanz78464, Germany
- Computational Group Dynamics Unit, RIKEN Center for Brain Science, RIKEN, Wako351-0198, Japan
| | - Kevin N. Lala
- School of Biology, University of St Andrews, St AndrewsKY16 9AJ, United Kingdom
| | - Wolfgang Gaissmaier
- Social Psychology and Decision Sciences, Department of Psychology, University of Konstanz, Konstanz78464, Germany
| | - Charley M. Wu
- Human and Machine Cognition Lab, University of Tübingen, Tübingen72074, Germany
| |
Collapse
|
2
|
Bergerot C, Barfuss W, Romanczuk P. Moderate confirmation bias enhances decision-making in groups of reinforcement-learning agents. PLoS Comput Biol 2024; 20:e1012404. [PMID: 39231162 PMCID: PMC11404843 DOI: 10.1371/journal.pcbi.1012404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Revised: 09/16/2024] [Accepted: 08/09/2024] [Indexed: 09/06/2024] Open
Abstract
Humans tend to give more weight to information confirming their beliefs than to information that disconfirms them. Nevertheless, this apparent irrationality has been shown to improve individual decision-making under uncertainty. However, little is known about this bias' impact on decision-making in a social context. Here, we investigate the conditions under which confirmation bias is beneficial or detrimental to decision-making under social influence. To do so, we develop a Collective Asymmetric Reinforcement Learning (CARL) model in which artificial agents observe others' actions and rewards, and update this information asymmetrically. We use agent-based simulations to study how confirmation bias affects collective performance on a two-armed bandit task, and how resource scarcity, group size and bias strength modulate this effect. We find that a confirmation bias benefits group learning across a wide range of resource-scarcity conditions. Moreover, we discover that, past a critical bias strength, resource abundance favors the emergence of two different performance regimes, one of which is suboptimal. In addition, we find that this regime bifurcation comes with polarization in small groups of agents. Overall, our results suggest the existence of an optimal, moderate level of confirmation bias for decision-making in a social context.
Collapse
Affiliation(s)
- Clémence Bergerot
- Department of Biology, Humboldt Universität zu Berlin, Berlin, Germany
- Charité - Universitätsmedizin Berlin, Einstein Center for Neurosciences Berlin, Berlin, Germany
| | - Wolfram Barfuss
- Transdisciplinary Research Area: Sustainable Futures, University of Bonn, Bonn, Germany
- Center for Development Research (ZEF), University of Bonn, Bonn, Germany
| | - Pawel Romanczuk
- Department of Biology, Humboldt Universität zu Berlin, Berlin, Germany
- Science of Intelligence, Research Cluster of Excellence, Berlin, Germany
| |
Collapse
|
3
|
Homma S, Takezawa M. Risk preference as an outcome of evolutionarily adaptive learning mechanisms: An evolutionary simulation under diverse risky environments. PLoS One 2024; 19:e0307991. [PMID: 39088544 PMCID: PMC11293680 DOI: 10.1371/journal.pone.0307991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Accepted: 07/15/2024] [Indexed: 08/03/2024] Open
Abstract
The optimization of cognitive and learning mechanisms can reveal complicated behavioral phenomena. In this study, we focused on reinforcement learning, which uses different learning rules for positive and negative reward prediction errors. We attempted to relate the evolved learning bias to the complex features of risk preference such as domain-specific behavior manifests and the relatively stable domain-general factor underlying behaviors. The simulations of the evolution of the two learning rates under diverse risky environments showed that the positive learning rate evolved on average to be higher than the negative one, when agents experienced both tasks where risk aversion was more rewarding and risk seeking was more rewarding. This evolution enabled agents to flexibly choose more reward behaviors depending on the task type. The evolved agents also demonstrated behavioral patterns described by the prospect theory. Our simulations captured two aspects of the evolution of risk preference: the domain-specific aspect, behavior acquired through learning in a specific context; and the implicit domain-general aspect, corresponding to the learning rates shaped through evolution to adaptively behave in a wide range of environments. These results imply that our framework of learning under the innate constraint may be useful in understanding the complicated behavioral phenomena.
Collapse
Affiliation(s)
- Shogo Homma
- Department of Behavioral Science, Graduate School of Humanities and Human Sciences, Hokkaido University, Sapporo, Hokkaido, Japan
- Japan Society for the Promotion of Science, Tokyo, Japan
- Department of Cognitive and Psychological Sciences, Graduate School of Informatics, Nagoya University, Nagoya, Aichi, Japan
| | - Masanori Takezawa
- Department of Behavioral Science, Graduate School of Humanities and Human Sciences, Hokkaido University, Sapporo, Hokkaido, Japan
- Center for Experimental Research in Social Sciences, Hokkaido University, Sapporo, Hokkaido, Japan
- Center for Human Nature, Artificial Intelligence and Neuroscience, Hokkaido University, Sapporo, Hokkaido, Japan
| |
Collapse
|
4
|
Orejudo S, Lozano-Blasco R, Bautista P, Aiger M. Interaction among participants in a collective intelligence experiment: an emotional approach. Front Psychol 2024; 15:1383134. [PMID: 38813562 PMCID: PMC11133684 DOI: 10.3389/fpsyg.2024.1383134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 04/04/2024] [Indexed: 05/31/2024] Open
Abstract
Introduction The construct of collective intelligence assumes that groups have a better capacity than individuals to deal with complex, poorly defined problems. The digital domain allows us to analyze this premise under circumstances different from those in the physical environment: we can gather an elevated number of participants and generate a large quantity of data. Methods This study adopted an emotional perspective to analyze the interactions among 794 adolescents dealing with a sexting case on an online interaction platform designed to generate group answers resulting from a certain degree of achieved consensus. Results Our results show that emotional responses evolve over time in several phases of interaction. From the onset, the emotional dimension predicts how individual responses will evolve, particularly in the final consensus phase. Discussion Responses gradually become more emotionally complex; participants tend to identify themselves with the victim in the test case while increasingly rejecting the aggressors.
Collapse
Affiliation(s)
- Santos Orejudo
- Department of Psychology and Sociology, University of Zaragoza, Zaragoza, Spain
| | | | - Pablo Bautista
- Department of Educational Sciences, University of Zaragoza, Zaragoza, Spain
| | - Montserrat Aiger
- Department of Psychology and Sociology, University of Zaragoza, Zaragoza, Spain
| |
Collapse
|
5
|
Tump AN, Deffner D, Pleskac TJ, Romanczuk P, M. Kurvers RHJ. A Cognitive Computational Approach to Social and Collective Decision-Making. PERSPECTIVES ON PSYCHOLOGICAL SCIENCE 2024; 19:538-551. [PMID: 37671891 PMCID: PMC10913326 DOI: 10.1177/17456916231186964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/07/2023]
Abstract
Collective dynamics play a key role in everyday decision-making. Whether social influence promotes the spread of accurate information and ultimately results in adaptive behavior or leads to false information cascades and maladaptive social contagion strongly depends on the cognitive mechanisms underlying social interactions. Here we argue that cognitive modeling, in tandem with experiments that allow collective dynamics to emerge, can mechanistically link cognitive processes at the individual and collective levels. We illustrate the strength of this cognitive computational approach with two highly successful cognitive models that have been applied to interactive group experiments: evidence-accumulation and reinforcement-learning models. We show how these approaches make it possible to simultaneously study (a) how individual cognition drives social systems, (b) how social systems drive individual cognition, and (c) the dynamic feedback processes between the two layers.
Collapse
Affiliation(s)
- Alan N. Tump
- Center for Adaptive Rationality, Max Planck Institute for Human Development
- Science of Intelligence, Technische Universität Berlin
| | - Dominik Deffner
- Center for Adaptive Rationality, Max Planck Institute for Human Development
- Science of Intelligence, Technische Universität Berlin
| | | | - Pawel Romanczuk
- Science of Intelligence, Technische Universität Berlin
- Institute for Theoretical Biology, Department of Biology, Humboldt Universität zu Berlin
- Bernstein Center for Computational Neuroscience Berlin
| | - Ralf H. J. M. Kurvers
- Center for Adaptive Rationality, Max Planck Institute for Human Development
- Science of Intelligence, Technische Universität Berlin
| |
Collapse
|
6
|
Beck KB, Sheldon BC, Firth JA. Social learning mechanisms shape transmission pathways through replicate local social networks of wild birds. eLife 2023; 12:85703. [PMID: 37128701 PMCID: PMC10154030 DOI: 10.7554/elife.85703] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 04/05/2023] [Indexed: 05/03/2023] Open
Abstract
The emergence and spread of novel behaviours via social learning can lead to rapid population-level changes whereby the social connections between individuals shape information flow. However, behaviours can spread via different mechanisms and little is known about how information flow depends on the underlying learning rule individuals employ. Here, comparing four different learning mechanisms, we simulated behavioural spread on replicate empirical social networks of wild great tits and explored the relationship between individual sociality and the order of behavioural acquisition. Our results reveal that, for learning rules dependent on the sum and strength of social connections to informed individuals, social connectivity was related to the order of acquisition, with individuals with increased social connectivity and reduced social clustering adopting new behaviours faster. However, when behavioural adoption depends on the ratio of an individuals' social connections to informed versus uninformed individuals, social connectivity was not related to the order of acquisition. Finally, we show how specific learning mechanisms may limit behavioural spread within networks. These findings have important implications for understanding whether and how behaviours are likely to spread across social systems, the relationship between individuals' sociality and behavioural acquisition, and therefore for the costs and benefits of sociality.
Collapse
Affiliation(s)
- Kristina B Beck
- Edward Grey Institute of Field Ornithology, Department of Biology, University of Oxford, Oxford, United Kingdom
| | - Ben C Sheldon
- Edward Grey Institute of Field Ornithology, Department of Biology, University of Oxford, Oxford, United Kingdom
| | - Josh A Firth
- Edward Grey Institute of Field Ornithology, Department of Biology, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
7
|
Jiang Y, Marcowski P, Ryazanov A, Winkielman P. People conform to social norms when gambling with lives or money. Sci Rep 2023; 13:853. [PMID: 36646767 PMCID: PMC9842616 DOI: 10.1038/s41598-023-27462-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 01/02/2023] [Indexed: 01/18/2023] Open
Abstract
Many consider moral decisions to follow an internal "moral compass", resistant to social pressures. Here we examine how social influence shapes moral decisions under risk, and how it operates in different decision contexts. We employed an adapted Asian Disease Paradigm where participants chose between certain losses/gains and probabilistic losses/gains in a series of moral (lives) or financial (money) decisions. We assessed participants' own risk preferences before and after exposing them to social norms that are generally risk-averse or risk-seeking. Our results showed that participants robustly shifted their own choices towards the observed risk preferences. This conformity holds even after a re-testing in three days. Interestingly, in the monetary domain, risk-averse norms have more influence on choices in the loss frame, whereas risk-seeking norms have more influence in the gain frame, presumably because norms that contradict default behavior are most informative. In the moral domain, risk-averse as opposed to risk-seeking norms are more effective in the loss frame but in the gain frame different norms are equally effective. Taken together, our results demonstrate conformity in risk preferences across contexts and highlight unique features of decisions and conformity in moral and monetary domains.
Collapse
Affiliation(s)
- Yueyi Jiang
- Department of Psychology, University of California San Diego, La Jolla, CA, USA.
| | - Przemysław Marcowski
- Swartz Center for Computational Neuroscience, University of California San Diego, La Jolla, CA, USA
| | - Arseny Ryazanov
- Department of Psychology, University of California San Diego, La Jolla, CA, USA
| | - Piotr Winkielman
- Department of Psychology, University of California San Diego, La Jolla, CA, USA.,Department of Psychology, SWPS University of Social Sciences and Humanities, Warsaw, Poland
| |
Collapse
|
8
|
Orejudo S, Cano-Escoriaza J, Cebollero-Salinas AB, Bautista P, Clemente-Gallardo J, Rivero A, Rivero P, Tarancón A. Evolutionary emergence of collective intelligence in large groups of students. Front Psychol 2022; 13:848048. [PMID: 36405219 PMCID: PMC9666766 DOI: 10.3389/fpsyg.2022.848048] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 09/26/2022] [Indexed: 10/20/2023] Open
Abstract
The emergence of collective intelligence has been studied in much greater detail in small groups than in larger ones. Nevertheless, in groups of several hundreds or thousands of members, it is well-known that the social environment exerts a considerable influence on individual behavior. A few recent papers have dealt with some aspects of large group situations, but have not provided an in-depth analysis of the role of interactions among the members of a group in the creation of ideas, as well as the group's overall performance. In this study, we report an experiment where a large set of individuals, i.e., 789 high-school students, cooperated online in real time to solve two different examinations on a specifically designed platform (Thinkhub). Our goal of this paper 6 to describe the specific mechanisms of idea creation we were able to observe and to measure the group's performance as a whole. When we deal with communication networks featuring a large number of interacting entities, it seems natural to model the set as a complex system by resorting to the tools of statistical mechanics. Our experiment shows how an interaction in small groups that increase in size over several phases, leading to a final phase where the students are confronted with the most popular answers of the previous phases, is capable of producing high-quality answers to all examination questions, whereby the last phase plays a crucial role. Our experiment likewise shows that a group's performance in such a task progresses in a linear manner in parallel with the size of the group. Finally, we show that the controlled interaction and dynamics foreseen in the system can reduce the spread of "fake news" within the group.
Collapse
Affiliation(s)
- Santos Orejudo
- Department of Psychology and Sociology, University of Zaragoza, Zaragoza, Spain
| | | | | | - Pablo Bautista
- Department of Psychology and Sociology, University of Zaragoza, Zaragoza, Spain
| | - Jesús Clemente-Gallardo
- Department of Theoretical Physics, University of Zaragoza, Zaragoza, Spain
- Institute for Biocomputation and Physics of Complex Systems, University of Zaragoza, Zaragoza, Spain
| | | | - Pilar Rivero
- Department of Specific Didactics, University of Zaragoza, Zaragoza, Spain
| | - Alfonso Tarancón
- Department of Theoretical Physics, University of Zaragoza, Zaragoza, Spain
- Institute for Biocomputation and Physics of Complex Systems, University of Zaragoza, Zaragoza, Spain
| |
Collapse
|