1
Witt A, Toyokawa W, Lala KN, Gaissmaier W, Wu CM. Humans flexibly integrate social information despite interindividual differences in reward. Proc Natl Acad Sci U S A 2024; 121:e2404928121. [PMID: 39302964 PMCID: PMC11441569 DOI: 10.1073/pnas.2404928121] [Received: 03/14/2024] [Accepted: 08/19/2024] [Indexed: 09/22/2024]
Abstract
There has been much progress in understanding human social learning, including recent studies integrating social information into the reinforcement learning framework. Yet previous studies often assume identical payoffs between observer and demonstrator, overlooking the diversity of social information in real-world interactions. We address this gap by introducing a socially correlated bandit task that accommodates payoff differences among participants, allowing for the study of social learning under more realistic conditions. Our Social Generalization (SG) model, tested through evolutionary simulations and two online experiments, outperforms existing models by incorporating social information into the generalization process, but treating it as noisier than individual observations. Our findings suggest that human social learning is more flexible than previously believed, with the SG model indicating a potential resource-rational trade-off where social learning partially replaces individual exploration. This research highlights the flexibility of humans' social learning, allowing us to integrate social information from others with different preferences, skills, or goals.
Affiliation(s)
- Alexandra Witt
- Human and Machine Cognition Lab, University of Tübingen, Tübingen 72074, Germany
- Wataru Toyokawa
- Social Psychology and Decision Sciences, Department of Psychology, University of Konstanz, Konstanz 78464, Germany
- Computational Group Dynamics Unit, RIKEN Center for Brain Science, RIKEN, Wako 351-0198, Japan
- Kevin N. Lala
- School of Biology, University of St Andrews, St Andrews KY16 9AJ, United Kingdom
- Wolfgang Gaissmaier
- Social Psychology and Decision Sciences, Department of Psychology, University of Konstanz, Konstanz 78464, Germany
- Charley M. Wu
- Human and Machine Cognition Lab, University of Tübingen, Tübingen 72074, Germany
2
Paunov A, L’Hôtellier M, Guo D, He Z, Yu A, Meyniel F. Multiple and subject-specific roles of uncertainty in reward-guided decision-making. bioRxiv 2024:2024.03.27.587016. [PMID: 38585958 PMCID: PMC10996615 DOI: 10.1101/2024.03.27.587016] [Indexed: 04/09/2024]
Abstract
Decision-making in noisy, changing, and partially observable environments entails a basic tradeoff between immediate reward and longer-term information gain, known as the exploration-exploitation dilemma. Computationally, an effective way to balance this tradeoff is by leveraging uncertainty to guide exploration. Yet, in humans, empirical findings are mixed, from suggesting uncertainty-seeking to indifference and avoidance. In a novel bandit task that better captures uncertainty-driven behavior, we find multiple roles for uncertainty in human choices. First, stable and psychologically meaningful individual differences in uncertainty preferences actually range from seeking to avoidance, which can manifest as null group-level effects. Second, uncertainty modulates the use of basic decision heuristics that imperfectly exploit immediate rewards: a repetition bias and win-stay-lose-shift heuristic. These heuristics interact with uncertainty, favoring heuristic choices under higher uncertainty. These results, highlighting the rich and varied structure of reward-based choice, are a step to understanding its functional basis and dysfunction in psychopathology.
Affiliation(s)
- Alexander Paunov
- INSERM-CEA Cognitive Neuroimaging Unit (UNICOG), NeuroSpin Center, CEA Paris-Saclay, Gif-sur-Yvette, France; Université de Paris, Paris, France
- Institut de Neuromodulation, GHU Paris, Psychiatrie et Neurosciences, Centre Hospitalier Sainte-Anne, Pôle Hospitalo-Universitaire 15, Université Paris Cité, Paris, France
- Maëva L’Hôtellier
- INSERM-CEA Cognitive Neuroimaging Unit (UNICOG), NeuroSpin Center, CEA Paris-Saclay, Gif-sur-Yvette, France; Université de Paris, Paris, France
- Dalin Guo
- Department of Cognitive Science, University of California San Diego, San Diego, CA, USA
- Zoe He
- Department of Cognitive Science, University of California San Diego, San Diego, CA, USA
- Angela Yu
- Department of Cognitive Science, University of California San Diego, San Diego, CA, USA
- Centre for Cognitive Science & Hessian AI Center, Technical University of Darmstadt, Germany
- Florent Meyniel
- INSERM-CEA Cognitive Neuroimaging Unit (UNICOG), NeuroSpin Center, CEA Paris-Saclay, Gif-sur-Yvette, France; Université de Paris, Paris, France
- Institut de Neuromodulation, GHU Paris, Psychiatrie et Neurosciences, Centre Hospitalier Sainte-Anne, Pôle Hospitalo-Universitaire 15, Université Paris Cité, Paris, France
3
Amemiya J, Heyman GD, Gerstenberg T. Children use disagreement to infer what happened. Cognition 2024; 250:105836. [PMID: 38843594 DOI: 10.1016/j.cognition.2024.105836] [Received: 11/13/2023] [Revised: 04/09/2024] [Accepted: 05/23/2024] [Indexed: 07/22/2024]
Abstract
In a rapidly changing and diverse world, the ability to reason about conflicting perspectives is critical for effective communication, collaboration, and critical thinking. The current pre-registered experiments with children ages 7 to 11 years investigated the developmental foundations of this ability through a novel social reasoning paradigm and a computational approach. In the inference task, children were asked to figure out what happened based on whether two speakers agreed or disagreed in their interpretation. In the prediction task, children were provided information about what happened and asked to predict whether two speakers will agree or disagree. Together, these experiments assessed children's understanding that disagreement often results from ambiguity about what happened, and that ambiguity about what happened is often predictive of disagreement. Experiment 1 (N = 52) showed that children are more likely to infer that an ambiguous utterance occurred after learning that people disagreed (versus agreed) about what happened and found that these inferences become stronger with age. Experiment 2 (N = 110) similarly found age-related change in children's inferences and also showed that children could reason in the forward direction, predicting that an ambiguous utterance would lead to disagreement. A computational model indicated that although children's ability to predict when disagreements might arise may be critical for making the reverse inferences, it did not fully account for age-related change.
Affiliation(s)
- Gail D Heyman
- Department of Psychology, University of California, San Diego, USA
4
Nussenbaum K, Katzman PL, Lu H, Zorowitz S, Hartley CA. Sensitivity to the Instrumental Value of Choice Increases Across Development. Psychol Sci 2024; 35:933-947. [PMID: 38900963 DOI: 10.1177/09567976241256961] [Indexed: 06/22/2024]
Abstract
Across development, people tend to demonstrate a preference for contexts in which they have the opportunity to make choices. However, it is not clear how children, adolescents, and adults learn to calibrate this preference based on the costs and benefits of agentic choice. Here, in both a primary, in-person, reinforcement-learning experiment (N = 92; age range = 10-25 years) and a preregistered online replication study (N = 150; age range = 8-25 years), we found that participants overvalued agentic choice but also calibrated their agency decisions to the reward structure of the environment, increasingly selecting agentic choice when choice had greater instrumental value. Regression analyses and computational modeling of participant choices revealed that participants' bias toward agentic choice, reflecting its intrinsic value, remained consistent across age, whereas sensitivity to the instrumental value of agentic choice increased from childhood to early adulthood.
Affiliation(s)
- Kate Nussenbaum
- Department of Psychology, New York University
- Princeton Neuroscience Institute, Princeton University
- Hanxiao Lu
- Department of Psychology, New York University
- Catherine A Hartley
- Department of Psychology, New York University
- Center for Neural Science, New York University
5
Parr AC, Sydnor VJ, Calabro FJ, Luna B. Adolescent-to-adult gains in cognitive flexibility are adaptively supported by reward sensitivity, exploration, and neural variability. Curr Opin Behav Sci 2024; 58:101399. [PMID: 38826569 PMCID: PMC11138371 DOI: 10.1016/j.cobeha.2024.101399] [Indexed: 06/04/2024]
Abstract
Cognitive flexibility exhibits dynamic changes throughout development, with different forms of flexibility showing dissociable developmental trajectories. In this review, we propose that an adolescent-specific mode of flexibility in the face of changing environmental contingencies supports the emergence of adolescent-to-adult gains in cognitive shifting efficiency. We first describe how cognitive shifting abilities monotonically improve from childhood to adulthood, accompanied by increases in brain state flexibility, neural variability, and excitatory/inhibitory balance. We next summarize evidence supporting the existence of a dopamine-driven, adolescent peak in flexible behavior that results in reward seeking, undirected exploration, and environmental sampling. We propose a neurodevelopmental framework that relates these adolescent behaviors to the refinement of neural phenotypes relevant to mature cognitive flexibility, and thus highlight the importance of the adolescent period in fostering healthy neurocognitive trajectories.
Affiliation(s)
- Ashley C. Parr
- Department of Psychiatry, University of Pittsburgh, Pittsburgh PA, 14213, USA
- Valerie J. Sydnor
- Department of Psychiatry, University of Pittsburgh, Pittsburgh PA, 14213, USA
- Finnegan J. Calabro
- Department of Psychiatry, University of Pittsburgh, Pittsburgh PA, 14213, USA
- Beatriz Luna
- Department of Psychiatry, University of Pittsburgh, Pittsburgh PA, 14213, USA
- Department of Psychology, University of Pittsburgh, Pittsburgh PA, 14213, USA
6
Hagan KE, Aimufua I, Haynos AF, Walsh BT. The explore/exploit trade-off: An ecologically valid and translational framework that can advance mechanistic understanding of eating disorders. Int J Eat Disord 2024; 57:1102-1108. [PMID: 38385592 DOI: 10.1002/eat.24173] [Received: 11/30/2023] [Revised: 01/26/2024] [Accepted: 02/08/2024] [Indexed: 02/23/2024]
Abstract
The explore/exploit trade-off is a decision-making process that is conserved across species and balances exploring unfamiliar choices of unknown value with choosing familiar options of known value to maximize reward. This framework is rooted in behavioral ecology and has traditionally been used to study maladaptive versus adaptive non-human animal foraging behavior. Researchers have begun to recognize the potential utility of understanding human decision-making and psychopathology through the explore/exploit trade-off. In this article, we propose that the explore/exploit trade-off holds promise for advancing our mechanistic understanding of decision-making processes that confer vulnerability for and maintain eating pathology due to its neurodevelopmental bases, conservation across species, and ability to be mathematically modeled. We present a model for how suboptimal explore/exploit decision-making can promote disordered eating and present recommendations for future research applying this framework to eating pathology. Taken together, the explore/exploit trade-off provides a translational framework for expanding etiologic and maintenance models of eating pathology, given developmental changes in explore/exploit decision-making that coincide in time with the emergence of eating pathology and evidence of biased explore/exploit decision-making in psychopathology. Additionally, understanding explore/exploit decision-making in eating disorders may improve knowledge of their underlying pathophysiology, informing targeted clinical interventions such as neuromodulation and pharmacotherapy.
PUBLIC SIGNIFICANCE STATEMENT: The explore/exploit trade-off is a cross-species decision-making process whereby organisms choose between a known option with a known reward and sampling unfamiliar options. We hypothesize that imbalanced explore/exploit decision-making can promote disordered eating and present preliminary data. We propose that the explore/exploit trade-off has significant potential to advance understanding of the neurocognitive and neurodevelopmental mechanisms of eating pathology, which could ultimately guide revisions of etiologic models and inform novel interventions.
Affiliation(s)
- Kelsey E Hagan
- Department of Psychiatry, Virginia Commonwealth University, Richmond, Virginia, USA
- Institute for Women's Health, Virginia Commonwealth University, Richmond, Virginia, USA
- Ivieosa Aimufua
- Department of Psychiatry, New York State Psychiatric Institute, Columbia University Irving Medical Center, New York, New York, USA
- Ann F Haynos
- Department of Psychiatry, Virginia Commonwealth University, Richmond, Virginia, USA
- Department of Psychology, Virginia Commonwealth University, Richmond, Virginia, USA
- Department of Psychiatry and Behavioral Sciences, University of Minnesota, Minneapolis, Minnesota, USA
- B Timothy Walsh
- Department of Psychiatry, New York State Psychiatric Institute, Columbia University Irving Medical Center, New York, New York, USA
7
Ciranka S, Hertwig R. Environmental statistics and experience shape risk-taking across adolescence. Trends Cogn Sci 2023; 27:1123-1134. [PMID: 37739921 DOI: 10.1016/j.tics.2023.08.020] [Received: 04/30/2023] [Revised: 08/29/2023] [Accepted: 08/30/2023] [Indexed: 09/24/2023]
Abstract
Adolescents are often portrayed as reckless risk-takers because of their immature brains. Recent research has cast doubt on this portrayal, identifying the environment as a moderator of risk-taking. However, the key features of environments that drive risk-taking behaviors are often underspecified. We call for greater attention to the environment by drawing on research showing that its statistical structure impacts future risk-taking as people learn from outcomes they experience after taking a risk. This opinion shows that adolescents are unlikely to experience harm from many risks because environmental statistics are skewed and favor safe experiences. Environmental statistics and experience suggest entry points for policy interventions by carefully timing risk warnings and leveraging peers' potential to shape the statistics of rewarding experiences.
Affiliation(s)
- Simon Ciranka
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Lentzeallee 94, 14195 Berlin, Germany
- Ralph Hertwig
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Lentzeallee 94, 14195 Berlin, Germany