1
|
Ajuwon V, Ojeda A, Murphy RA, Monteiro T, Kacelnik A. Paradoxical choice and the reinforcing value of information. Anim Cogn 2023; 26:623-637. [PMID: 36306041 PMCID: PMC9950180 DOI: 10.1007/s10071-022-01698-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 09/07/2022] [Accepted: 10/01/2022] [Indexed: 11/01/2022]
Abstract
Signals that reduce uncertainty can be valuable because well-informed decision-makers can better align their preferences to opportunities. However, some birds and mammals display an appetite for informative signals that cannot be used to increase returns. We explore the role that reward-predictive stimuli have in fostering such preferences, aiming at distinguishing between two putative underlying mechanisms. The 'information hypothesis' proposes that reducing uncertainty is reinforcing per se, somewhat consistently with the concept of curiosity: a motivation to know in the absence of tractable extrinsic benefits. In contrast, the 'conditioned reinforcement hypothesis', an associative account, proposes asymmetries in secondarily acquired reinforcement: post-choice stimuli announcing forthcoming rewards (S+) reinforce responses more than stimuli signalling no rewards (S-) inhibit responses. In three treatments, rats faced two equally profitable options delivering food probabilistically after a fixed delay. In the informative option (Info), food or no food was signalled immediately after choice, whereas in the non-informative option (NoInfo) outcomes were uncertain until the delay lapsed. Subjects preferred Info when (1) both outcomes were explicitly signalled by salient auditory cues, (2) only forthcoming food delivery was explicitly signalled, and (3) only the absence of forthcoming reward was explicitly signalled. Acquisition was slower in (3), when food was not explicitly signalled, showing that signals for positive outcomes have a greater influence on the development of preference than signals for negative ones. Our results are consistent with an elaborated conditioned reinforcement account, and with the conjecture that both uncertainty reduction and conditioned reinforcement jointly act to generate preference.
Collapse
Affiliation(s)
- Victor Ajuwon
- Department of Biology, University of Oxford, Oxford, UK.
| | - Andrés Ojeda
- grid.4991.50000 0004 1936 8948Department of Biology, University of Oxford, Oxford, UK
| | - Robin A. Murphy
- grid.4991.50000 0004 1936 8948Department of Experimental Psychology, University of Oxford, Oxford, UK
| | - Tiago Monteiro
- grid.4991.50000 0004 1936 8948Department of Biology, University of Oxford, Oxford, UK ,grid.6583.80000 0000 9686 6466Domestication Lab, Department of Interdisciplinary Life Sciences, Konrad Lorenz Institute of Ethology, University of Veterinary Medicine Vienna, Vienna, Austria
| | - Alex Kacelnik
- Department of Biology, University of Oxford, Oxford, UK.
| |
Collapse
|
2
|
López-Tolsa GE, Niño JM, Orduña V. The essential value of the alternatives of the suboptimal choice procedure is different for pigeons and rats. Behav Processes 2020; 181:104245. [PMID: 32991942 DOI: 10.1016/j.beproc.2020.104245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Revised: 08/29/2020] [Accepted: 09/04/2020] [Indexed: 11/20/2022]
Abstract
Suboptimal choice is the preference for a discriminative alternative with low probability of reinforcement, over a non-discriminative alternative with higher probability of reinforcement. Pigeons consistently prefer the discriminative alternative, whereas rats prefer the non-discriminative; the variables accounting for this difference are not yet clear. The economic concepts related to demand curves have been used to calculate the essential value of different types of reinforcers, so they may be useful to compare the value of the alternatives in the suboptimal choice procedure. The goal of this study was to calculate the essential value of each of the alternatives presented in the suboptimal choice procedure to assess if pigeons (Experiment 1) and rats (Experiment 2) value them differently. In both experiments, the fixed ratio requirement in the initial link was increased throughout sessions in order to obtain the demand curve and calculate the essential value by fitting the exponential-demand model. A Bayesian Linear Mixed-Effects Model indicated that pigeons had higher essential values for the discriminative alternative, whereas rats obtained higher essential values for the non-discriminative alternative. These results suggest that preferences in the suboptimal choice procedure are indeed based on the essential value of the alternatives, and provide a new paradigm to study the variables affecting this phenomenon.
Collapse
Affiliation(s)
- Gabriela E López-Tolsa
- Facultad de Psicología, Universidad Nacional Autónoma de México, Ciudad de México, 04510, Mexico
| | - José Manuel Niño
- Facultad de Psicología, Universidad Nacional Autónoma de México, Ciudad de México, 04510, Mexico
| | - Vladimir Orduña
- Facultad de Psicología, Universidad Nacional Autónoma de México, Ciudad de México, 04510, Mexico.
| |
Collapse
|
3
|
Abstract
Animals will favor a risky option when a stimulus signaling reward bridges the choice and the outcome. The present experiments investigated signal-induced risky choices and reward-outcome expectations in rhesus and capuchin monkeys. Risky choice was assessed by preference for a large-probabilistic reward over a modest-certain reward. Outcome expectancy was assessed by providing a truncation-response to shorten the delay period. In Experiment 1 both species generally favored the risky option compared to a safe option when the outcomes were signaled and generally shortened the delays except when a signaled-loss stimulus was presented. The use of the delay-truncation response suggested that the monkeys were sensitive to the information conveyed by the stimulus. Experiments 2 and 3 were designed to investigate whether the delay-truncation response used by capuchin monkeys was strategically used reflecting explicit decision-making versus a conditioned response to reward stimuli. A perceptual judgment task was included and the selective use of the delay-truncation response on unsignaled correct trials may suggest the involvement of metacognitive processes. The capuchin monkeys generally truncated the delays except under conditions where reward would not be expected (risky-loss or incorrect-judgment). When the outcomes were unsignaled during the delay some capuchin monkeys were less likely to truncate the delay following an incorrect task response. Overall, the monkeys: (1) made more risky choices when the outcomes were signaled - consistent with gambling-like behavior. (2) selectively truncated the unsignaled delays when rewards could be anticipated (even when metacognitive-like awareness guided anticipation) - suggesting that delay truncation responses reflect explicit outcome expectancy.
Collapse
Affiliation(s)
- Travis R Smith
- Department of Psychological Sciences, Kansas State University, 492 Bluemont Hall, 1114 Mid-Campus Dr North, Manhattan, KS, 66506-5302, USA.
| | - Michael J Beran
- Language Research Center and Department of Psychology, Georgia State University, Atlanta, GA, USA
| |
Collapse
|
4
|
Macías A, González VV, Machado A, Vasconcelos M. The functional equivalence of two variants of the suboptimal choice task: choice proportion and response latency as measures of value. Anim Cogn 2020; 24:85-98. [PMID: 32772333 DOI: 10.1007/s10071-020-01418-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Revised: 07/28/2020] [Accepted: 07/30/2020] [Indexed: 10/23/2022]
Abstract
In the suboptimal-choice task, birds systematically choose the leaner but informative option (suboptimal) over the richer but non-informative option (optimal). The task has two variations. In the standard task, the optimal option includes two terminal link stimuli. In the original task, it includes a single terminal link stimulus. Two models, the temporal information account (Cunningham and Shahan, J Exp Psychol Anim Learn Cogn 44:1-22, 2018) and the ∆-∑ hypothesis (González et al., J Exp Anal Behav 113:591-608, 2020), presuppose that these procedures are equivalent, but no formal comparison is available. Here we test whether or not these procedures are functionally equivalent. One group of pigeons was trained with the standard procedure, another group with the original procedure, and a third group was trained with a hybrid of the other two (i.e., the two options were the optimal links of the standard and original procedures). Our findings indicate that the number of terminal link stimuli in the optimal option is inconsequential vis-à-vis choice. Moreover, our findings also indicate that latencies to respond are a sensitive metric of value and choice. As predicted by the Sequential Choice Model, we were able to predict simultaneous choices from the latencies of sequential choices and observed a substantial shortening of latencies during simultaneous choices.
Collapse
Affiliation(s)
- Alejandro Macías
- Animal Learning and Behavior Lab, School of Psychology, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal. .,Department of Education and Psychology, University of Aveiro, Aveiro, Portugal.
| | - Valeria V González
- Animal Learning and Behavior Lab, School of Psychology, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal
| | - Armando Machado
- Department of Education and Psychology, University of Aveiro, Aveiro, Portugal.,William James Center for Research, University of Aveiro, Aveiro, Portugal
| | - Marco Vasconcelos
- Department of Education and Psychology, University of Aveiro, Aveiro, Portugal.,William James Center for Research, University of Aveiro, Aveiro, Portugal
| |
Collapse
|
5
|
González-Torres R, Flores J, Orduña V. Suboptimal choice by pigeons is eliminated when key-pecking behavior is replaced by treadle-pressing. Behav Processes 2020; 178:104157. [PMID: 32497555 DOI: 10.1016/j.beproc.2020.104157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Revised: 05/29/2020] [Accepted: 05/29/2020] [Indexed: 11/25/2022]
Abstract
In the study of suboptimal choice, a reliable result is that pigeons strongly prefer an alternative that signals whether a reinforcer will be delivered or not over another alternative without that information even if the first provides a lower probability of reinforcement. In the aforementioned research, key pecking has been the operant response and illuminated keys the discriminative stimuli. In the present study we modified both of these aspects of the procedure in order to analyze the generality of suboptimal preferences of pigeons and to investigate the effect of changes in the incentive salience of the discriminative stimuli. To accomplish this, we presented pigeons a choice situation with the same parameters of reinforcement than previous research, but with treadle pressing as the choice response and ambient lights as discriminative stimuli. Under these conditions, most of the pigeons showed optimal behavior and a high degree of discrimination of the stimuli associated with the discriminative alternative. A control condition with key pecking as choice response and keylights as discriminative stimuli showed that the same pigeons turned to be suboptimal, a result that discards the possibility that the optimality found in the main condition was a consequence of a particular characteristic of our sample of subjects or of our procedure. We discuss the influence that the attribution of incentive salience to the discriminative stimuli has on suboptimal choice in both pigeons and rats.
Collapse
Affiliation(s)
| | - Julio Flores
- Facultad de Psicología, Universidad Nacional Autónoma de México, México, DF, 04510, Mexico
| | - Vladimir Orduña
- Facultad de Psicología, Universidad Nacional Autónoma de México, México, DF, 04510, Mexico.
| |
Collapse
|
6
|
The incentive salience of the stimuli biases rats’ preferences in the “suboptimal choice” procedure. Behav Processes 2020; 172:104057. [DOI: 10.1016/j.beproc.2020.104057] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Revised: 01/13/2020] [Accepted: 01/14/2020] [Indexed: 11/24/2022]
|
7
|
Orduña V, Alba R. Rats' optimal choice behavior in a gambling-like task. Behav Processes 2019; 162:104-111. [PMID: 30742885 DOI: 10.1016/j.beproc.2019.02.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Revised: 02/02/2019] [Accepted: 02/07/2019] [Indexed: 02/07/2023]
Abstract
Among the different procedures that model gambling behavior in non-human animals, the "suboptimal choice procedure" has been extensively employed for analyzing the impact of environmental cues on choice behavior. It has been repeatedly demonstrated that pigeons prefer an alternative that infrequently presents a stimulus that signals a larger amount of reinforcement, than another alternative that always presents a stimulus associated with a smaller amount of reinforcement, even though the net rate of reinforcement is lower in the former. In the present study, we tested rats in the magnitude version of the suboptimal choice procedure. Eight rats were given a choice between two alternatives: a) one in which a stimulus predicting the delivery of ten pellets was presented with probability (p) = 0.2 and a stimulus predicting zero pellets was presented with p = 0.8, and b) one in which either of two stimuli predicted the delivery of three pellets with p = 1.0. Contrary to the consistent and robust suboptimal behavior of pigeons, rats preferred the optimal alternative. This effect occurred despite the high index of discrimination of the stimuli associated with the different outcomes shown by the rats. The relevance of this result to the development of animal models of gambling behavior is discussed.
Collapse
Affiliation(s)
- Vladimir Orduña
- Facultad de Psicología, Universidad Nacional Autónoma de México, México D.F., 04510, Mexico.
| | - Rodrigo Alba
- Facultad de Psicología, Universidad Nacional Autónoma de México, México D.F., 04510, Mexico
| |
Collapse
|