1
|
Kendricks DR, Morrow C, Haste DA, Newland MC. Adult and adolescent antipsychotic exposure increases delay discounting and diminishes behavioral flexibility in male C57BL/6 mice. Pharmacol Biochem Behav 2024; 245:173866. [PMID: 39241867 DOI: 10.1016/j.pbb.2024.173866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Revised: 08/28/2024] [Accepted: 09/03/2024] [Indexed: 09/09/2024]
Abstract
Second-generation antipsychotics are frequently prescribed to adolescents, but the long-term consequences of their use remain understudied. These medications work via monoamine neurotransmitter systems, especially dopamine and serotonin, which undergo considerable development and pruning during adolescence. Dopamine and serotonin are linked to a wide host of behaviors, including impulsive choice and behavioral plasticity. In a murine model of adolescent antipsychotic use, male C57BL/6 mice were exposed to either 2.5 mg/kg/day risperidone or 5 mg/kg/day olanzapine via drinking water from postnatal days 22-60. To determine whether the adolescent period was uniquely sensitive to antipsychotic exposure, long-term effects on behavior were compared to an equivalently exposed group of adults where mice were exposed to 2.5 mg/kg risperidone from postnatal days 101-138. Motor activity and body weight in adolescent animals were assessed. Thirty days after exposure terminated animal's behavioral flexibility and impulsive choice were assessed using spatial discrimination reversal and delay discounting. Antipsychotic exposure produced a modest change in behavior flexibility during the second reversal. There was a robust and reproducible difference in impulsive choice: exposed animals devalued the delayed alternative reward substantially more than controls. This effect was observed both following adolescent and adult exposure, indicating that an irreversible change in impulsive choice occurs regardless of the age of exposure.
Collapse
|
2
|
Craig AR. Resistance to change, of behavior and of theory. J Exp Anal Behav 2023; 120:440-456. [PMID: 37526100 DOI: 10.1002/jeab.875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 07/10/2023] [Indexed: 08/02/2023]
Abstract
The persistence of operant behavior when disrupted tends to be positively related to how often reinforcers were delivered in the past. Behavioral momentum theory describes this finding as the outcome of Pavlovian processes. That is, the relation between discriminative stimuli and reinforcers that were delivered in their presence strengthens behavior, thereby making it more likely to persist. If only the story were that simple. A growing number of findings challenge the basic tenets of behavioral momentum theory. Some even call into question whether Pavlovian relations contribute to persistence in the first place. In this paper, I will review behavioral momentum theory and some of the data that have been problematic for the theory. I will argue that despite these very real challenges, the theory provides important utility not only to basic analyses of response persistence but also to clinical interventions directed at long-term reductions in problem behavior. It, for example, has set the stage for the development of alternative conceptual analyses of resistance to change, two of which will be highlighted for readers. Moreover, behavioral momentum theory may tell us something important about the reasons it continues to have an influence on the field, despite the challenging data that deter it.
Collapse
|
3
|
Madden GJ, Mahmoudi S, Brown K. Pavlovian learning and conditioned reinforcement. J Appl Behav Anal 2023; 56:498-519. [PMID: 37254881 PMCID: PMC10364091 DOI: 10.1002/jaba.1004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Accepted: 05/09/2023] [Indexed: 06/01/2023]
Abstract
Conditioned reinforcers are widely used in applied behavior analysis. Basic research evidence reveals that Pavlovian learning plays an important role in the acquisition and efficacy of new conditioned-reinforcer functions. Thus, a better understanding of Pavlovian principles holds the promise of improving the efficacy of conditioned reinforcement in applied research and practice. This paper surveys how (and if) Pavlovian principles are presented in behavior-analytic textbooks; imprecisions and knowledge gaps within contemporary Pavlovian empirical findings are highlighted. Thereafter, six practical principles of Pavlovian conditioning are presented along with empirical support and knowledge gaps that should be filled by applied and translational behavior-analytic researchers. Innovative applications of these principles are outlined for research in language acquisition, token reinforcement, and self-control.
Collapse
Affiliation(s)
| | - Saba Mahmoudi
- Department of Psychology, Utah State University, Logan, UT, USA
| | - Katherine Brown
- Department of Psychology, Utah State University, Logan, UT, USA
| |
Collapse
|
4
|
Morris SL, Vollmer TR, Dallery J. An evaluation of methods for studying the effects of conditioned reinforcement on human choice. J Exp Anal Behav 2023; 119:476-487. [PMID: 36726294 DOI: 10.1002/jeab.833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 01/16/2023] [Indexed: 02/03/2023]
Abstract
Shahan et al. (2006) found that the relative rate of pigeons' pecking on two observing responses (i.e., responses that only produced an S+ or stimulus correlated with primary reinforcement) was well described by the relative rate of S+ delivery. Researchers have not evaluated the effects of S+ delivery rate in a concurrent observing response procedure with human subjects, so the necessary procedural modifications for studying the effects of conditioned reinforcement on human choice remain unclear. The purpose of the current study was to conduct an additive component analysis of modifications to the procedures of Shahan et al. (2006). We evaluated the additive effects of introducing response cost, a changeover response, and ordinal discriminative stimuli on correspondence with the results of Shahan et al. and the quality of fits of the generalized matching equation. When our procedures were most similar to those of Shahan et al., we observed low rates of observing and indifference between the two observing responses. For the group of subjects with whom all three additive components were included, we obtained the highest level of sensitivity to relative rate of S+ delivery, but the slope and R2 of our fits of the generalized matching equation were still much lower than those obtained by Shahan et al. Potential reasons for these discrepancies, methods of resolving them, and implications for future research are discussed.
Collapse
Affiliation(s)
- Samuel L Morris
- Department of Psychology, Louisiana State University, Baton Rouge, LA, United States
| | - Timothy R Vollmer
- Department of Psychology, University of Florida, Gainesville, FL, United States
| | - Jesse Dallery
- Department of Psychology, University of Florida, Gainesville, FL, United States
| |
Collapse
|
5
|
Hughes CE, Langford JS, Van Heukelom JT, Blejewski RC, Pitts RC. A method for studying reinforcement factors controlling impulsive choice for use in behavioral neuroscience. J Exp Anal Behav 2022; 117:363-383. [PMID: 35506355 DOI: 10.1002/jeab.751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 02/04/2022] [Accepted: 02/09/2022] [Indexed: 02/01/2023]
Abstract
Although procedures originating within the experimental analysis of behavior commonly are used in behavioral neuroscience to produce behavioral endpoints, they are used less often to analyze the behavioral processes involved, particularly at the level of individual organisms (see Soto, 2020). Concurrent-chains procedures have been used extensively to study choice and to quantify relations between various dimensions of reinforcement and preference. Unfortunately, parametric analysis of those relations using traditional steady-state, single-subject experimental designs can be time-consuming, often rendering these procedures impractical for use in behavioral neuroscience. The purpose of this paper is to describe how concurrent-chains procedures can be adapted to allow for parametric examination of effects of the reinforcement dimensions involved in impulsive choice (magnitude and delay) within experimental sessions in rats. Data are presented indicating that this procedure can produce relatively consistent within-session estimates of sensitivity to reinforcement in individual subjects, and that these estimates can be modified by neurobiological manipulation (drug administration). These data suggest that this type of procedure offers a promising approach to the study of neurobiological mechanisms of complex behavior in individual organisms, which could facilitate a more fruitful relationship between behavior analysis and behavioral neuroscience.
Collapse
|
6
|
Edwards DJ. Ensuring Effective Public Health Communication: Insights and Modeling Efforts From Theories of Behavioral Economics, Heuristics, and Behavioral Analysis for Decision Making Under Risk. Front Psychol 2021; 12:715159. [PMID: 34721162 PMCID: PMC8548420 DOI: 10.3389/fpsyg.2021.715159] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 09/21/2021] [Indexed: 01/08/2023] Open
Abstract
Public health (PH) messaging can have an enormous impact on shaping how individuals within society behave, and can ensure it is in a safe and responsible way, consistent with up-to-date evidence-based PH guidelines. If done effectively, messaging can save lives and improve the health of those within society. However, unfortunately, those within Government PH bodies typically have little training about how to effectively represent PH messages in a way that is consistent with psychological theories of cognitive bias, in order to avoid cognitively biasing the public through their messages. As a result of this, inadequate representation of PH messages can result, which can often lead to cognitive bias in those from the public who read or listen to the message information. This can lead to poor decision making of the pubic as a whole, which can then further lead to harm and even death of public members as a result of these poor decisions. One way to minimize the problem of bias in decision making is to explore psychology theories that model how bias can occur from PH messaging, and identify ways in which PH agencies can utilize such approaches to improve the effectiveness of their messages. Previous focus has been largely on behavioral economic theories, however, here, other accounts are offered in addition to these. These include theories of heuristics and theories from the behavior analysis domain, which may increase the predictive power of modeling bias, and have applications for how best to represent PH message information which minimize bias.
Collapse
Affiliation(s)
- Darren J. Edwards
- Department of Public Health, Policy, and Social Sciences, Swansea University, Swansea, United Kingdom
| |
Collapse
|
7
|
Yates JR, Prior NA, Chitwood MR, Day HA, Heidel JR, Hopkins SE, Muncie BT, Paradella-Bradley TA, Sestito AP, Vecchiola AN, Wells EE. Using a dependent schedule to measure risky choice in male rats: Effects of d-amphetamine, methylphenidate, and methamphetamine. Exp Clin Psychopharmacol 2020; 28:181-195. [PMID: 31120280 PMCID: PMC7317298 DOI: 10.1037/pha0000300] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Risky choice is the tendency to choose a large, uncertain reward over a small, certain reward, and is typically measured with probability discounting, in which the probability of obtaining the large reinforcer decreases across blocks of trials. One caveat to traditional procedures is that independent schedules are used, in which subjects can show exclusive preference for one alternative relative to the other. For example, some rats show exclusive preference for the small, certain reinforcer as soon as delivery of the large reinforcer becomes probabilistic. Therefore, determining if a drug increases risk aversion (i.e., decreases responding for the probabilistic alternative) is difficult (due to floor effects). The overall goal of this experiment was to use a concurrent-chains procedure that incorporated a dependent schedule during the initial link, thus preventing animals from showing exclusive preference for one alternative relative to the other. To determine how pharmacological manipulations alter performance in this task, male Sprague-Dawley rats (n = 8) received injections of amphetamine (0, 0.25, 0.5, 1.0 mg/kg), methylphenidate (0, 0.3, 1.0, 3.0 mg/kg), and methamphetamine (0, 0.5, 1.0, 2.0 mg/kg). Amphetamine (0.25 mg/kg) and methylphenidate (3.0 mg/kg) selectively increased risky choice, whereas higher doses of amphetamine (0.5 and 1.0 kg/mg) and each dose of methamphetamine impaired stimulus control (i.e., flattened the discounting function). These results show that dependent schedules can be used to measure risk-taking behavior and that psychostimulants promote suboptimal choice when this schedule is used. (PsycInfo Database Record (c) 2020 APA, all rights reserved).
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Emily E Wells
- Department of Psychological and Brain Sciences, University of Louisville
| |
Collapse
|
8
|
Bai JYH, Cowie S, Macaskill AC, Elliffe D, Podlesnik CA. Assessing potential reinforcement‐like effects of brief stimuli unrelated to food reinforcers. J Exp Anal Behav 2020; 113:363-389. [DOI: 10.1002/jeab.580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2019] [Revised: 09/27/2019] [Accepted: 11/20/2019] [Indexed: 11/08/2022]
|
9
|
Grace RC, McLean AP. Effects of schedule type and terminal-link duration on choice in concurrent chains. J Exp Anal Behav 2019; 112:192-209. [PMID: 31407355 DOI: 10.1002/jeab.544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2019] [Accepted: 07/18/2019] [Indexed: 11/08/2022]
Abstract
Four pigeons responded in a three-component concurrent chains procedure in which the terminal-link schedules were either both fixed-interval (FI FI), both variable-interval (VI VI), both mixed-interval (MI MI) or variable-interval fixed-interval (VI FI). Across components within sessions, overall terminal-link duration was varied while schedule types were varied across conditions. For the conditions with homogeneous schedules, the strongest preference was obtained with FI FI, intermediate with MI MI, and weakest with VI VI. Preference increased with overall terminal-link duration for all schedules, but the increase was more rapid for VI FI. The hyperbolic value-added model (HVA) and cumulative decision model (CDM) were fitted to the choice data and accounted for 83.9% and 76.1% of the variance, respectively. However, deviations from both models' predictions were systematic. A modification of the CDM, which assumed that reinforcers delivered after variable delays were more effective, improved the fit of the model and substantially reduced the systematic deviations in residuals. Comparable modifications of HVA produced only limited improvement. Results show that preference in concurrent chains with homogeneous terminal links depends on the degree of variability in reinforcer delays, and that the CDM can provide an excellent account of results across different terminal-link schedules.
Collapse
|
10
|
Greer BD, Shahan TA. Resurgence as Choice: Implications for promoting durable behavior change. J Appl Behav Anal 2019; 52:816-846. [PMID: 31049954 PMCID: PMC6625346 DOI: 10.1002/jaba.573] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Accepted: 03/20/2019] [Indexed: 11/07/2022]
Abstract
Resurgence is an increase in a previously suppressed behavior resulting from a worsening in reinforcement conditions for current behavior. Resurgence is often observed following successful treatment of problem behavior with differential reinforcement when reinforcement for an alternative behavior is subsequently omitted or reduced. The efficacy of differential reinforcement has long been conceptualized in terms of quantitative models of choice between concurrent operants (i.e., the matching law). Here, we provide an overview of a novel quantitative model of resurgence called Resurgence as Choice (RaC), which suggests that resurgence results from these same basic choice processes. We review the failures of the only other quantitative model of resurgence (i.e., Behavioral Momentum Theory) and discuss its shortcomings with respect to the limited range of circumstances about which it makes predictions in applied settings. Finally, we describe how RaC overcomes these shortcomings and discuss implications of the model for promoting durable behavior change.
Collapse
Affiliation(s)
- Brian D Greer
- University of Nebraska Medical Center's Munroe-Meyer Institute
| | | |
Collapse
|
11
|
Grace RC. Preference, resistance to change, and the cumulative decision model. J Exp Anal Behav 2018; 109:33-47. [DOI: 10.1002/jeab.305] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2017] [Accepted: 12/13/2017] [Indexed: 11/11/2022]
|
12
|
Aparicio CF, Hennigan PJ, Mulligan LJ, Alonso-Alvarez B. Spontaneously hypertensive (SHR) rats choose more impulsively than Wistar-Kyoto (WKY) rats on a delay discounting task. Behav Brain Res 2017; 364:480-493. [PMID: 28963043 DOI: 10.1016/j.bbr.2017.09.040] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2017] [Revised: 09/19/2017] [Accepted: 09/25/2017] [Indexed: 11/29/2022]
Abstract
Indications of Attention Deficit Hyperactivity Disorder (ADHD) in the spontaneously hypertensive rat (SHR) are not consistent across different tests of impulsivity, questioning the SHR's validity as a rodent model of ADHD. This study used a concurrent-chains procedure to examine possible differences in impulsive choice between SHRs and control-normotensive Wistar Kyoto (WKY) rats. The aim was to extend the generality of findings showing regularities between the hyperbolic-decay model and the generalized matching law fitting delay discounting data from nonhuman animals. The objectives were to: (1) examine differences in impulsive choice between SHRs and WKYs; (2) add evidence suggesting that the SHR is a suitable model of ADHD; (3) demonstrate that concurrent-chains procedures requiring locomotion detect differences in impulsive choice between SHRs and WKYs; (4) support the idea that impulsivity in nonhuman animals increases with training. The initial link used two non-independent random interval schedules arranging entries to the terminal links, where one fixed-time (FT) schedule delayed 1-food pellet and the other FT 4-food pellets. The FT delaying the former was kept constant at 0.1s and that delaying the latter changed after every 10 food deliveries, defining six delay components (0.1, 5, 10, 20, 40, and 80s) presented in random order each session. Results showed that the SHRs choose more impulsively than the WKYs, adding to the body of evidence suggesting that the SHR may be a suitable model of ADHD. Both models of choice fitted the impulsive choices of the SHRs and WKYs well; positive correlations between estimates of parameters k and s suggested compatibility between models of choice showing that impulsivity increases with training.
Collapse
|
13
|
Gambling-like behavior in pigeons: 'jackpot' signals promote maladaptive risky choice. Sci Rep 2017; 7:6625. [PMID: 28747679 PMCID: PMC5529572 DOI: 10.1038/s41598-017-06641-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Accepted: 06/16/2017] [Indexed: 11/24/2022] Open
Abstract
Individuals often face choices that have uncertain outcomes and have important consequences. As a model of this environment, laboratory experiments often offer a choice between an uncertain, large reward that varies in its probability of delivery against a certain but smaller reward as a measure of an individual’s risk aversion. An important factor generally lacking from these procedures are gambling related cues that may moderate risk preferences. The present experiment offered pigeons choices between unreliable and certain rewards but, for the Signaled group on winning choices, presented a ‘jackpot’ signal prior to reward delivery. The Unsignaled group received an ambiguous stimulus not informative of choice outcomes. For the Signaled group, presenting win signals effectively blocked value discounting for the large, uncertain outcome as the probability of a loss increased, whereas the Unsignaled group showed regular preference changes similar to previous research lacking gambling related cues. These maladaptive choices were further shown to be unaffected by more salient loss signals and resistant to response cost increases. The results suggest an important role of an individual’s sensitivity to outcome-correlated cues in influencing risky choices that may moderate gambling behaviors in humans, particularly in casino and other gambling-specific environments.
Collapse
|
14
|
Moore J, Friedlen KE. Choice Behavior in Pigeons Maintained With Probabilistic Schedules of Reinforcement. PSYCHOLOGICAL RECORD 2017. [DOI: 10.1007/bf03395580] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
15
|
A Critical Appraisal of Contemporary Approaches in the Quantitative Analysis of Behavior. PSYCHOLOGICAL RECORD 2017. [DOI: 10.1007/bf03395642] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
16
|
Chow JJ, Smith AP, Wilson AG, Zentall TR, Beckmann JS. Suboptimal choice in rats: Incentive salience attribution promotes maladaptive decision-making. Behav Brain Res 2017; 320:244-254. [PMID: 27993692 PMCID: PMC5241164 DOI: 10.1016/j.bbr.2016.12.013] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2016] [Revised: 12/10/2016] [Accepted: 12/12/2016] [Indexed: 11/27/2022]
Abstract
Stimuli that are more predictive of subsequent reward also function as better conditioned reinforcers. Moreover, stimuli attributed with incentive salience function as more robust conditioned reinforcers. Some theories have suggested that conditioned reinforcement plays an important role in promoting suboptimal choice behavior, like gambling. The present experiments examined how different stimuli, those attributed with incentive salience versus those without, can function in tandem with stimulus-reward predictive utility to promote maladaptive decision-making in rats. One group of rats had lights associated with goal-tracking as the reward-predictive stimuli and another had levers associated with sign-tracking as the reward-predictive stimuli. All rats were first trained on a choice procedure in which the expected value across both alternatives was equivalent but differed in their stimulus-reward predictive utility. Next, the expected value across both alternatives was systematically changed so that the alternative with greater stimulus-reward predictive utility was suboptimal in regard to primary reinforcement. The results demonstrate that in order to obtain suboptimal choice behavior, incentive salience alongside strong stimulus-reward predictive utility may be necessary; thus, maladaptive decision-making can be driven more by the value attributed to stimuli imbued with incentive salience that reliably predict a reward rather than the reward itself.
Collapse
Affiliation(s)
- Jonathan J Chow
- Department of Psychology, University of Kentucky, 741 S. Limestone, Lexington, KY, 40536 USA.
| | - Aaron P Smith
- Department of Psychology, University of Kentucky, 741 S. Limestone, Lexington, KY, 40536 USA.
| | - A George Wilson
- Department of Psychology, University of Kentucky, 741 S. Limestone, Lexington, KY, 40536 USA; Department of Neurosciences, Medical University of South Carolina, 173 Ashley Avenue, Charleston, SC, 29425 USA.
| | - Thomas R Zentall
- Department of Psychology, University of Kentucky, 741 S. Limestone, Lexington, KY, 40536 USA.
| | - Joshua S Beckmann
- Department of Psychology, University of Kentucky, 741 S. Limestone, Lexington, KY, 40536 USA.
| |
Collapse
|
17
|
Abstract
Stimuli associated with primary reinforcement for instrumental behavior are widely believed to acquire the capacity to function as conditioned reinforcers via Pavlovian conditioning. Some Pavlovian conditioning studies suggest that animals learn the important temporal relations between stimuli and integrate such temporal information over separate experiences to form a temporal map. The present experiment examined whether Pavlovian conditioning can establish a positive instrumental conditioned reinforcer through such temporal integration. Two groups of rats received either delay or trace appetitive conditioning in which a neutral stimulus predicted response-independent food deliveries (CS1→US). Both groups then experienced one session of backward second-order conditioning of the training CS1 and a novel CS2 (CS1-CS2 pairing). Finally, the ability of CS2 to function as a conditioned reinforcer for a new instrumental response (leverpressing) was assessed. Consistent with the previous demonstrations of temporal integration in fear conditioning, a CS2 previously trained in a trace-conditioning protocol served as a better instrumental conditioned reinforcer after backward second-order conditioning than did a CS2 previously trained in a delay protocol. These results suggest that an instrumental conditioned reinforcer can be established via temporal integration and raise challenges for existing quantitative accounts of instrumental conditioned reinforcement.
Collapse
|
18
|
Effects of Reinforcement Context on Initial Link Responding in Concurrent Chain Reinforcement Schedules. PSYCHOLOGICAL RECORD 2016. [DOI: 10.1007/s40732-016-0204-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
19
|
Pisklak JM, McDevitt MA, Dunn RM, Spetch ML. When good pigeons make bad decisions: Choice with probabilistic delays and outcomes. J Exp Anal Behav 2016; 104:241-51. [PMID: 26676182 DOI: 10.1002/jeab.177] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2015] [Accepted: 11/03/2015] [Indexed: 11/06/2022]
Abstract
Pigeons chose between an (optimal) alternative that sometimes provided food after a 10-s delay and other times after a 40-s delay and another (suboptimal) alternative that sometimes provided food after 10 s but other times no food after 40 s. When outcomes were not signaled during the delays, pigeons strongly preferred the optimal alternative. When outcomes were signaled, choices of the suboptimal alternative increased and most pigeons preferred the alternative that provided no food after the long delay despite the cost in terms of obtained food. The pattern of results was similar whether the short delays occurred on 25% or 50% of the trials. Shortening the 40-s delay to food sharply reduced suboptimal choices, but shortening the delay to no food had little effect. The results suggest that a signaled delay to no food does not punish responding in probabilistic choice procedures. The findings are discussed in terms of conditioned reinforcement by signals for good news.
Collapse
|
20
|
Abstract
Experimental psychologists in the 1930s erred badly in interpreting Bridgman (1927/1960) to mean that a theoretical construct was scientifically valid given a single set of operations. This mistake was corrected in the 1950s with the adoption of a multiple or convergent operationism, which stimulated the development of cognitive psychology and remains widely used to this day. The major criticisms made by Bickhard, Green, Leahey and Rogers (all 2001) attempt to rebut my (Grace, 2001) supposed claim that converging operationism solved the philosophical difficulties inherent in operationism as a thoroughgoing account of scientific meaning. But I made no such claim. Instead, I argue that it is legitimate to assess operationism pragmatically, in a research context. On this view, reasons for the tenacity of operationism in psychology are clear: the use of operational definitions facilitates clarity of communication and helps scientists to formulate empirical tests of theories.
Collapse
|
21
|
Smith AP, Bailey AR, Chow JJ, Beckmann JS, Zentall TR. Suboptimal Choice in Pigeons: Stimulus Value Predicts Choice over Frequencies. PLoS One 2016; 11:e0159336. [PMID: 27441394 PMCID: PMC4956316 DOI: 10.1371/journal.pone.0159336] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Accepted: 06/30/2016] [Indexed: 11/24/2022] Open
Abstract
Pigeons have shown suboptimal gambling-like behavior when preferring a stimulus that infrequently signals reliable reinforcement over alternatives that provide greater reinforcement overall. As a mechanism for this behavior, recent research proposed that the stimulus value of alternatives with more reliable signals for reinforcement will be preferred relatively independently of their frequencies. The present study tested this hypothesis using a simplified design of a Discriminative alternative that, 50% of the time, led to either a signal for 100% reinforcement or a blackout period indicative of 0% reinforcement against a Nondiscriminative alternative that always led to a signal that predicted 50% reinforcement. Pigeons showed a strong preference for the Discriminative alternative that remained despite reducing the frequency of the signal for reinforcement in subsequent phases to 25% and then 12.5%. In Experiment 2, using the original design of Experiment 1, the stimulus following choice of the Nondiscriminative alternative was increased to 75% and then to 100%. Results showed that preference for the Discriminative alternative decreased only when the signals for reinforcement for the two alternatives predicted the same probability of reinforcement. The ability of several models to predict this behavior are discussed, but the terminal link stimulus value offers the most parsimonious account of this suboptimal behavior.
Collapse
Affiliation(s)
- Aaron P. Smith
- Department of Psychology, University of Kentucky, Lexington, Kentucky, United States of America
- * E-mail:
| | - Alexandria R. Bailey
- Department of Psychology, University of Kentucky, Lexington, Kentucky, United States of America
| | - Jonathan J. Chow
- Department of Psychology, University of Kentucky, Lexington, Kentucky, United States of America
| | - Joshua S. Beckmann
- Department of Psychology, University of Kentucky, Lexington, Kentucky, United States of America
| | - Thomas R. Zentall
- Department of Psychology, University of Kentucky, Lexington, Kentucky, United States of America
| |
Collapse
|
22
|
McDowell JJ, Calvin OL, Klapes B. A survey of residual analysis and a new test of residual trend. J Exp Anal Behav 2016; 105:445-58. [DOI: 10.1002/jeab.208] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2015] [Accepted: 04/17/2016] [Indexed: 11/11/2022]
|
23
|
Grace RC. Reprint of "Acquisition of choice in concurrent chains: Assessing the cumulative decision model". Behav Processes 2016; 127:74-85. [PMID: 27150444 DOI: 10.1016/j.beproc.2016.04.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2015] [Revised: 03/14/2016] [Accepted: 03/17/2016] [Indexed: 11/27/2022]
Abstract
Concurrent chains is widely used to study pigeons' choice between terminal links that can vary in delay, magnitude, or probability of reinforcement. We review research on the acquisition of choice in this procedure. Acquisition has been studied with a variety of research designs, and some studies have incorporated no-food trials to allow for timing and choice to be observed concurrently. Results show that: Choice can be acquired rapidly within sessions when terminal links change unpredictably; under steady-state conditions, acquisition depends on both initial- and terminal-link schedules; and initial-link responding is mediated by learning about the terminal-link stimulus-reinforcer relations. The cumulative decision model (CDM) proposed by Christensen and Grace (2010) and Grace and McLean (2006, 2015) provides a good description of within-session acquisition, and correctly predicts the effects of initial and terminal-link schedules in steady-state designs (Grace, 2002a). Questions for future research include how abrupt shifts in preference within individual sessions and temporal control of terminal-link responding can be modeled.
Collapse
|
24
|
Grace RC. Acquisition of choice in concurrent chains: Assessing the cumulative decision model. Behav Processes 2016; 126:82-93. [DOI: 10.1016/j.beproc.2016.03.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2015] [Revised: 03/14/2016] [Accepted: 03/17/2016] [Indexed: 11/29/2022]
|
25
|
Subramaniam S, Kyonka EG. Environmental dynamics modulate covariation of choice and timing. Behav Processes 2016; 124:130-40. [DOI: 10.1016/j.beproc.2016.01.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2015] [Revised: 01/13/2016] [Accepted: 01/17/2016] [Indexed: 11/25/2022]
|
26
|
Shahan TA, Cunningham P. Conditioned reinforcement and information theory reconsidered. J Exp Anal Behav 2016; 103:405-18. [PMID: 25766452 DOI: 10.1002/jeab.142] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2014] [Accepted: 01/28/2015] [Indexed: 11/06/2022]
Abstract
The idea that stimuli might function as conditioned reinforcers because of the information they convey about primary reinforcers has a long history in the study of learning. However, formal application of information theory to conditioned reinforcement has been largely abandoned in modern theorizing because of its failures with respect to observing behavior. In this paper we show how recent advances in the application of information theory to Pavlovian conditioning offer a novel approach to conditioned reinforcement. The critical feature of this approach is that calculations of information are based on reductions of uncertainty about expected time to primary reinforcement signaled by a conditioned reinforcer. Using this approach, we show that previous failures of information theory with observing behavior can be remedied, and that the resulting framework produces predictions similar to Delay Reduction Theory in both observing-response and concurrent-chains procedures. We suggest that the similarity of these predictions might offer an analytically grounded reason for why Delay Reduction Theory has been a successful theory of conditioned reinforcement. Finally, we suggest that the approach provides a formal basis for the assertion that conditioned reinforcement results from Pavlovian conditioning and may provide an integrative approach encompassing both domains.
Collapse
|
27
|
Orduña V. Impulsivity and sensitivity to amount and delay of reinforcement in an animal model of ADHD. Behav Brain Res 2015. [PMID: 26225844 DOI: 10.1016/j.bbr.2015.07.046] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Previous research has been inconclusive about the degree of impulsivity displayed by spontaneously hypertensive rats (SHR), an animal model of Attention Deficit Hyperactivity Disorder (ADHD). In the present set of experiments, concurrent-chains schedules were employed in order to explore SHR's impulsivity, sensitivity to delay, and sensitivity to amount of reinforcement; Wistar rats (WIS) were used as comparison group. In the three experiments - performed with different subjects - non-independent variable interval 30s schedules were presented in the initial links; the difference between experiments was in the terminal links. For exploring impulsivity, one of the terminal links (SS) was associated to a short delay (2s) and a small reinforcer (1 pellet), whereas the other terminal link (LL) was associated to a longer delay (28s) and a larger reinforcer (4 pellets). The results indicated a remarkably higher impulsivity in SHR. Because this impulsivity may have as potential mechanisms an increased sensitivity to delay and/or a decreased sensitivity to the amount of reinforcement, in experiments 2 and 3 these possibilities were examined. For assessing sensitivity to delay, the following pairs of fixed interval (FI) schedules were used in the terminal links in five conditions: 2-28, 6-24, 15-15, 24-6, 28-2s; the magnitude of reinforcement was 1 pellet in all conditions for both alternatives. For assessing sensitivity to amount, in five conditions the alternatives were associated with different magnitudes of reinforcement: 1-5 pellets, 2-4, 3-3, 4-2 and 5-1 in left-right alternatives, respectively; the delay to reinforcement was controlled by a FI 15s in all conditions and for both alternatives. The sensitivity to delay and the sensitivity to amount were calculated according to the Generalized Matching Law. The results indicated a higher sensitivity to delay in SHR, and the same sensitivity to amount in SHR and WIS rats. These results suggest that the increased sensitivity to delay influences the high level of impulsivity observed in SHR.
Collapse
Affiliation(s)
- Vladimir Orduña
- Facultad de Psicología, Universidad Nacional Autónoma de México, México, D.F. 04510, Mexico.
| |
Collapse
|
28
|
The learning of action sequences through social transmission. Anim Cogn 2015; 18:1093-103. [PMID: 26006723 DOI: 10.1007/s10071-015-0877-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Revised: 04/07/2015] [Accepted: 05/11/2015] [Indexed: 10/23/2022]
Abstract
Previous empirical work on animal social learning has found that many species lack the ability to learn entire action sequences solely through reliance on social information. Conversely, acquiring action sequences through asocial learning can be difficult due to the large number of potential sequences arising from even a small number of base actions. In spite of this, several studies report that some primates use action sequences in the wild. We investigate how social information can be integrated with asocial learning to facilitate the learning of action sequences. We formalize this problem by examining how learners using temporal difference learning, a widely applicable model of reinforcement learning, can combine social cues with their own experiences to acquire action sequences. The learning problem is modeled as a Markov decision process. The learning of nettle processing by mountain gorillas serves as a focal example. Through simulations, we find that the social facilitation of component actions can combine with individual learning to facilitate the acquisition of action sequences. Our analysis illustrates that how even simple forms of social learning, combined with asocial learning, generate substantially faster learning of action sequences compared to asocial processes alone, and that the benefits of social information increase with the length of the action sequence and the number of base actions.
Collapse
|
29
|
|
30
|
Moore J. Pragmatism, mathematical models, and the scientific ideal of prediction and control. Behav Processes 2015; 114:2-13. [PMID: 25596451 DOI: 10.1016/j.beproc.2015.01.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2014] [Revised: 12/09/2014] [Accepted: 01/13/2015] [Indexed: 11/26/2022]
Abstract
Mathematical models are often held to be valuable, if not necessary, for theories and explanations in the quantitative analysis of behavior. The present review suggests that mathematical models primarily derived from the observation of functional relations do indeed contribute to the scientific value of theories and explanations, even though the final form of the models appears to be highly abstract. However, mathematical models not primarily so derived risk being essentialist in character, based on a particular view of formal causation. Such models invite less effective and frequently mentalistic theories and explanations of behavior. Models may be evaluated in terms of both (a) the verbal processes responsible for their origin and development and (b) the prediction and control engendered by the theories and explanations that incorporate the models, however indirect or abstract that prediction and control may be. Overall, the present review suggests that technological application and theoretical contemplation may be usefully viewed as continuous and overlapping forms of scientific activity, rather than dichotomous and mutually exclusive.
Collapse
Affiliation(s)
- J Moore
- UW-Milwaukee, Department of Psychology, 2441 E Hartford, Milwaukee, WI 53201, USA.
| |
Collapse
|
31
|
Kyonka EG. Quantifying transitions in response allocation with change point analysis in concurrent chains. Behav Processes 2014; 104:91-8. [DOI: 10.1016/j.beproc.2014.02.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2013] [Revised: 02/11/2014] [Accepted: 02/14/2014] [Indexed: 10/25/2022]
|
32
|
Davidson A, Jensen M, Burgess E, Stevens A, Hayes L, Sieweke S, Stough K, Wright A, McCarty R, Eddleman L, Kim YI, Milby JB, Schumacher JE. Effects of therapeutic goal management (TGM) on treatment attendance and drug abstinence among men with co-occurring substance use and axis I mental disorders who are homeless: results of the Birmingham EARTH program. Addict Sci Clin Pract 2013; 8:17. [PMID: 24499617 PMCID: PMC3829098 DOI: 10.1186/1940-0640-8-17] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2013] [Accepted: 10/21/2013] [Indexed: 11/21/2022] Open
Abstract
Purpose This study describes the implementation and impact of Therapeutic Goal Management (TGM) in a Substance Abuse and Mental Health Services Administration (SAMHSA)-sponsored demonstration project entitled Enhanced Addiction Recovery through Housing (EARTH). Participants The sample included 28 male participants followed at six months who completed some treatment. Forty-three percent were Caucasian, and 57% were African American. The average age of participants was 42 years. Design The relationships between TGM goal achievement, treatment attendance, and drug abstinence outcomes were studied among EARTH program participants who were homeless and met criteria for co-occurring substance use and severe DSM-IV Axis I mental disorders. Results The results revealed an overall drug abstinence rate of 72.4% over six months and significant positive relationships between TGM goal achievement and drug abstinence (r = 0.693) and TGM goal achievement and treatment attendance (r = 0.843). Conclusions This research demonstrated the relationship and potential positive impact of systematically setting, monitoring, and reinforcing personalized goals in multiple life areas on drug abstinence and treatment attendance outcomes among persons who are homeless with co-occurring substance use and other Axis I disorders in a integrated community service delivery program.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | - Joseph E Schumacher
- Division of Preventive Medicine, University of Alabama at Birmingham, 1717 11th Avenue South, MT 616, Birmingham, AL 35205, USA.
| |
Collapse
|
33
|
Hutsell BA, Jacobs EA. Attention and psychophysics in the development of stimulus control. J Exp Anal Behav 2013; 100:282-300. [DOI: 10.1002/jeab.54] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2013] [Accepted: 07/23/2013] [Indexed: 11/08/2022]
|
34
|
Beeby E, White KG. Preference reversal between impulsive and self-control choice. J Exp Anal Behav 2013; 99:260-76. [PMID: 23440893 DOI: 10.1002/jeab.23] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2012] [Accepted: 02/01/2013] [Indexed: 11/08/2022]
Abstract
In a concurrent-chains procedure, pigeons chose between reinforcers varying in delay and amount. Reinforcer amount was determined by duration of access to grain, and delay was determined by fixed-interval schedules in the terminal links. Preference was measured by the ratio of responses in initial links. Dependent scheduling of variable-interval schedules in initial links ensured that delay and amount were not confounded with frequency of outcomes, which remained equal for the two choices. In Experiment 1, in components signaled by red keys in the initial links, small and large reinforcers were delivered after delays of 1 s and 10 s respectively. In components signaled by green, additional time was added to both delays. Smaller-sooner reinforcers were preferred in red components. In green components, smaller-sooner reinforcers were preferred at short delays, and choices for the larger-later reinforcer generally increased with increasing duration of the added delay. At longer delays, up to 15 s, the larger-later reinforcer was preferred. That is, the pigeons showed within-session preference reversal, with impulsive choice at short delays in red components and self-control choice at long delays in green components. In Experiment 2, added delay to both reinforcement and reinforcer amount were varied. Sensitivity of initial-link response ratios to ratios of amount increased with increasing duration of the added delay. This interaction between delay and amount was predicted if the temporal discounting functions assumed the magnitude effect in which discounting rate was inversely proportional to amount. It was also predicted by the contextual choice model of performance in concurrent-chains procedures.
Collapse
Affiliation(s)
- Emma Beeby
- Department of Psychology, University of Otago, Dunedin 9054, New Zealand.
| | | |
Collapse
|
35
|
Jimenez-Gomez C, Shahan TA. Concurrent-chains schedules as a method to study choice between alcohol-associated conditioned reinforcers. J Exp Anal Behav 2012; 97:71-83. [PMID: 22287805 DOI: 10.1901/jeab.2012.97-71] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2011] [Accepted: 09/14/2011] [Indexed: 11/22/2022]
Abstract
An extensive body of research using concurrent-chains schedules of reinforcement has shown that choice for one of two differentially valued food-associated stimuli is dependent upon the overall temporal context in which those stimuli are embedded. The present experiments examined whether the concurrent chains procedure was useful for the study of behavior maintained by alcohol and alcohol-associated stimuli. In Experiment 1, rats responded on concurrent-chains schedules with equal variable-interval (VI) 10-s schedules in the initial links. Across conditions, fixed-interval schedules in the terminal links were varied to yield 1∶1, 9∶1, and 1∶9 ratios of alcohol delivery. Initial-link response rates reflected changes in terminal-link schedules, with greater relative responding in the rich terminal link. In Experiment 2, terminal-link schedules remained constant with a 9∶1 ratio of alcohol delivery rates while the length of two equal duration initial-link schedules was varied. Preference for the rich terminal link was less extreme when initial links were longer (i.e., the initial-link effect), as has been previously reported with food reinforcers. This result suggests that the conditioned reinforcing value of an alcohol-associated stimulus depends on the temporal context in which it is embedded. The concurrent-chains procedure and quantitative models of concurrent chains performance may provide a useful framework within which to study how contextual variables modulate preference for drug-associated conditioned reinforcers.
Collapse
Affiliation(s)
- Corina Jimenez-Gomez
- Department of Psychology, The University of Auckland, Private Bag 92019, Victoria Street West, Auckland 1142, New Zealand.
| | | |
Collapse
|
36
|
Mazur JE. Effects of pre-trial response requirements on self-control choices by rats and pigeons. J Exp Anal Behav 2012; 97:215-30. [PMID: 22389527 PMCID: PMC3292232 DOI: 10.1901/jeab.2012.97-215] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2011] [Accepted: 12/05/2011] [Indexed: 11/22/2022]
Abstract
Parallel experiments with rats and pigeons examined whether the size of a pre-trial ratio requirement would affect choices in a self-control situation. In different conditions, either 1 response or 40 responses were required before each trial. In the first half of each experiment, an adjusting-ratio schedule was used, in which subjects could choose a fixed-ratio schedule leading to a small reinforcer, or an adjusting-ratio schedule leading to a larger reinforcer. The size of the adjusting ratio requirement was increased and decreased over trials based on the subject's responses, in order to estimate an indifference point-a ratio at which the two alternatives were chosen about equally often. The second half of each experiment used an adjusting-delay procedure-fixed and adjusting delays to the small and large reinforcers were used instead of ratio requirements. In some conditions, particularly with the reinforcer delays, the rats had consistently longer adjusting delays with the larger pre-trial ratios, reflecting a greater tendency to choose the larger, delayed reinforcer when more responding was required to reach the choice point. No consistent effects of the pre-trial ratio were found for the pigeons in any of the conditions. These results may indicate that rats are more sensitive to the long-term reinforcement rates of the two alternatives, or they may result from a shallower temporal discounting rate for rats than for pigeons, a difference that has been observed in previous studies.
Collapse
Affiliation(s)
- James E Mazur
- Psychology Department, Southern Connecticut State University, New Haven, CT 06515, USA.
| |
Collapse
|
37
|
Hutsell BA, Jacobs EA. Rapid acquisition of bias in signal detection: dynamics of effective reinforcement allocation. J Exp Anal Behav 2012; 97:29-49. [PMID: 22287803 DOI: 10.1901/jeab.2012.97-29] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2010] [Accepted: 09/20/2011] [Indexed: 11/22/2022]
Abstract
We investigated changes in bias (preference for one response alternative) in signal detection when relative reinforcer frequency for correct responses varied across sessions. In Experiment 1, 4 rats responded in a two-stimulus, two-response identification procedure employing temporal stimuli (short vs. long houselight presentations). Relative reinforcer frequency varied according to a 31-step pseudorandom binary sequence and stimulus duration difference varied over two values across conditions. In Experiment 2, 3 rats responded in a five-stimulus, two-response classification procedure employing temporal stimuli. Relative reinforcer frequency was varied according to a 36-step pseudorandom ternary sequence. Results of both experiments were analyzed according to a behavioral model of detection. The model was extended to incorporate the effects of current and previous session reinforcer frequency ratios on current-session performance. Similar to findings with concurrent schedules, effects on bias of relative reinforcer frequency were highest for the current session. However, carryover from reinforcer ratios of previous sessions was evident. Generally, the results indicate that bias can come under control of frequent changes in relative reinforcer frequency in both identification and classification procedures.
Collapse
Affiliation(s)
- Blake A Hutsell
- Department of Psychology, 226 Thach Hall, 342 W. Thach Ave, Auburn University, Auburn, Alabama 36849-5212, USA.
| | | |
Collapse
|
38
|
Podlesnik CA, Jimenez-Gomez C, Thrailkill EA, Shahan TA. Temporal context, preference, and resistance to change. J Exp Anal Behav 2012; 96:191-213. [PMID: 21909164 DOI: 10.1901/jeab.2011.96-191] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2011] [Accepted: 04/15/2011] [Indexed: 10/28/2022]
Abstract
According to behavioral momentum theory, preference and relative resistance to change in concurrent-chains schedules are correlated and reflect the relative conditioned value of discriminative stimuli. In the present study, we explore the generality of this relation by manipulating the temporal context within a concurrent-chains procedure through changes in the duration of the initial links. Consistent with previous findings, preference for a richer terminal link was less extreme with longer initial links across three experiments with pigeons. In Experiment 1, relative resistance to change and preference were related inversely when responding was disrupted with response-independent food presentations during initial links, replicating a previous finding with rats. However, more food was presented with longer initial links, confounding the disrupter and initial-link duration. In Experiment 2, presession feeding was used instead and eliminated the negative relation between relative resistance to change and preference, but relative resistance to change was not sensitive to relative terminal-link reinforcement rates. In Experiment 3, with more extreme relative terminal-link reinforcement rates, increasing initial-link duration similarly decreased preference and relative resistance to change for the richer terminal link. Thus, when conditions of disruption are equal and assessed under the appropriate reinforcement conditions, changes in temporal context impact relative resistance to change and preference similarly.
Collapse
|
39
|
Abstract
Psychologists have long been intrigued with the rationales that underlie our decisions. Similarly, the concept of conditioned reinforcement has a venerable history, particularly in accounting for behavior not obviously maintained by primary reinforcers. The studies of choice and of conditioned reinforcement have often developed in lockstep. Many contemporary approaches to these fundamental topics share an emphasis on context and on relative value. We trace the evolution of thinking about the potency of conditioned reinforcers from stimuli that were thought to acquire their value from pairings with more fundamental reinforcers to stimuli that acquire their value by being differentially correlated with these more fundamental reinforcers. We discuss some seminal experiments (including several that have been underappreciated) and some ongoing data, all of which have propelled us to the conclusion that the strength of conditioned reinforcers is determined by their signaling a relative improvement in the organism's relation to reinforcement.
Collapse
|
40
|
Mattson KM, Hucks A, Grace RC, McLean AP. Signaled and unsignaled terminal links in concurrent chains I: effects of reinforcer probability and immediacy. J Exp Anal Behav 2011; 94:327-52. [PMID: 21541175 DOI: 10.1901/jeab.2010.94-327] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2010] [Accepted: 08/17/2010] [Indexed: 10/18/2022]
Abstract
Eight pigeons responded in a three-component concurrent-chains procedure, with either independent or dependent initial links. Relative probability and immediacy of reinforcement in the terminal links were both varied, and outcomes on individual trials (reinforcement or nonreinforcement) were either signaled or unsignaled. Terminal-link fixed-time schedules were varied across components within conditions to yield immediacy ratios of 1∶2, 1∶1 and 2∶1. The probabilities of reinforcement were varied across conditions to yield reinforcer ratios of 1∶5, 1∶2, 2∶1 and 5∶1. Results showed that a model based on the generalized matching law provided a good description of response allocation, accounting for 92% of the variance overall. As expected, sensitivity to probability was greater in the unsignaled conditions. However, sensitivity to immediacy was also greater in the unsignaled conditions, suggesting that the effect of signaling terminal-link outcomes may not be limited to probability but apply to reinforcer variables in general. The effects of signaling can be explained in terms of conditioned reinforcement added to each alternative's outcomes in the matching law. There was some evidence for an interaction between reinforcer probability and immediacy, particularly for the dependent-schedules group, such that sensitivity to immediacy was greater at moderate rather than extreme reinforcer ratios. However, further analysis suggested that this could have been due to a ceiling effect on response allocation imposed by dependent scheduling. Overall, the present results show that the generalized matching law can provide a useful account of choice between outcomes that vary in both probability and immediacy of reinforcement.
Collapse
Affiliation(s)
- Karla M Mattson
- Department of Psychology, University of Canterbury, Private Bag 4800, Christchurch, New Zealand
| | | | | | | |
Collapse
|
41
|
Kyonka EGE, Grace RC. Rapid acquisition of choice and timing and the provenance of the terminal-link effect. J Exp Anal Behav 2011; 94:209-25. [PMID: 21451749 DOI: 10.1901/jeab.2010.94-209] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2009] [Accepted: 05/14/2010] [Indexed: 11/22/2022]
Abstract
Eight pigeons responded in a concurrent-chains procedure in which terminal-link schedules changed pseudorandomly across sessions. Pairs of terminal-link delays either summed to 15 s or to 45 s. Across sessions, the location of the shorter terminal link changed according to a pseudorandom binary sequence. On some terminal links, food was withheld to obtain start and stop times, measures of temporal control. Log initial-link response ratios stabilized within the first half of each session. Log response ratio was a monotonically-increasing but nonlinear function of programmed log terminal-link immediacy ratio. There was an effect of absolute terminal-link duration on log response ratio: For most subjects, preference for the relatively shorter terminal-link delay was stronger when absolute delays were long than when absolute delays were short. Polynomial regressions and model comparison showed that differences in degree of nonlinearity, not in sensitivity to log immediacy ratio, produced this effect. Temporal control of stop times was timescale invariant with scalar variability, but temporal control of start times was not consistent across subjects or terminal-link durations.
Collapse
Affiliation(s)
- Elizabeth G E Kyonka
- West Virginia University, Department of Psychology, Morgantown, WV 26506-6040, USA.
| | | |
Collapse
|
42
|
Singer RA, Zentall TR. Preference for the Outcome That Follows a Relatively Aversive Event: Contrast or Delay Reduction? LEARNING AND MOTIVATION 2011; 42:255-271. [PMID: 22993453 PMCID: PMC3444245 DOI: 10.1016/j.lmot.2011.06.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Pigeons prefer a positive discriminative (S+) stimulus that follows a less preferred event (a large number of required responses, a longer delay, or the absence of food) over a different S+ with a similar history of reinforcement that follows a more preferred event (a single required response, no delay, or food). We proposed that this phenomenon results from contrast (referred to as within-trial contrast) between the less preferred initial event and the signal for reinforcement. Delay reduction theory (Fantino, 1969) can account for these results by proposing that the less preferred initial event lengthens the duration of the trial, thereby allowing the S+ stimulus to occur later in the trial and thus become a better predictor of reinforcement. In the present experiments, we further explored this effect. In Experiment 1, we controlled for trial duration by using a fixed ratio response (30 pecks) as one initial event and the absence of pecking for the same duration as the other initial event (0 pecks). The pigeons showed a reliable preference for the positive stimulus that followed the least preferred initial event. In Experiment 2, we controlled for trial duration by using 30 pecks as one initial event and 1 peck followed by a delay that matched the duration of the preceding 30-peck trial. (Group Time Same). For Group Time Different, there was no delay following the 1-peck initial event. For Group Time Same, preference for the initial event negatively predicted the pigeons' preference for the S+ stimulus that followed, supporting the contrast account. A somewhat greater preference for the discriminative stimulus that followed the least preferred initial event was found for Group Time Different suggesting that in addition to contrast, delay reduction also may play a small role. However, the greater initial-event preference found for Group Time Different suggests that contrast can account for the group difference as well.
Collapse
Affiliation(s)
- Rebecca A. Singer
- Department of Psychology, Georgetown College, Georgetown, KY 40324, Phone: 502-863-8353,
| | - Thomas R. Zentall
- Department of Psychology, University of Kentucky, Lexington, KY 40506-0044.
| |
Collapse
|
43
|
McDevitt MA, Williams BA. Dual effects on choice of conditioned reinforcement frequency and conditioned reinforcement value. J Exp Anal Behav 2011; 93:147-55. [PMID: 20885807 DOI: 10.1901/jeab.2010.93-147] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2009] [Accepted: 12/14/2009] [Indexed: 10/19/2022]
Abstract
Pigeons were presented with a concurrent-chains schedule in which the total time to primary reinforcement was equated for the two alternatives (VI 30 s VI 60 s vs. VI 60 s VI 30 s). In one set of conditions, the terminal links were signaled by the same stimulus, and in another set of conditions they were signaled by different stimuli. Choice was in favor of the shorter terminal link when the terminal links were differentially signaled but in favor of the shorter initial link (and longer terminal link) when the terminal links shared the same stimulus. Preference reversed regularly with reversals of the stimulus condition and was unrelated to the discrimination between the two terminal links during the nondifferential stimulus condition. The present results suggest that the relative value of the terminal-link stimuli and the relative rate of conditioned reinforcer presentation are important influences on choice behavior, and that models of conditioned reinforcement need to include both factors.
Collapse
Affiliation(s)
- Margaret A McDevitt
- Psychology Department, McDaniel College, 2 College Hill, Westminster, MD 21157, USA.
| | | |
Collapse
|
44
|
Christensen DR, Grace RC. A decision model for steady-state choice in concurrent chains. J Exp Anal Behav 2010; 94:227-40. [PMID: 21451750 PMCID: PMC2929087 DOI: 10.1901/jeab.2010.94-227] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2009] [Accepted: 05/18/2010] [Indexed: 10/18/2022]
Abstract
Grace and McLean (2006) proposed a decision model for acquisition of choice in concurrent chains which assumes that after reinforcement in a terminal link, subjects make a discrimination whether the preceding reinforcer delay was short or long relative to a criterion. Their model was subsequently extended by Christensen and Grace (2008, 2009a, 2009b) to include effects of initial- and terminal-link duration on choice. We show that an expression for steady-state responding can be derived from the decision model, which enables a model for choice that provides an account of archival data that is equal or superior to the contextual choice model (Grace, 1994) and hyperbolic value-added model (Mazur, 2001) in terms of goodness of fit, parsimony, and parameter invariance. The success of the steady-state decision model validates the strategy of understanding acquisition phenomena as a bridge toward explaining choice at the molar level.
Collapse
|
45
|
|
46
|
Savastano H, Fantino E. Differences in delay, not ratios, control choice in concurrent chains. J Exp Anal Behav 2010; 66:97-116. [PMID: 16812820 PMCID: PMC1284562 DOI: 10.1901/jeab.1996.66-97] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
47
|
Grace R. The Matching Law And Amount-dependent Exponential Discounting As Accounts Of Self-control Choice. J Exp Anal Behav 2010; 71:27-44. [PMID: 16812890 PMCID: PMC1284692 DOI: 10.1901/jeab.1999.71-27] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
48
|
Abstract
A three-component concurrent-chains procedure was used to investigate preference between terminal-link schedules that differed in delay and magnitude of reinforcement. Response and time allocation data were well described by a generalized matching model. Sensitivity to delay appeared to be lower when reinforcement magnitudes were unequal than when they were equal, but when obtained rather than programmed time spent responding in the initial links was used in the model, the difference vanished. The results support independence of delay and magnitude as separate dimensions of reinforcement value, as required by the matching law, and the assumption of the contextual choice model (Grace, 1994) that sensitivities to delay and magnitude are affected similarly by temporal context. Although there was statistical evidence for interaction between successive components, the effects were small and transient. The multiple-component concurrent-chains procedure should prove useful in future research on multidimensional preference, although it may be necessary to control obtained initial-link time more precisely.
Collapse
|
49
|
|
50
|
Grace R, Schwendiman J, Nevin J. Effects of unsignaled delay of reinforcement on preference and resistance to change. J Exp Anal Behav 2010; 69:247-61. [PMID: 16812876 PMCID: PMC1284662 DOI: 10.1901/jeab.1998.69-247] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|