1
|
Gomes-Ng S, Cowie S, Elliffe D. Divided stimulus control depends on differential and nondifferential reinforcement: Testing a quantitative model. J Exp Anal Behav 2023; 120:344-362. [PMID: 37581958 DOI: 10.1002/jeab.876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 07/26/2023] [Indexed: 08/17/2023]
Abstract
We investigated the effects of differential and nondifferential reinforcers on divided control by compound-stimulus dimensions. Six pigeons responded in a delayed matching-to-sample procedure in which a blue or yellow sample stimulus flashed on/off at a fast or slow rate, and subjects reported its color or alternation frequency. The dimension to report was unsignaled (Phase 1) or signaled (Phase 2). Correct responses were reinforced with a probability of .70, and the probability of reinforcers for errors varied across conditions. Comparison choice depended on reinforcer ratios for correct and incorrect responding; as the frequency of error reinforcers according to a dimension increased, control (measured by log d) by that dimension decreased and control by the other dimension increased. Davison and Nevin's (1999) model described data when the dimension to report was unsignaled, whereas model fits were poorer when it was signaled, perhaps due to carryover between conditions. We are the first to test this quantitative model of divided control with reinforcers for errors and when the dimension to report is signaled; hence, further research is needed to establish the model's generality. We question whether divided stimulus control is dimensional and suggest it may instead reflect joint control by compound stimuli and reinforcer ratios.
Collapse
Affiliation(s)
- Stephanie Gomes-Ng
- Auckland University of Technology, Auckland, New Zealand
- The University of Auckland, Auckland, New Zealand
| | - Sarah Cowie
- The University of Auckland, Auckland, New Zealand
| | | |
Collapse
|
2
|
Killeen PR. Theory of reinforcement schedules. J Exp Anal Behav 2023; 120:289-319. [PMID: 37706228 DOI: 10.1002/jeab.880] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 08/04/2023] [Indexed: 09/15/2023]
Abstract
The three principles of reinforcement are (1) events such as incentives and reinforcers increase the activity of an organism; (2) that activity is bounded by competition from other responses; and (3) animals approach incentives and their signs, guided by their temporal and physical conditions, together called the "contingencies of reinforcement." Mathematical models of each of these principles comprised mathematical principles of reinforcement (MPR; Killeen, 1994). Over the ensuing decades, MPR was extended to new experimental contexts. This article reviews the basic theory and its extensions to satiation, warm-up, extinction, sign tracking, pausing, and sequential control in progressive-ratio and multiple schedules. In the latter cases, a single equation balancing target and competing responses governs behavioral contrast and behavioral momentum. Momentum is intrinsic in the fundamental equations, as behavior unspools more slowly from highly aroused responses conditioned by higher rates of incitement than it does from responses from leaner contexts. Habits are responses that have accrued substantial behavioral momentum. Operant responses, being predictors of reinforcement, are approached by making them: The sight and feel of a paw on a lever is approached by placing paw on lever, as attempted for any sign of reinforcement. Behavior in concurrent schedules is governed by approach to momentarily richer patches (melioration). Applications of MPR in behavioral pharmacology and delay discounting are noted.
Collapse
|
3
|
Gomes-Ng S, Cowie S, Elliffe D. Is superstitious responding a matter of detectability? A replication of Killeen (1978). J Exp Anal Behav 2023; 120:171-185. [PMID: 37184425 DOI: 10.1002/jeab.855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 04/22/2023] [Indexed: 05/16/2023]
Abstract
Organisms may sometimes behave as if a contingency exists between behavior and consequences, even if this is not actually the case. Killeen (1978) suggested that such superstition occurs because of factors that bias subjects to behave "superstitiously" rather than because of failures of discrimination. We systematically replicated Killeen's experiment and compared contingency discrimination between different consequences. Six pigeons responded in a matching-to-sample procedure in which a response-independent or response-dependent stimulus change, food delivery, or blackout occurred. The pigeons reported whether the consequence was response dependent or response independent by choosing between two side keys. Discrimination was strongest after stimulus changes, weaker after blackouts, and weakest after food deliveries. These differences persisted even after additional training, suggesting asymmetries that may reflect differences in the disruptive effects of different consequences on remembering and/or behavioral mnemonics. Importantly, the pigeons were not biased to report response-dependent consequences unless that response was consistent with locational biases; that is, they behaved "superstitiously" when there was a reason to be biased to do so. These findings corroborate Killeen's and demonstrate that behavior may deviate from contingencies not necessarily because subjects cannot discriminate those contingencies but because they are biased to behave otherwise.
Collapse
Affiliation(s)
- Stephanie Gomes-Ng
- Auckland University of Technology, New Zealand
- The University of Auckland, New Zealand
| | | | | |
Collapse
|
4
|
Primacy and recency effects in hierarchical renewal in rats. Behav Processes 2022; 201:104732. [PMID: 35988894 DOI: 10.1016/j.beproc.2022.104732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 08/04/2022] [Accepted: 08/10/2022] [Indexed: 11/22/2022]
Abstract
Previous studies on hierarchical resurgence produced mixed results regarding the order and magnitude of recurrence of responses trained initially (primacy effect) or more recently (recency effect). Although changes in contextual stimuli could explain such differences, in resurgence procedures contextual stimuli are not commonly presented, thus their effects on multiple operants trained sequentially remain unclear. Renewal procedures, in contrast, have been useful to determine the effects of exteroceptive contextual stimuli on response recurrence. Thus, primacy and recency effects were studied using a renewal procedure in which three contexts were presented sequentially. Lever presses by rats were reinforced on a different lever under each training context and were then exposed to extinction in a different context. Presses on a fourth lever were never reinforced. During renewal testing, the three training contexts were presented in the same or inverse order relative to training. A strong primacy effect was found in rats exposed to the original training order. Both primacy and recency effects were found when the rats were exposed to contexts in inverse order. These results suggest that the magnitude of renewal of hierarchically trained responses is affected by training order and order of presentation of contextual stimuli during testing.
Collapse
|
5
|
Gomes-Ng S, Baharrizki D, Cowie S, Elliffe D, Bai JYH. Effects of brief post-sample cues signaling presence or absence of reinforcers in delayed matching-to-sample. Behav Processes 2022; 200:104664. [PMID: 35654309 DOI: 10.1016/j.beproc.2022.104664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 04/28/2022] [Accepted: 05/25/2022] [Indexed: 11/02/2022]
Abstract
When short-term memory is assessed in the delayed matching-to-sample (DMTS) procedure, performance is better when cues signal larger reinforcer magnitudes or higher reinforcer probabilities for correct responding. Previous studies demonstrating signaled-magnitude or signaled-probability effects presented cues for a prolonged period during the sample stimulus and/or retention interval. The present study asked whether a signaled-probability effect would occur with brief post-sample cues that signaled the presence or absence of reinforcement. Five pigeons responded in a DMTS task in which sample stimuli were sometimes followed by a 0.5-s cue signaling that reinforcers would either be available or not available in the current trial, and the retention interval varied from 0.5s to 20s. A reliable signaled-probability effect was found when reinforcers were arranged independently and for all correct responses, whereas a smaller, less systematic effect was found when reinforcers were arranged dependently and probabilistically. These findings highlight the importance of reinforcement contingencies and contingency discriminability in remembering, and add to the evidence showing that cues signaling differential reinforcement in DMTS may affect processes during the retention interval and comparison phase, rather than attention to the sample stimulus.
Collapse
|
6
|
Plessas A, Espinosa-Ramos JI, Parry D, Cowie S, Landon J. Machine learning with a snapshot of data: Spiking neural network 'predicts' reinforcement histories of pigeons' choice behavior. J Exp Anal Behav 2022; 117:301-319. [PMID: 35445745 PMCID: PMC9320819 DOI: 10.1002/jeab.759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 03/25/2022] [Accepted: 03/29/2022] [Indexed: 11/20/2022]
Abstract
An accumulated body of choice research has demonstrated that choice behavior can be understood within the context of its history of reinforcement by measuring response patterns. Traditionally, work on predicting choice behaviors has been based on the relationship between the history of reinforcement—the reinforcer arrangement used in training conditions—and choice behavior. We suggest an alternative method that treats the reinforcement history as unknown and focuses only on operant choices to accurately predict (more precisely, retrodict) reinforcement histories. We trained machine learning models known as artificial spiking neural networks (SNNs) on previously published pigeon datasets to detect patterns in choices with specific reinforcement histories—seven arranged concurrent variable‐interval schedules in effect for nine reinforcers. Notably, SNN extracted information from a small ‘window’ of observational data to predict reinforcer arrangements. The models' generalization ability was then tested with new choices of the same pigeons to predict the type of schedule used in training. We examined whether the amount of the data provided affected the prediction accuracy and our results demonstrated that choices made by the pigeons immediately after the delivery of reinforcers provided sufficient information for the model to determine the reinforcement history. These results support the idea that SNNs can process small sets of behavioral data for pattern detection, when the reinforcement history is unknown. This novel approach can influence our decisions to determine appropriate interventions; it can be a valuable addition to our toolbox, for both therapy design and research.
Collapse
Affiliation(s)
| | | | - Dave Parry
- Auckland University of Technology, New Zealand
| | | | | |
Collapse
|
7
|
Cowie S, Davison M. Pigeons prefer to invest early for future reinforcers. J Exp Anal Behav 2021; 115:650-666. [PMID: 33945152 DOI: 10.1002/jeab.687] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Revised: 02/14/2021] [Accepted: 03/08/2021] [Indexed: 11/06/2022]
Abstract
Five pigeons were trained in a series of conditions in which food was delivered after 25 responses, but only when a different (Investing) response had been made before the 25 responses had been completed. If an Investing response was not made, the 25 responses ended in blackout. In various conditions, effective Investing responses either had to be made before the first of the 25 responses, or anywhere within the 25 responses; and effective Investing responses either resulted in a stimulus change or did not. Pigeons Invested even when the consequences were temporally and spatially distant, but Investing was most likely when it produced an immediate stimulus change. When given the choice, pigeons preferred to make Investing responses at the beginning of a trial. These findings again demonstrate that behavior may be maintained by events that are separated in time and space from the present.
Collapse
|
8
|
Houston AI, Trimmer PC, McNamara JM. Matching Behaviours and Rewards. Trends Cogn Sci 2021; 25:403-415. [PMID: 33612384 DOI: 10.1016/j.tics.2021.01.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 01/19/2021] [Accepted: 01/21/2021] [Indexed: 10/22/2022]
Abstract
Matching describes how behaviour is related to rewards. The matching law holds when the ratio of an individual's behaviours equals the ratio of the rewards obtained. From its origins in the study of pigeons working for food in the laboratory, the law has been applied to a range of species, both in the laboratory and outside it (e.g., human sporting decisions). Probability matching occurs when the probability of a behaviour equals the probability of being rewarded. Input matching predicts the distribution of individuals across habitats. We evaluate the rationality of the matching law and probability matching, expose the logic of matching in real-world cases, review how recent neuroscience findings relate to matching, and suggest future research directions.
Collapse
Affiliation(s)
- Alasdair I Houston
- School of Biological Sciences, University of Bristol, Life Sciences Building, 24 Tyndall Avenue, Bristol, BS8 1TQ, UK.
| | - Pete C Trimmer
- Department of Psychology, University of Warwick, Coventry, CV4 7AL, UK
| | - John M McNamara
- School of Mathematics, University of Bristol, Fry Building, Woodland Road, Bristol, BS8 1UG, UK
| |
Collapse
|
9
|
Abstract
The place of the concept of response strength in a natural science of behavior has been the subject of much debate. This article reconsiders the concept of response strength for reasons linked to the foundations of a natural science of behavior. The notion of response strength is implicit in many radical behaviorists' work. Palmer (2009) makes it explicit by applying the response strength concept to three levels: (1) overt behavior, (2) covert behavior, and (3) latent or potential behavior. We argue that the concept of response strength is superfluous in general, and an explication of the notion of giving causal status to nonobservable events like latent behavior or response strength is harmful to a scientific endeavor. Interpreting EEG recordings as indicators of changes in response strength runs the risk of reducing behavior to underlying mechanisms, regardless of whether such suggestions are accompanied by behavioral observations. Many radical behaviorists understand behavior as a discrete unit, inviting conceptual mistakes reflected in the notion of response strength. A molar view is suggested as an alternative that accounts for the temporally extended nature of behavior and avoids the perils of a response-strength based approach.
Collapse
Affiliation(s)
- Carsta Simon
- University of Agder, Postboks 422, 4604 Kristiansand, Norway
| | | | - Sarah Cowie
- University of Auckland, Auckland, New Zealand
| |
Collapse
|
10
|
Cowie S, Gomes-Ng S, Hopkinson B, Bai JYH, Landon J. Stimulus control depends on the subjective value of the outcome. J Exp Anal Behav 2020; 114:216-232. [PMID: 32820528 DOI: 10.1002/jeab.622] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 06/28/2020] [Accepted: 07/06/2020] [Indexed: 01/02/2023]
Abstract
Stimuli that provide information about likely future reinforcers tend to shift behavior, provided a reliable relation between the stimulus and the reinforcer can be discriminated. Stimuli that are apparently more reliable exert greater control over behavior. We asked how the subjective value (measured in terms of preference) of reinforcers associated with stimuli influences stimulus control. Five pigeons worked on a concurrent chains procedure in which half of all trials ended in a smaller reinforcer sooner, and the other half in a larger reinforcer later. In Signaled trials, the color and flash duration on the keys in the initial link signaled the outcome of the trial. In Conflicting probe trials, the color and the flash duration signaled conflicting information about the outcome of the trial. Choice in Signaled trials shifted toward the signaled outcome, but was never exclusive. In Conflicting probe trials, control was divided idiosyncratically between the 2 stimulus dimensions, but still favored the outcome with the higher subjective value. Thus, stimulus control depends not only on the perceived reliability of stimuli, but also on the subjective value of the outcome.
Collapse
|
11
|
Galizio M, Mason MG, Bruce K. Successive incrementing non-matching-to-samples in rats: An automated version of the odor span task. J Exp Anal Behav 2020; 114:248-265. [PMID: 32725820 DOI: 10.1002/jeab.619] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 06/20/2020] [Accepted: 06/22/2020] [Indexed: 12/15/2022]
Abstract
The odor span task is a procedure frequently used to study remembering of multiple stimuli in rodents. A large arena is used and odor stimuli are presented using scented cups. Selection of each odor is reinforced when first presented, but not on subsequent presentations; correct selections depend on remembering which stimuli were previously presented. The use of an arena setting with manual stimulus presentation makes the odor span task labor-intensive and limits experimental control; thus, an automated version of the task would be of value. The present study used an operant chamber equipped with an olfactometer and trained rats using successive conditional discrimination procedures under an incrementing non-matching-to-samples contingency. High rates of responding developed to odor stimuli when they were session-novel with low rates of responding to subsequent presentations of that odor. Additional experiments assessed variations of the procedure to determine the role of the frequency of odor presentation and the retention interval separating sample and comparison. Discrimination was impaired with long retention intervals suggesting the importance of this variable. These findings confirmed that rats differentiate between stimuli that are session-novel and those previously encountered and support the use of an automated procedure as an alternative to the odor span task.
Collapse
|
12
|
Effects of variable durations of food availability on interval time-place learning with pigeons Columba Livia. Behav Processes 2020; 179:104192. [PMID: 32645386 DOI: 10.1016/j.beproc.2020.104192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 07/03/2020] [Accepted: 07/03/2020] [Indexed: 11/20/2022]
Abstract
We assessed the effects of variable durations of food availability on an interval time-place learning task. 3 pigeons were exposed to a task in which food could be obtained for responses in one of four feeders according to an RI 25 s during 3 min, after which, food could be obtained on a different feeder according to the same schedule. The correct feeder changed following a fixed sequence that was repeated four times throughout the session. After 50 training sessions, an Open Hopper Test was conducted, after which, the second training condition ensued. This condition was like the first one with the exception that the availability period could be either 1,2,3, or 6 min long. A second test was conducted after 50 sessions of this training. Another group of 3 birds experienced these conditions in the reverse order. Data suggest that birds solved the task via interval timing under the fixed duration condition, and via ordinal timing when faced with variable durations. Birds learned the fixed sequence involved in the task under both conditions. Although the present data agree with previous research exploring variability in TPL tasks, they do not necessarily support previous claims for an asymmetrical role of spatial and temporal information in these tasks.
Collapse
|
13
|
Abstract
Behavior in the present depends critically on experience in similar environments in the past. Such past experience may be important in controlling behavior not because it determines the strength of a behavior, but because it allows the structure of the current environment to be detected and used. We explore a prospective-control approach to understanding simple behavior. Under this approach, order in the environment allows even simple organisms to use their personal past to respond according to the likely future. The predicted future controls behavior, and past experience forms the building blocks of the predicted future. We explore how generalization affects the use of past experience to predict and respond to the future. First, we consider how generalization across various dimensions of an event determines the degree to which the structure of the environment exerts control over behavior. Next, we explore generalization from the past to the present as the method of deciding when, where, and what to do. This prospective-control approach is measurable and testable; it builds predictions from events that have already occurred, and assumes no agency. Under this prospective-control approach, generalization is fundamental to understanding both adaptive and maladaptive behavior.
Collapse
Affiliation(s)
- Sarah Cowie
- University of Auckland, Auckland, New Zealand
| | | |
Collapse
|
14
|
Response–reinforcer contiguity versus response‐rate–reinforcer‐rate covariance in rats' lever pressing: Support for a multiscale view. J Exp Anal Behav 2020; 113:530-548. [DOI: 10.1002/jeab.594] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Revised: 02/07/2020] [Accepted: 02/24/2020] [Indexed: 11/07/2022]
|
15
|
Cowie S, Davison M. Being there on time: Reinforcer effects on timing and locating. J Exp Anal Behav 2020; 113:340-362. [PMID: 31994217 DOI: 10.1002/jeab.581] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2018] [Revised: 10/03/2019] [Accepted: 12/10/2019] [Indexed: 11/08/2022]
Abstract
In research on timing, reinforcers often are assumed to influence discrimination of elapsed time. We asked whether changes in choice used to measure timing arise because of joint control by elapsed time and reinforcers, rather than from the direct modification of control by elapsed time by reinforcers. Pigeons worked on a concurrent-choice task in which 1 response was 9 times more likely to produce a reinforcer, reversing between locations when 19 s had elapsed since the marker event. Across conditions, we manipulated the percentage of reinforcers arranged before the probability reversal from 5 to 95%. These changes in reinforcer percentages altered control by location-based elements of the contingency, but not by time-based elements. Choice was well described by a model that assumes that control by the contingency is weakened by generalization across the time and location of reinforcers, and that these generalizations become more likely at later times since a marker. These findings add to a growing body of research that suggests that reinforcers share the same function as other environmental events in determining how the environment controls behavior.
Collapse
|
16
|
Mattaini MA. Out of the Lab: Shaping an Ecological and Constructional Cultural Systems Science. Perspect Behav Sci 2019; 42:713-731. [PMID: 31976457 PMCID: PMC6901649 DOI: 10.1007/s40614-019-00208-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022] Open
Abstract
Contemporary societies face critical, interlocking, "wicked" challenges, including economic inequities and marginalization, personal and collective violence, ethnic and religious conflicts, degradation of "the commons," climate change, and more, and all of these issues clearly are grounded in behavior. An adequate culturo-behavior science could be positioned to advance and leverage research and interventions supporting community well-being, and contribute to overcoming urgent societal and global challenges. The current state of cultural systems science, however, is limited by theory and methodology, and by competition for attention with well-established research and practice opportunities related to individual-level challenges. In this article, the author explores those limitations, and suggests a more expansive perspective drawing on historical and contemporary ecological science and contemporary theories of complex systems. Research guided by established science within those disciplines offers opportunities to move cultural systems science out of the lab, and into a more adequate, environmentally rich stance drawing on ecological strategies, recursively integrating contextual observations, conceptual advances, and in vivo experimentation. Examples of each of those strategies and exploration of developmental programs of research grounded in such integration are explored.
Collapse
Affiliation(s)
- Mark A. Mattaini
- Jane Addams College of Social Work, University of Illinois at Chicago, Chicago, IL USA
- Paguate, USA
| |
Collapse
|
17
|
Shahan TA, Browning KO, Nall RW. Resurgence as Choice in Context: Treatment duration and on/off alternative reinforcement. J Exp Anal Behav 2019; 113:57-76. [PMID: 31777091 DOI: 10.1002/jeab.563] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2019] [Revised: 11/04/2019] [Accepted: 11/13/2019] [Indexed: 12/15/2022]
Abstract
Resurgence as Choice (RaC) is a quantitative theory suggesting that an increase in an extinguished target behavior with subsequent extinction of an alternative behavior (i.e., resurgence) is governed by the same processes as choice more generally. We present data from an experiment with rats examining a range of treatment durations with alternative reinforcement plus extinction and demonstrate that increases in treatment duration produce small but reliable decreases in resurgence. Although RaC predicted the relation between target responding and treatment duration, the model failed in other respects. First, contrary to predictions, the present experiment also replicated previous findings that exposure to cycling on/off alternative reinforcement reduces resurgence. Second, RaC did a poor job simultaneously accounting for target and alternative behaviors across conditions. We present a revised model incorporating a role for more local signaling effects of reinforcer deliveries or their absence on response allocation. Such signaling effects are suggested to impact response allocation above and beyond the values of the target and alternative behaviors as longer-term repositories of experience. The new model provides an excellent account of the data and can be viewed as an integration of RaC and a quantitative approximation of some aspects of Context Theory.
Collapse
|
18
|
Affiliation(s)
- Sarah Cowie
- School of Psychology, The Univeristy of Auckland, Auckland, New Zealand
| |
Collapse
|
19
|
Abstract
Recent studies have demonstrated that saccadic reaction times (SRTs) are influenced by the temporal regularities of dynamic environments (Vullings & Madelain, 2018). Here, we ask whether discriminative control (i.e., the possibility to use external stimuli signaling the future state of the environment) of latencies in a search task might be established using reinforcement contingencies. Eight participants made saccades within 80-750 ms toward a target displayed among distractors. We constructed two latency classes, "short" and "long," using the first and last quartiles of the individual baseline distributions. We then used a latency-contingent display paradigm in which finding the visual target among other items was made contingent upon specific SRTs. For a first group, the postsaccadic target was displayed only following short latencies with leftward saccades, and following long latencies with rightward saccades. The opposite was true for a second group. When short- and long-latency saccades were reinforced (i.e., the target was displayed) depending on the saccade direction, median latencies differed by 74 ms on average (all outside the 98% null hypothesis confidence intervals). Posttraining, in the absence of reinforcement, we still observed strong differences in latency distributions, averaging 64 ms for leftward versus rightward saccades. Our results demonstrate the discriminative control of SRTs, further supporting the effects of reinforcement learning for saccade. This study reveals that saccade triggering is finely controlled by learned temporal and spatial properties of the environment using predictive mechanisms.
Collapse
Affiliation(s)
- Cécile Vullings
- Université de Lille, CNRS, CHU Lille, UMR 9193 - SCALab - Sciences Cognitives et Sciences Affectives, Lille, France
| | - Laurent Madelain
- Université de Lille, CNRS, CHU Lille, UMR 9193 - SCALab - Sciences Cognitives et Sciences Affectives, Lille, France.,Aix Marseille Université, CNRS, Institut de Neurosciences de la Timone, UMR 7289, Marseille, France
| |
Collapse
|
20
|
Hunter M, Rosales‐Ruiz J. The power of one reinforcer: The effect of a single reinforcer in the context of shaping. J Exp Anal Behav 2019; 111:449-464. [DOI: 10.1002/jeab.517] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2017] [Accepted: 04/11/2019] [Indexed: 11/05/2022]
|
21
|
Jensen G. Instrumental variable estimation of reinforcer effects. J Exp Anal Behav 2019; 111:359-368. [DOI: 10.1002/jeab.498] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2018] [Accepted: 12/21/2018] [Indexed: 11/11/2022]
Affiliation(s)
- Greg Jensen
- Department of Psychology; Columbia University
| |
Collapse
|
22
|
Leslie JC. Testing the limits of behavior analysis: A review of Frans de Waal's Are we smart enough to know how smart animals are? J Exp Anal Behav 2018. [DOI: 10.1002/jeab.482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
23
|
Gomes-Ng S, Elliffe D, Cowie S. Environment tracking and signal following in a reinforcer-ratio reversal procedure. Behav Processes 2018; 157:208-224. [PMID: 30315866 DOI: 10.1016/j.beproc.2018.10.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2018] [Revised: 09/27/2018] [Accepted: 10/04/2018] [Indexed: 01/05/2023]
Abstract
Several studies suggest that the degree of control by reinforcer ratios (environment tracking) and by exteroceptive stimuli that signal future reinforcer availability (signal following) depends on environmental certainty: As reinforcers become more likely at one location, environmental contingencies exert stronger control and exteroceptive stimuli exert weaker control. This research has not yet been extended to environments in which reinforcer availability changes across time, even though such changes are present in most natural environments. Thus, in the present experiment, we examined environment tracking and signal following in a concurrent schedule in which the reinforcer ratio reversed to its reciprocal 30 s after a reinforcer delivery and keylight-color stimuli signaled the likely or definite time or location of the next reinforcer. Across conditions, we manipulated environmental certainty by varying the probability of reinforcer deliveries on the locally richer key. This made the location of future reinforcers at a particular time more or less certain, but did not change the overall reinforcer ratio. Changes in local environmental certainty had little to no effect on environment tracking and signal following; in all conditions, keylight-color stimuli strongly controlled choice and reinforcer ratios exerted weak control. The present findings suggest that the extent of environment tracking and signal following is primarily determined by global, not local, environmental certainty.
Collapse
Affiliation(s)
| | - Douglas Elliffe
- School of Psychology, The University of Auckland, New Zealand
| | - Sarah Cowie
- School of Psychology, The University of Auckland, New Zealand
| |
Collapse
|
24
|
Baum WM. Multiscale behavior analysis and molar behaviorism: An overview. J Exp Anal Behav 2018; 110:302-322. [DOI: 10.1002/jeab.476] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2018] [Accepted: 09/22/2018] [Indexed: 01/04/2023]
Affiliation(s)
- William M. Baum
- University of California, Davis, and University of New Hampshire
| |
Collapse
|
25
|
Bland VJ, Cowie S, Elliffe D, Podlesnik CA. Does a negative discriminative stimulus function as a punishing consequence? J Exp Anal Behav 2018; 110:87-104. [PMID: 29926923 DOI: 10.1002/jeab.444] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Accepted: 05/09/2018] [Indexed: 11/12/2022]
Abstract
The study and use of punishment in behavioral treatments has been constrained by ethical concerns. However, there remains a need to reduce harmful behavior unable to be reduced by differential-reinforcement procedures. We investigated whether response-contingent presentation of a negative discriminative stimulus previously correlated with an absence of reinforcers would punish behavior maintained by positive reinforcers. Across four conditions, pigeons were trained to discriminate between a positive discriminative stimulus (S+) signaling the presence of food, and a negative discriminative stimulus (S-) signaling the absence of food. Once learned, every five responses on average to the S+ produced S- for a duration of 1.5 s. S+ response rate decreased for a majority of pigeons when responses produced S-, compared to when they did not, or when a neutral control stimulus was presented. In Condition 5, choice between two concurrently presented S+ alternatives shifted away from the alternative producing S-, despite a 1:1 reinforcer ratio. Therefore, presenting contingent S- stimuli punishes operant behavior maintained on simple schedules and in choice situations. Development of negative discriminative stimuli as punishers of operant behavior could provide an effective approach to behavioral treatments for problem behavior and subverting suboptimal choices involved in addictions.
Collapse
Affiliation(s)
| | | | | | - Christopher A Podlesnik
- The University of Auckland.,Florida Institute of Technology and The Scott Center for Autism Treatment
| |
Collapse
|
26
|
Abstract
Resurgence is the recurrence of a previously reinforced and then extinguished behavior induced by the extinction of another more recently reinforced behavior. Resurgence provides insight into behavioral processes relevant to treatment relapse of a range of problem behaviors. Resurgence is typically studied across three phases: (1) reinforcement of a target response, (2) extinction of the target and concurrent reinforcement of an alternative response, and (3) extinction of the alternative response, resulting in the recurrence of target responding. Because each phase typically occurs successively and spans multiple sessions, extended time frames separate the training and resurgence of target responding. This study assessed resurgence more dynamically and throughout ongoing training in 6 pigeons. Baseline entailed 50-s trials of a free-operant psychophysical procedure, resembling Phases 1 and 2 of typical resurgence procedures. During the first 25 s, we reinforced target (left-key) responding but not alternative (right-key) responding; contingencies reversed during the second 25 s. Target and alternative responding followed the baseline reinforcement contingencies, with alternative responding replacing target responding across the 50 s. We observed resurgence of target responding during signaled and unsignaled probes that extended trial durations an additional 100 s in extinction. Furthermore, resurgence was greater and/or sooner when probes were signaled, suggesting an important role of discriminating transitions to extinction in resurgence. The data were well described by an extension of a stimulus-control model of discrimination that assumes resurgence is the result of generalization of obtained reinforcers across space and time. Therefore, the present findings introduce novel methods and quantitative analyses for assessing behavioral processes underlying resurgence.
Collapse
|
27
|
Gomes-Ng S, Landon J, Elliffe D, Bensemann J, Cowie S. The effects of changeover delays on local choice. Behav Processes 2018; 150:36-46. [DOI: 10.1016/j.beproc.2018.02.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2017] [Revised: 12/06/2017] [Accepted: 02/26/2018] [Indexed: 10/17/2022]
|
28
|
Rayburn-Reeves RM, Moore MK, Smith TE, Crafton DA, Marden KL. Spatial midsession reversal learning in rats: Effects of egocentric Cue use and memory. Behav Processes 2018. [PMID: 29524553 DOI: 10.1016/j.beproc.2018.03.005] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
The midsession reversal task has been used to investigate behavioral flexibility and cue use in non-human animals, with results indicating differences in the degree of control by environmental cues across species. For example, time-based control has been found in rats only when tested in a T-maze apparatus and under specific conditions in which position and orientation (i.e., egocentric) cues during the intertrial interval could not be used to aid performance. Other research in an operant setting has shown that rats often produce minimal errors around the reversal location, demonstrating response patterns similar to patterns exhibited by humans and primates in this task. The current study aimed to reduce, but not eliminate, the ability for rats to utilize egocentric cues by placing the response levers on the opposite wall of the chamber in relation to the pellet dispenser. Results showed that rats made minimal errors prior to the reversal, suggesting time-based cues were not controlling responses, and that they switched to the second correct stimulus within a few trials after the reversal event. Video recordings also revealed highly structured patterns of behavior by the majority of rats, which often differed depending on which response was reinforced. We interpret these findings as evidence that rats are adept at utilizing their own egocentric cues and that these cues, along with memory for the recent response-reinforcement contingencies, aid in maximizing reinforcement over the session.
Collapse
Affiliation(s)
| | - Mary K Moore
- Georgia Southern University - Armstrong Campus, United States
| | - Thea E Smith
- Georgia Southern University - Armstrong Campus, United States
| | | | - Kelly L Marden
- Georgia Southern University - Armstrong Campus, United States
| |
Collapse
|
29
|
The non-Darwinian evolution of behavers and behaviors. Behav Processes 2017; 161:45-53. [PMID: 29292172 DOI: 10.1016/j.beproc.2017.12.024] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Revised: 12/22/2017] [Accepted: 12/22/2017] [Indexed: 11/24/2022]
Abstract
Many readers of this journal have been schooled in both Darwinian evolution and Skinnerian psychology, which have in common the vision of powerful control of their subjects by their sequalae. Individuals of species that generate more successful offspring come to dominate their habitat; responses of those individuals that generate more reinforcers come to dominate the repertoire of the individual in that context. This is unarguable. What is questionable is how large a role these forces of selection play in the larger landscape of existing organisms and the repertoires of their individuals. Here it is argued that non-Darwinian and non-Skinnerian selection play much larger roles in both than the reader may appreciate. The argument is based on the history of, and recent advances in, microbiology. Lessons from that history re-illuminate the three putative domains of selection by consequences: The evolution of species, response repertoires, and cultures. It is argued that before, beneath, and after the cosmically brief but crucial epoch of Darwinian evolution that shaped creatures such as ourselves, non-Darwinian forces pervade all three domains.
Collapse
|
30
|
Bell MC, Baum WM. Concurrent variable-interval variable-ratio schedules in a dynamic choice environment. J Exp Anal Behav 2017; 108:367-397. [DOI: 10.1002/jeab.286] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Accepted: 10/11/2017] [Indexed: 11/08/2022]
|
31
|
Kuroda T, Mizutani Y, Cançado CR, Podlesnik CA. Reversal learning and resurgence of operant behavior in zebrafish ( Danio rerio ). Behav Processes 2017. [DOI: 10.1016/j.beproc.2017.06.004] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
|
32
|
|
33
|
Gomes-Ng S, Elliffe D, Cowie S. How do reinforcers affect choice? Preference pulses after responses and reinforcers. J Exp Anal Behav 2017; 108:17-38. [DOI: 10.1002/jeab.260] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2016] [Accepted: 05/15/2017] [Indexed: 11/10/2022]
|
34
|
Affiliation(s)
- Michael Davison
- School of Psychology, The University of Auckland, Private Bag 92019, Auckland, 1142 New Zealand
| |
Collapse
|
35
|
Pimenta D, Tonneau F. Correlations among stimuli affect stimulus matching and stimulus liking. Behav Processes 2016; 130:36-8. [DOI: 10.1016/j.beproc.2016.07.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2016] [Revised: 06/30/2016] [Accepted: 07/06/2016] [Indexed: 10/21/2022]
|