1
|
Wilbrecht L, Davidow JY. Goal-directed learning in adolescence: neurocognitive development and contextual influences. Nat Rev Neurosci 2024; 25:176-194. [PMID: 38263216 DOI: 10.1038/s41583-023-00783-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/12/2023] [Indexed: 01/25/2024]
Abstract
Adolescence is a time during which we transition to independence, explore new activities and begin pursuit of major life goals. Goal-directed learning, in which we learn to perform actions that enable us to obtain desired outcomes, is central to many of these processes. Currently, our understanding of goal-directed learning in adolescence is itself in a state of transition, with the scientific community grappling with inconsistent results. When we examine metrics of goal-directed learning through the second decade of life, we find that many studies agree there are steady gains in performance in the teenage years, but others report that adolescent goal-directed learning is already adult-like, and some find adolescents can outperform adults. To explain the current variability in results, sophisticated experimental designs are being applied to test learning in different contexts. There is also increasing recognition that individuals of different ages and in different states will draw on different neurocognitive systems to support goal-directed learning. Through adoption of more nuanced approaches, we can be better prepared to recognize and harness adolescent strengths and to decipher the purpose (or goals) of adolescence itself.
Collapse
Affiliation(s)
- Linda Wilbrecht
- Department of Psychology, University of California, Berkeley, CA, USA.
- Helen Wills Neuroscience Institute, University of California, Berkeley, CA, USA.
| | - Juliet Y Davidow
- Department of Psychology, Northeastern University, Boston, MA, USA.
| |
Collapse
|
2
|
Krugliakova E, Klucharev V, Fedele T, Gorin A, Kuznetsova A, Shestakova A. Correlation of cue-locked FRN and feedback-locked FRN in the auditory monetary incentive delay task. Exp Brain Res 2017; 236:141-151. [PMID: 29196772 DOI: 10.1007/s00221-017-5113-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Accepted: 10/24/2017] [Indexed: 02/02/2023]
Abstract
Reflecting the discrepancy between received and predicted outcomes, the reward prediction error (RPE) plays an important role in learning in a dynamic environment. A number of studies suggested that the feedback-related negativity (FRN) component of an event-related potential, known to be associated with unexpected outcomes, encodes RPEs. While FRN was clearly shown to be sensitive to the probability of outcomes, the effect of outcome magnitude on FRN remains to be further clarified. In studies on the neural underpinnings of reward anticipation and outcome evaluation, a monetary incentive delay (MID) task proved to be particularly useful. We investigated whether feedback-locked FRN and cue-locked dN200 responses recorded during an auditory MID task were sensitive to the probability and magnitude of outcomes. The cue-locked dN200 is associated with the update of information about the magnitude of prospective outcomes. Overall, we showed that feedback-locked FRN was modulated by both the magnitude and the probability of outcomes during an auditory version of MID task, whereas no such effect was found for cue-locked dN200. Furthermore, the cue-locked dN200, which is associated with the update of information about the magnitude of prospective outcomes, correlated with the standard feedback-locked FRN, which is associated with a negative RPE. These results further expand our knowledge on the interplay between the processing of predictive cues that forecast future outcomes and the subsequent revision of these predictions during outcome delivery.
Collapse
Affiliation(s)
- Elena Krugliakova
- Centre for Cognition and Decision Making, National Research University Higher School of Economics, 3a Krivokolenniy sidewalk, Moscow, 101000, Russian Federation.
| | - Vasily Klucharev
- Centre for Cognition and Decision Making, National Research University Higher School of Economics, 3a Krivokolenniy sidewalk, Moscow, 101000, Russian Federation
| | - Tommaso Fedele
- Neurosurgery Department, University Hospital Zürich, Frauenklinikstrasse 10, 8091, Zurich, Switzerland
| | - Alexey Gorin
- Centre for Cognition and Decision Making, National Research University Higher School of Economics, 3a Krivokolenniy sidewalk, Moscow, 101000, Russian Federation
| | - Aleksandra Kuznetsova
- Centre for Cognition and Decision Making, National Research University Higher School of Economics, 3a Krivokolenniy sidewalk, Moscow, 101000, Russian Federation
| | - Anna Shestakova
- Centre for Cognition and Decision Making, National Research University Higher School of Economics, 3a Krivokolenniy sidewalk, Moscow, 101000, Russian Federation
| |
Collapse
|
3
|
Balodis IM, Potenza MN. Anticipatory reward processing in addicted populations: a focus on the monetary incentive delay task. Biol Psychiatry 2015; 77:434-44. [PMID: 25481621 PMCID: PMC4315733 DOI: 10.1016/j.biopsych.2014.08.020] [Citation(s) in RCA: 156] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/07/2014] [Revised: 08/12/2014] [Accepted: 08/26/2014] [Indexed: 11/26/2022]
Abstract
Advances in brain imaging techniques have allowed neurobiological research to temporally analyze signals coding for the anticipation of reward. In addicted populations, both hyporesponsiveness and hyperresponsiveness of brain regions (e.g., ventral striatum) implicated in drug effects and reward system processing have been reported during anticipation of generalized reward. We discuss the current state of knowledge of reward processing in addictive disorders from a widely used and validated task: the monetary incentive delay task. Only studies applying the monetary incentive delay task in addicted and at-risk adult populations are reviewed, with a focus on anticipatory processing and striatal regions activated during task performance as well as the relationship of these regions with individual difference (e.g., impulsivity) and treatment outcome variables. We further review drug influences in challenge studies as a means to examine acute influences on reward processing in abstinent, recreationally using, and addicted populations. Generalized reward processing in addicted and at-risk populations is often characterized by divergent anticipatory signaling in the ventral striatum. Although methodologic and task variations may underlie some discrepant findings, anticipatory signaling in the ventral striatum may also be influenced by smoking status, drug metabolites, and treatment status in addicted populations. Divergent results across abstinent, recreationally using, and addicted populations demonstrate complexities in interpreting findings. Future studies would benefit from focusing on characterizing how impulsivity and other addiction-related features relate to anticipatory striatal signaling over time. Additionally, identifying how anticipatory signals recover or adjust after protracted abstinence will be important in understanding recovery processes.
Collapse
Affiliation(s)
- Iris M. Balodis
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA,Corresponding Author: Iris M. Balodis, PhD, Yale University School of Medicine, 1 Church Street, Rm 731, New Haven, CT 06519, Tel: 203-737-2668,
| | - Marc N. Potenza
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA,Department of Neurobiology, Yale University School of Medicine, New Haven, CT, USA,Child Study Center, Yale University School of Medicine, New Haven, CT, USA
| |
Collapse
|
4
|
Mondragón E, Gray J, Alonso E, Bonardi C, Jennings DJ. SSCC TD: a serial and simultaneous configural-cue compound stimuli representation for temporal difference learning. PLoS One 2014; 9:e102469. [PMID: 25054799 PMCID: PMC4108321 DOI: 10.1371/journal.pone.0102469] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2014] [Accepted: 06/18/2014] [Indexed: 11/18/2022] Open
Abstract
This paper presents a novel representational framework for the Temporal Difference (TD) model of learning, which allows the computation of configural stimuli--cumulative compounds of stimuli that generate perceptual emergents known as configural cues. This Simultaneous and Serial Configural-cue Compound Stimuli Temporal Difference model (SSCC TD) can model both simultaneous and serial stimulus compounds, as well as compounds including the experimental context. This modification significantly broadens the range of phenomena which the TD paradigm can explain, and allows it to predict phenomena which traditional TD solutions cannot, particularly effects that depend on compound stimuli functioning as a whole, such as pattern learning and serial structural discriminations, and context-related effects.
Collapse
Affiliation(s)
- Esther Mondragón
- Centre for Computational and Animal Learning Research, St Albans, United Kingdom
| | - Jonathan Gray
- Centre for Computational and Animal Learning Research, St Albans, United Kingdom
- Institute for Complex Systems Simulations, University of Southampton, Southampton, United Kingdom
| | - Eduardo Alonso
- Centre for Computational and Animal Learning Research, St Albans, United Kingdom
- Department of Computer Science, City University London, London, United Kingdom
| | - Charlotte Bonardi
- Centre for Computational and Animal Learning Research, St Albans, United Kingdom
- School of Psychology, University of Nottingham, Nottingham, United Kingdom
| | - Dómhnall J. Jennings
- Centre for Computational and Animal Learning Research, St Albans, United Kingdom
- Institute of Neuroscience, Newcastle University, Newcastle upon Tyne, United Kingdom
| |
Collapse
|
5
|
Aquili L, Liu AW, Shindou M, Shindou T, Wickens JR. Behavioral flexibility is increased by optogenetic inhibition of neurons in the nucleus accumbens shell during specific time segments. Learn Mem 2014; 21:223-31. [PMID: 24639489 PMCID: PMC3966536 DOI: 10.1101/lm.034199.113] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
Abstract
Behavioral flexibility is vital for survival in an environment of changing contingencies. The nucleus accumbens may play an important role in behavioral flexibility, representing learned stimulus–reward associations in neural activity during response selection and learning from results. To investigate the role of nucleus accumbens neural activity in behavioral flexibility, we used light-activated halorhodopsin to inhibit nucleus accumbens shell neurons during specific time segments of a bar-pressing task requiring a win–stay/lose–shift strategy. We found that optogenetic inhibition during action selection in the time segment preceding a lever press had no effect on performance. However, inhibition occurring in the time segment during feedback of results—whether rewards or nonrewards—reduced the errors that occurred after a change in contingency. Our results demonstrate critical time segments during which nucleus accumbens shell neurons integrate feedback into subsequent responses. Inhibiting nucleus accumbens shell neurons in these time segments, during reinforced performance or after a change in contingencies, increases lose–shift behavior. We propose that the activity of nucleus shell accumbens shell neurons in these time segments plays a key role in integrating knowledge of results into subsequent behavior, as well as in modulating lose–shift behavior when contingencies change.
Collapse
Affiliation(s)
- Luca Aquili
- Okinawa Institute of Science and Technology Graduate University, Neurobiology Research Unit, Onna-son, Japan 904-0495
| | | | | | | | | |
Collapse
|
6
|
Okatan M. Correlates of reward-predictive value in learning-related hippocampal neural activity. Hippocampus 2009; 19:487-506. [PMID: 19123250 PMCID: PMC2742500 DOI: 10.1002/hipo.20535] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Temporal difference learning (TD) is a popular algorithm in machine learning. Two learning signals that are derived from this algorithm, the predictive value and the prediction error, have been shown to explain changes in neural activity and behavior during learning across species. Here, the predictive value signal is used to explain the time course of learning-related changes in the activity of hippocampal neurons in monkeys performing an associative learning task. The TD algorithm serves as the centerpiece of a joint probability model for the learning-related neural activity and the behavioral responses recorded during the task. The neural component of the model consists of spiking neurons that compete and learn the reward-predictive value of task-relevant input signals. The predictive-value signaled by these neurons influences the behavioral response generated by a stochastic decision stage, which constitutes the behavioral component of the model. It is shown that the time course of the changes in neural activity and behavioral performance generated by the model exhibits key features of the experimental data. The results suggest that information about correct associations may be expressed in the hippocampus before it is detected in the behavior of a subject. In this way, the hippocampus may be among the earliest brain areas to express learning and drive the behavioral changes associated with learning. Correlates of reward-predictive value may be expressed in the hippocampus through rate remapping within spatial memory representations, they may represent reward-related aspects of a declarative or explicit relational memory representation of task contingencies, or they may correspond to reward-related components of episodic memory representations. These potential functions are discussed in connection with hippocampal cell assembly sequences and their reverse reactivation during the awake state. The results provide further support for the proposal that neural processes underlying learning may be implementing a temporal difference-like algorithm.
Collapse
Affiliation(s)
- Murat Okatan
- Laboratory of Cognitive Neurobiology, Department of Psychology, Boston University, Boston, MA 02215, USA.
| |
Collapse
|
7
|
A head-neck-eye system that learns fault-tolerant saccades to 3-D targets using a self-organizing neural model. Neural Netw 2008; 21:1380-91. [PMID: 18775642 DOI: 10.1016/j.neunet.2008.07.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2007] [Revised: 07/31/2008] [Accepted: 07/31/2008] [Indexed: 11/22/2022]
Abstract
This paper describes a head-neck-eye camera system that is capable of learning to saccade to 3-D targets in a self-organized fashion. The self-organized learning process is based on action perception cycles where the camera system performs micro saccades about a given head-neck-eye camera position and learns to map these micro saccades to changes in position of a 3-D target currently in view of the stereo camera. This motor babbling phase provides self-generated movement commands that activate correlated visual, spatial and motor information that are used to learn an internal coordinate transformation between vision and motor systems. The learned transform is used by resulting head-neck-eye camera system to accurately saccade to 3-D targets using many different combinations of head, neck, and eye positions. The interesting aspect of the learned transform is that it is robust to a wide variety of disturbances including reduced degrees of freedom of movement for the head, neck, one eye, or any combination of two of the three, movement of head and neck as a function of eye movements, changes in the stereo camera separation distance and changes in focal lengths of the cameras. These disturbances were not encountered during motor babbling phase. This feature points to general nature of the learned transform in its ability to control autonomous systems with redundant degrees of freedom in a very robust and fault-tolerant fashion.
Collapse
|
8
|
Zeybek Z, Yüce Cetinkaya S, Alioglu F, Alpbaz M. Determination of optimum operating conditions for industrial dye wastewater treatment using adaptive heuristic criticism pH control. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2007; 85:404-14. [PMID: 17141939 DOI: 10.1016/j.jenvman.2006.10.013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2005] [Revised: 10/08/2006] [Accepted: 10/17/2006] [Indexed: 05/12/2023]
Abstract
For a pilot-scale application, pH control in the treatment of highly contaminated dye industrial wastewater containing metallic compounds as the main pollutants has been investigated with a method using adaptive heuristic criticism control (AHCC). Subsequent experimentation on between 12 and 18 l of the wastewater was carried out using statistical experimental design methodology to evaluate the effects of three critical factors: slaked lime (calcium hydroxide, Ca(OH)(2)) concentration, iron chloride (FeCl(3)) concentration and wastewater volume. With these critical factors, the wastewater treatment process is modeled as an appropriate quadratic cost function of the turbidity of the clarified water. The model is optimized with Rosenbrock's method. Response surface topology of the wastewater treatment is given in terms of optimal concentrations of lime water and FeCl(3) and optimal wastewater volume at pH 11.
Collapse
Affiliation(s)
- Z Zeybek
- Department of Chemical Engineering, Ankara University, 06100 Tandogan, Ankara, Turkey.
| | | | | | | |
Collapse
|
9
|
Zeybek Z, Karapinar T, Alpbaz M, Hapoglu H. Application of adaptive heuristic criticism control (AHCC) to dye wastewater. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2007; 84:461-72. [PMID: 16949196 DOI: 10.1016/j.jenvman.2006.06.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2005] [Revised: 06/19/2006] [Accepted: 06/21/2006] [Indexed: 05/11/2023]
Abstract
This paper presents an experimental application of AHCC to study the coagulation process of wastewater treatment in a dye plant. Also this study includes a series of tests in which an AHCC control was used for pH control. The performance results of the AHCC controller are compared with the results obtained by using a conventional proportional-integral-derivative (PID) algorithm. It is useful to compare PID with AHCC to illustrate the extreme range of the nonlinearity of the dye wastewater treatment process. Although the removal of pollutants from wastewater is similar with AHCC and PID, our results show excellent AHCC performance in the region where conventional PID control fails.
Collapse
Affiliation(s)
- Z Zeybek
- Ankara University, Engineering Faculty, Department of Chemical Engineering 06100 Tandogan, Ankara, Turkey.
| | | | | | | |
Collapse
|
10
|
Padlubnaya DB, Parekh NH, Brown TH. Neurophysiological theory of Kamin blocking in fear conditioning. Behav Neurosci 2006; 120:337-52. [PMID: 16719698 DOI: 10.1037/0735-7044.120.2.337] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Kamin blocking in fear conditioning is thought to reflect diminished processing of the unconditional stimulus (US) in the presence of a conditional stimulus (CS-super(+)) that was previously paired with this US. According to Fanselow's (1998) hypothesis, the CS-super(+) drives output from the amygdala that ultimately produces analgesia by causing opiate release onto afferent pain circuits. This hypothesis was explored quantitatively through neurophysiological simulations. The results suggest that opiate-mediated, negative-feedback control of US processing is too slow for efficient blocking of cue conditioning. The reason is that conditioning-produced synaptic modifications can be induced before the opiate-mediated inhibition has any substantial effect on US processing. The results suggest the existence of an additional, faster-acting, inhibitory neurotransmitter in the blocking circuit.
Collapse
|
11
|
Pan WX, Schmidt R, Wickens JR, Hyland BI. Dopamine cells respond to predicted events during classical conditioning: evidence for eligibility traces in the reward-learning network. J Neurosci 2005; 25:6235-42. [PMID: 15987953 PMCID: PMC6725057 DOI: 10.1523/jneurosci.1478-05.2005] [Citation(s) in RCA: 308] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2005] [Revised: 05/13/2005] [Accepted: 05/14/2005] [Indexed: 11/21/2022] Open
Abstract
Behavioral conditioning of cue-reward pairing results in a shift of midbrain dopamine (DA) cell activity from responding to the reward to responding to the predictive cue. However, the precise time course and mechanism underlying this shift remain unclear. Here, we report a combined single-unit recording and temporal difference (TD) modeling approach to this question. The data from recordings in conscious rats showed that DA cells retain responses to predicted reward after responses to conditioned cues have developed, at least early in training. This contrasts with previous TD models that predict a gradual stepwise shift in latency with responses to rewards lost before responses develop to the conditioned cue. By exploring the TD parameter space, we demonstrate that the persistent reward responses of DA cells during conditioning are only accurately replicated by a TD model with long-lasting eligibility traces (nonzero values for the parameter lambda) and low learning rate (alpha). These physiological constraints for TD parameters suggest that eligibility traces and low per-trial rates of plastic modification may be essential features of neural circuits for reward learning in the brain. Such properties enable rapid but stable initiation of learning when the number of stimulus-reward pairings is limited, conferring significant adaptive advantages in real-world environments.
Collapse
Affiliation(s)
- Wei-Xing Pan
- Department of Physiology, School of Medical Sciences, University of Otago, Dunedin 9001, New Zealand
| | | | | | | |
Collapse
|
12
|
Church R. A Concise Introduction to Scalar Timing Theory. FUNCTIONAL AND NEURAL MECHANISMS OF INTERVAL TIMING 2003. [DOI: 10.1201/9780203009574.sec1] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
|
13
|
Porr B, Wörgötter P. Isotropic sequence order learning using a novel linear algorithm in a closed loop behavioural system. Biosystems 2002; 67:195-202. [PMID: 12459299 DOI: 10.1016/s0303-2647(02)00077-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
In this article, we present an isotropic algorithm for sequence order learning. Its central goal is to learn the causal relation between two (or more) inputs in order to react to the earliest incoming signal after successful learning (like in typical classical conditioning situations). We implement this algorithm in a behaving system (a robot) thereby creating a closed loop situation where the learner's actions influence its own sensor inputs to the end of creating an autonomous agent. Autonomous behaviour implies that learning goals are internally defined within the organism's capabilities. Standard learning models for sequence learning (e.g. temporal difference (TD)-learning) need an externally defined reward. This, however, is in conflict with the requirement of an implicitly defined internal goal in autonomous behaviour. Therefore, in this study we present a system in which the external reward is replaced by a reflex loop. This loop explicitly includes the environment. Every reflex loop has the inherent disadvantage, which is that its re-actions occur each time just after a reflex-eliciting sensor event and thus 'too late'. However, a reflex can serve as the internal reference for sequence order learning, which has the task of eliminating this disadvantage by creating earlier anticipatory actions. In our system learning is achieved by modifying synaptic weights of a linear neuron with a correlation based learning rule which involves the derivative of the neuron's output. All input lines are entirely isotropic. The synaptic weight change curve of this rule is strongly related to the temporal Hebb learning rule, which was found in spike timing experiments. We find that after learning the reflex loop is replaced in functional terms with an earlier anticipatory action (and pathway). In addition, we observed that the synaptic weights stabilise as soon as the reflex remains silent.
Collapse
Affiliation(s)
- B Porr
- Department of Psychology, University of Stirling, Stirling, UK
| | | |
Collapse
|
14
|
Abstract
The control or prediction of the precise timing of events are central aspects of the many tasks assigned to the cerebellum. Despite much detailed knowledge of its physiology and anatomy, it remains unclear how the cerebellar circuitry can achieve such an adaptive timing function. We present a computational model pursuing this question for one extensively studied type of cerebellar-mediated learning: the classical conditioning of discrete motor responses. This model combines multiple current assumptions on the function of the cerebellar circuitry and was used to investigate whether plasticity in the cerebellar cortex alone can mediate adaptive conditioned response timing. In particular, we studied the effect of changes in the strength of the synapses formed between parallel fibres and Purkinje cells under the control of a negative feedback loop formed between inferior olive, cerebellar cortex and cerebellar deep nuclei. The learning performance of the model was evaluated at the circuit level in simulated conditioning experiments as well as at the behavioural level using a mobile robot. We demonstrate that the model supports adaptively timed responses under real-world conditions. Thus, in contrast to many other models that have focused on cerebellar-mediated conditioning, we investigated whether and how the suggested underlying mechanisms could give rise to behavioural phenomena.
Collapse
Affiliation(s)
- Constanze Hofstötter
- Institute of Neuroinformatics, University and ETH Zurich, Winterthurerstr. 190, CH-8057 Zurich, Switzerland
| | | | | |
Collapse
|
15
|
Gluck MA, Ermita BR, Oliver LM, Myers CE. Extending models of hippocampal function in animal conditioning to human amnesia. Memory 1997; 5:179-212. [PMID: 9156098 DOI: 10.1080/741941141] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Although most analyses of amnesia have focused on the loss of explicit declarative and episodic memories following hippocampal-region damage, considerable insights into amnesia can also be realised by studying hippocampal function in simple procedural, or habit-based, associative learning tasks. Although many simple forms of associative learning are unimpaired by hippocampal damage, more complex tasks which require sensitivity to unreinforced stimuli, configurations of multiple stimuli, or contextual information are impaired by hippocampal damage. In several recent papers we have developed a computational theory of hippocampal function which argues that this brain region plays a critical role in the formation of new stimulus representations during learning (Gluck & Myers, 1993, 1995; Myers & Gluck, 1996; Myers, Gluck, & Granger, 1995). We have applied this theory to a broad range of empirical data from studies of classical conditioning in both intact and hippocampal-lesioned animals, and the model correctly accounts for these data. The classical conditioning paradigm can be adapted for use in humans, and similar results for acquisition are obtained in both normal and hippocampal-damaged humans. More recently, we have begun to address an important set of category learning studies in both normals and hippocampal-damaged amnesics. This work integrates experimental studies of amnesic category learning (Knowlton, Squire, & Gluck, 1994) with theoretical accounts of associative learning, and builds on previously established behavioural correspondences between animal conditioning and human category learning (Gluck & Bower, 1988a). Our work to date illustrates some initial progress towards a more integrative understanding of hippocampal function in both animal and human learning, which may be useful in guiding further empirical and theoretical research in human memory and amnesia.
Collapse
Affiliation(s)
- M A Gluck
- Center for Molecular and Behavioral Neuroscience, Rutgers University, Newark, New Jersey 07102, USA.
| | | | | | | |
Collapse
|
16
|
Affiliation(s)
- S Hampson
- Department of Information and Computing, University of California, Irvine 92717
| |
Collapse
|
17
|
Blazis DE, Moore JW. Conditioned stimulus duration in classical trace conditioning: test of a real-time neural network model. Behav Brain Res 1991; 43:73-8. [PMID: 1650232 DOI: 10.1016/s0166-4328(05)80054-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
In classical trace conditioning, the interstimulus interval (ISI) is equal to the conditioned stimulus (CS) duration plus the trace interval (TI), the interval between CS offset and unconditioned stimulus (US) onset. The Sutton-Barto-Desmond neural-network model of classical conditioning predicts that, with a sufficiently long TI, conditioning will be faster with a CS of relatively long duration than with one of shorter duration. This prediction is illustrated with simulations and tested with the rabbit nictitating membrane response. Animals were trained with a tone CS of 350- or 700-ms duration. The TI was fixed at 300 ms, so that the ISI for the two durations was 650 or 1000 ms, respectively. Another factor in the experimental design was tone intensity (63 or 83 dB). Consistent with the model's prediction, conditioning was faster with the longer ISI, but only with the louder tone. The results have implications for computational models of classical conditioning.
Collapse
Affiliation(s)
- D E Blazis
- Program in Neuroscience and Behavior, University of Massachusetts, Amherst 01003
| | | |
Collapse
|
18
|
Witt JC, Clark JW. Experiments in artificial psychology: conditioning of asynchronous neural network models. Math Biosci 1990; 99:77-104. [PMID: 2134515 DOI: 10.1016/0025-5564(90)90140-t] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
An asynchronous model for the dynamics of neural networks admits learning behaviors characteristic of classical and operant conditioning provided that appropriate plasticity algorithms are chosen. Stimulus generalization and discrimination can also be observed. Studies of such psychological phenomena are carried out by computer simulation of networks with designated sensory, association, and motor neurons, and the results are compared to those for live subjects. Various prescriptions for plasticity are investigated, including those corresponding to reward, punishment, and unlearning routines. These are characterized by their effect on network stability as quantified by a newly proposed stability measure.
Collapse
Affiliation(s)
- J C Witt
- McDonnell Center for the Space Sciences, Washington University, St. Louis, Missouri 63130
| | | |
Collapse
|
19
|
|
20
|
Kleinfeld D, Sompolinsky H. Associative neural network model for the generation of temporal patterns. Theory and application to central pattern generators. Biophys J 1988; 54:1039-51. [PMID: 3233265 PMCID: PMC1330416 DOI: 10.1016/s0006-3495(88)83041-8] [Citation(s) in RCA: 119] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Cyclic patterns of motor neuron activity are involved in the production of many rhythmic movements, such as walking, swimming, and scratching. These movements are controlled by neural circuits referred to as central pattern generators (CPGs). Some of these circuits function in the absence of both internal pacemakers and external feedback. We describe an associative neural network model whose dynamic behavior is similar to that of CPGs. The theory predicts the strength of all possible connections between pairs of neurons on the basis of the outputs of the CPG. It also allows the mean operating levels of the neurons to be deduced from the measured synaptic strengths between the pairs of neurons. We apply our theory to the CPG controlling escape swimming in the mollusk Tritonia diomedea. The basic rhythmic behavior is shown to be consistent with a simplified model that approximates neurons as threshold units and slow synaptic responses as elementary time delays. The model we describe may have relevance to other fixed action behaviors, as well as to the learning, recall, and recognition of temporally ordered information.
Collapse
Affiliation(s)
- D Kleinfeld
- Molecular Biophysics Research Department, AT&T Bell Laboratories, Murray Hill, New Jersey 07974
| | | |
Collapse
|
21
|
Temporal primacy overrides prior training in serial compound conditioning of the rabbit’s nictitating membrane response. ACTA ACUST UNITED AC 1987. [DOI: 10.3758/bf03205056] [Citation(s) in RCA: 49] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
22
|
Grossberg S, Levine DS. Neural dynamics of attentionally modulated Pavlovian conditioning: blocking, interstimulus interval, and secondary reinforcement. APPLIED OPTICS 1987; 26:5015-5030. [PMID: 20523481 DOI: 10.1364/ao.26.005015] [Citation(s) in RCA: 140] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Selective information processing in neural networks is studied through computer simulations of Pavlovian conditioning data. The model reproduces properties of blocking, inverted-U in learning as a function of interstimulus interval, anticipatory conditioned responses, secondary reinforcement, attentional focusing by conditioned motivational feedback, and limited capacity short-term memory processing. Conditioning occurs from sensory to drive representations (conditioned reinforcer learning), from drive to sensory representations (incentive motivational learning), and from sensory to motor representations (habit learning).The conditionable pathwas contain long-term memory traces that obey a non-Hebbian associative law. The neural model embodies a solution to two key design problems of conditioning, the synchronization and persistence problems. This model of vertebrate learning is compared with data and models of invertebrate learning. Predictions derived from models of vertebrate learning are compared with data about invertebrate learning, including data from Aplysia about facilitator neurons and data from Hermissenda about voltage-dependent Ca(2+) currents. A prediction is stated about classical conditioning in all species, called the secondary conditioning alternative, and if confirmed would constitute an evolutionary invariant of learning.
Collapse
|
23
|
Moore JW, Desmond JE, Berthier NE, Blazis DE, Sutton RS, Barto AG. Simulation of the classically conditioned nictitating membrane response by a neuron-like adaptive element: response topography, neuronal firing, and interstimulus intervals. Behav Brain Res 1986; 21:143-54. [PMID: 3755947 DOI: 10.1016/0166-4328(86)90092-6] [Citation(s) in RCA: 72] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
A neuron-like adaptive element with computational features suitable for classical conditioning, the Sutton-Barto (S-B) model, was extended to simulate real-time aspects of the conditioned nictitating membrane (NM) response. The aspects of concern were response topography, CR-related neuronal firing, and interstimulus interval (ISI) effects for forward-delay and trace conditioning paradigms. The topography of the NM CR has the following features: response latency after CS onset decreases over trials; response amplitude increases gradually within the ISI and attains its maximum coincidentally with the UR. A similar pattern characterizes the firing of some (but not all) neurons in brain regions demonstrated experimentally to be important for NM conditioning. The variant of the S-B model described in this paper consists of a set of parameters and implementation rules based on 10-ms computational time steps. It differs from the original S-B model in a number of ways. The main difference is the assumption that CS inputs to the adaptive element are not instantaneous but are instead shaped by unspecified coding processes so as to produce outputs that conform with the real-time properties of NM conditioning. The model successfully simulates the aforementioned features of NM response topography. It is also capable of simulating appropriate ISI functions, i.e. with maximum conditioning strength with ISIs of 250 ms, for forward-delay and trace paradigms. The original model's successful treatment of multiple-CS phenomena, such as blocking, conditioned inhibition, and higher-order conditioning, are retained by the present model.
Collapse
|
24
|
|