Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Barto AG, Sutton RS. Simulation of anticipatory responses in classical conditioning by a neuron-like adaptive element. Behav Brain Res 1982;4:221-35. [PMID: 6277346 DOI: 10.1016/0166-4328(82)90001-8] [Citation(s) in RCA: 24] [Impact Index Per Article: 0.6] [Indexed: 01/19/2023]

Cited by Other Article(s)

Wilbrecht L, Davidow JY. Goal-directed learning in adolescence: neurocognitive development and contextual influences. Nat Rev Neurosci 2024;25:176-194. [PMID: 38263216 DOI: 10.1038/s41583-023-00783-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 12/12/2023] [Indexed: 01/25/2024]

Krugliakova E, Klucharev V, Fedele T, Gorin A, Kuznetsova A, Shestakova A. Correlation of cue-locked FRN and feedback-locked FRN in the auditory monetary incentive delay task. Exp Brain Res 2017;236:141-151. [PMID: 29196772 DOI: 10.1007/s00221-017-5113-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 12/22/2016] [Accepted: 10/24/2017] [Indexed: 02/02/2023]

Balodis IM, Potenza MN. Anticipatory reward processing in addicted populations: a focus on the monetary incentive delay task. Biol Psychiatry 2015;77:434-44. [PMID: 25481621 PMCID: PMC4315733 DOI: 10.1016/j.biopsych.2014.08.020] [Citation(s) in RCA: 156] [Impact Index Per Article: 17.3] [Received: 04/07/2014] [Revised: 08/12/2014] [Accepted: 08/26/2014] [Indexed: 11/26/2022]

Abstract

Advances in brain imaging techniques have allowed neurobiological research to temporally analyze signals coding for the anticipation of reward. In addicted populations, both hyporesponsiveness and hyperresponsiveness of brain regions (e.g., ventral striatum) implicated in drug effects and reward system processing have been reported during anticipation of generalized reward. We discuss the current state of knowledge of reward processing in addictive disorders from a widely used and validated task: the monetary incentive delay task. Only studies applying the monetary incentive delay task in addicted and at-risk adult populations are reviewed, with a focus on anticipatory processing and striatal regions activated during task performance as well as the relationship of these regions with individual difference (e.g., impulsivity) and treatment outcome variables. We further review drug influences in challenge studies as a means to examine acute influences on reward processing in abstinent, recreationally using, and addicted populations. Generalized reward processing in addicted and at-risk populations is often characterized by divergent anticipatory signaling in the ventral striatum. Although methodologic and task variations may underlie some discrepant findings, anticipatory signaling in the ventral striatum may also be influenced by smoking status, drug metabolites, and treatment status in addicted populations. Divergent results across abstinent, recreationally using, and addicted populations demonstrate complexities in interpreting findings. Future studies would benefit from focusing on characterizing how impulsivity and other addiction-related features relate to anticipatory striatal signaling over time. Additionally, identifying how anticipatory signals recover or adjust after protracted abstinence will be important in understanding recovery processes.

Mondragón E, Gray J, Alonso E, Bonardi C, Jennings DJ. SSCC TD: a serial and simultaneous configural-cue compound stimuli representation for temporal difference learning. PLoS One 2014;9:e102469. [PMID: 25054799 PMCID: PMC4108321 DOI: 10.1371/journal.pone.0102469] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Received: 02/06/2014] [Accepted: 06/18/2014] [Indexed: 11/18/2022] [Open Access]

Aquili L, Liu AW, Shindou M, Shindou T, Wickens JR. Behavioral flexibility is increased by optogenetic inhibition of neurons in the nucleus accumbens shell during specific time segments. Learn Mem 2014;21:223-31. [PMID: 24639489 PMCID: PMC3966536 DOI: 10.1101/lm.034199.113] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Indexed: 12/05/2022]

Okatan M. Correlates of reward-predictive value in learning-related hippocampal neural activity. Hippocampus 2009;19:487-506. [PMID: 19123250 PMCID: PMC2742500 DOI: 10.1002/hipo.20535] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Indexed: 11/10/2022]

Abstract

Temporal difference learning (TD) is a popular algorithm in machine learning. Two learning signals that are derived from this algorithm, the predictive value and the prediction error, have been shown to explain changes in neural activity and behavior during learning across species. Here, the predictive value signal is used to explain the time course of learning-related changes in the activity of hippocampal neurons in monkeys performing an associative learning task. The TD algorithm serves as the centerpiece of a joint probability model for the learning-related neural activity and the behavioral responses recorded during the task. The neural component of the model consists of spiking neurons that compete and learn the reward-predictive value of task-relevant input signals. The predictive-value signaled by these neurons influences the behavioral response generated by a stochastic decision stage, which constitutes the behavioral component of the model. It is shown that the time course of the changes in neural activity and behavioral performance generated by the model exhibits key features of the experimental data. The results suggest that information about correct associations may be expressed in the hippocampus before it is detected in the behavior of a subject. In this way, the hippocampus may be among the earliest brain areas to express learning and drive the behavioral changes associated with learning. Correlates of reward-predictive value may be expressed in the hippocampus through rate remapping within spatial memory representations, they may represent reward-related aspects of a declarative or explicit relational memory representation of task contingencies, or they may correspond to reward-related components of episodic memory representations. These potential functions are discussed in connection with hippocampal cell assembly sequences and their reverse reactivation during the awake state. 
The results provide further support for the proposal that neural processes underlying learning may be implementing a temporal difference-like algorithm.
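Note: the two TD learning signals named in this abstract, the predictive value and the prediction error, can be illustrated with a minimal tabular TD(0) sketch. This is not the joint probability model from the paper; the chain task, the function name `td0_chain`, and all parameter values are assumptions chosen only for illustration.

```python
def td0_chain(n_states=5, episodes=2000, alpha=0.1, gamma=0.9):
    """Tabular TD(0) on a fixed chain: state 0 -> 1 -> ... -> terminal.

    Hypothetical toy task: a reward of 1 is delivered only on the
    transition out of the last state.
    """
    # One extra slot holds the terminal state, whose value stays at 0.
    values = [0.0] * (n_states + 1)
    first_errors = last_errors = None
    for episode in range(episodes):
        errors = []
        for s in range(n_states):
            reward = 1.0 if s == n_states - 1 else 0.0
            # Prediction error: reward plus discounted next value,
            # minus the current prediction.
            delta = reward + gamma * values[s + 1] - values[s]
            # Predictive value moves a fraction alpha toward the target.
            values[s] += alpha * delta
            errors.append(delta)
        if episode == 0:
            first_errors = errors
        last_errors = errors
    return values[:n_states], first_errors, last_errors


values, first_errors, last_errors = td0_chain()
print("predictive values:", [round(v, 3) for v in values])
print("first-episode errors:", first_errors)
```

Under these assumptions the prediction error is initially confined to the rewarded step and shrinks toward zero with training, while the predictive value of each earlier state approaches the reward discounted by its distance from the reward.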

A head-neck-eye system that learns fault-tolerant saccades to 3-D targets using a self-organizing neural model. Neural Netw 2008;21:1380-91. [PMID: 18775642 DOI: 10.1016/j.neunet.2008.07.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Received: 02/01/2007] [Revised: 07/31/2008] [Accepted: 07/31/2008] [Indexed: 11/22/2022]

Zeybek Z, Yüce Cetinkaya S, Alioglu F, Alpbaz M. Determination of optimum operating conditions for industrial dye wastewater treatment using adaptive heuristic criticism pH control. J Environ Manage 2007;85:404-14. [PMID: 17141939 DOI: 10.1016/j.jenvman.2006.10.013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 02/16/2005] [Revised: 10/08/2006] [Accepted: 10/17/2006] [Indexed: 05/12/2023]

Zeybek Z, Karapinar T, Alpbaz M, Hapoglu H. Application of adaptive heuristic criticism control (AHCC) to dye wastewater. J Environ Manage 2007;84:461-72. [PMID: 16949196 DOI: 10.1016/j.jenvman.2006.06.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/16/2005] [Revised: 06/19/2006] [Accepted: 06/21/2006] [Indexed: 05/11/2023]

Padlubnaya DB, Parekh NH, Brown TH. Neurophysiological theory of Kamin blocking in fear conditioning. Behav Neurosci 2006;120:337-52. [PMID: 16719698 DOI: 10.1037/0735-7044.120.2.337] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Indexed: 11/08/2022]

Pan WX, Schmidt R, Wickens JR, Hyland BI. Dopamine cells respond to predicted events during classical conditioning: evidence for eligibility traces in the reward-learning network. J Neurosci 2005;25:6235-42. [PMID: 15987953 PMCID: PMC6725057 DOI: 10.1523/jneurosci.1478-05.2005] [Citation(s) in RCA: 308] [Impact Index Per Article: 16.2] [Received: 04/15/2005] [Revised: 05/13/2005] [Accepted: 05/14/2005] [Indexed: 11/21/2022] [Open Access]

Church R. A Concise Introduction to Scalar Timing Theory. Functional and Neural Mechanisms of Interval Timing 2003. [DOI: 10.1201/9780203009574.sec1] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.0] [Indexed: 02/09/2023]

Porr B, Wörgötter P. Isotropic sequence order learning using a novel linear algorithm in a closed loop behavioural system. Biosystems 2002;67:195-202. [PMID: 12459299 DOI: 10.1016/s0303-2647(02)00077-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Indexed: 11/25/2022]

Abstract

In this article, we present an isotropic algorithm for sequence order learning. Its central goal is to learn the causal relation between two (or more) inputs in order to react to the earliest incoming signal after successful learning (like in typical classical conditioning situations). We implement this algorithm in a behaving system (a robot) thereby creating a closed loop situation where the learner's actions influence its own sensor inputs to the end of creating an autonomous agent. Autonomous behaviour implies that learning goals are internally defined within the organism's capabilities. Standard learning models for sequence learning (e.g. temporal difference (TD)-learning) need an externally defined reward. This, however, is in conflict with the requirement of an implicitly defined internal goal in autonomous behaviour. Therefore, in this study we present a system in which the external reward is replaced by a reflex loop. This loop explicitly includes the environment. Every reflex loop has the inherent disadvantage, which is that its re-actions occur each time just after a reflex-eliciting sensor event and thus 'too late'. However, a reflex can serve as the internal reference for sequence order learning, which has the task of eliminating this disadvantage by creating earlier anticipatory actions. In our system learning is achieved by modifying synaptic weights of a linear neuron with a correlation based learning rule which involves the derivative of the neuron's output. All input lines are entirely isotropic. The synaptic weight change curve of this rule is strongly related to the temporal Hebb learning rule, which was found in spike timing experiments. We find that after learning the reflex loop is replaced in functional terms with an earlier anticipatory action (and pathway). In addition, we observed that the synaptic weights stabilise as soon as the reflex remains silent.
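Note: the learning rule described in this abstract (a linear neuron whose weight change correlates an input with the derivative of the neuron's output) can be sketched in discrete time as follows. This is only an illustration under stated assumptions and not the authors' implementation: the exponential input traces, the midpoint approximation of the input, the per-trial accumulation of the weight change, and every name and parameter value are hypothetical choices made here.

```python
def trial_weight_change(w_early, reflex_active=True, steps=60,
                        t_early=10, t_reflex=15, decay=0.8, mu=0.01):
    """Weight change of the early input over one trial.

    Linear neuron: v = 1.0 * u_reflex + w_early * u_early.
    Rule (assumed discretization): dw = mu * u_early * dv/dt, with
    u_early taken at the midpoint of each step and dv/dt as a
    one-step difference of the output.
    """
    u_early = u_reflex = 0.0
    u_early_prev = 0.0
    v_prev = 0.0
    dw = 0.0
    for t in range(steps):
        # Exponential traces stand in for input filtering (assumption).
        u_early = decay * u_early + (1.0 if t == t_early else 0.0)
        pulse = 1.0 if (reflex_active and t == t_reflex) else 0.0
        u_reflex = decay * u_reflex + pulse
        # Output of the linear neuron; the reflex weight is fixed at 1.
        v = 1.0 * u_reflex + w_early * u_early
        # Correlate the early input with the change in the output.
        dw += mu * 0.5 * (u_early + u_early_prev) * (v - v_prev)
        u_early_prev, v_prev = u_early, v
    return dw


# Early signal precedes the reflex: its weight should grow.
print(trial_weight_change(w_early=0.0, reflex_active=True))
# Reflex silent: the weight change should be close to zero.
print(trial_weight_change(w_early=0.5, reflex_active=False))
```

In this sketch the weight of the earlier input increases when the reflex input follows it, and the change nearly vanishes when the reflex input stays silent, which mirrors the stabilisation the abstract reports.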

Hofstötter C, Mintz M, Verschure PFMJ. The cerebellum in action: a simulation and robotics study. Eur J Neurosci 2002;16:1361-76. [PMID: 12405996 DOI: 10.1046/j.1460-9568.2002.02182.x] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.6] [Indexed: 11/20/2022]

Gluck MA, Ermita BR, Oliver LM, Myers CE. Extending models of hippocampal function in animal conditioning to human amnesia. Memory 1997;5:179-212. [PMID: 9156098 DOI: 10.1080/741941141] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Indexed: 02/04/2023]

Abstract

Although most analyses of amnesia have focused on the loss of explicit declarative and episodic memories following hippocampal-region damage, considerable insights into amnesia can also be realised by studying hippocampal function in simple procedural, or habit-based, associative learning tasks. Although many simple forms of associative learning are unimpaired by hippocampal damage, more complex tasks which require sensitivity to unreinforced stimuli, configurations of multiple stimuli, or contextual information are impaired by hippocampal damage. In several recent papers we have developed a computational theory of hippocampal function which argues that this brain region plays a critical role in the formation of new stimulus representations during learning (Gluck & Myers, 1993, 1995; Myers & Gluck, 1996; Myers, Gluck, & Granger, 1995). We have applied this theory to a broad range of empirical data from studies of classical conditioning in both intact and hippocampal-lesioned animals, and the model correctly accounts for these data. The classical conditioning paradigm can be adapted for use in humans, and similar results for acquisition are obtained in both normal and hippocampal-damaged humans. More recently, we have begun to address an important set of category learning studies in both normals and hippocampal-damaged amnesics. This work integrates experimental studies of amnesic category learning (Knowlton, Squire, & Gluck, 1994) with theoretical accounts of associative learning, and builds on previously established behavioural correspondences between animal conditioning and human category learning (Gluck & Bower, 1988a). Our work to date illustrates some initial progress towards a more integrative understanding of hippocampal function in both animal and human learning, which may be useful in guiding further empirical and theoretical research in human memory and amnesia.

Hampson S. Problem solving in artificial neural networks. Prog Neurobiol 1994;42:229-81. [PMID: 8008826 DOI: 10.1016/0301-0082(94)90065-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Indexed: 01/28/2023]

Blazis DE, Moore JW. Conditioned stimulus duration in classical trace conditioning: test of a real-time neural network model. Behav Brain Res 1991;43:73-8. [PMID: 1650232 DOI: 10.1016/s0166-4328(05)80054-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Indexed: 12/28/2022]

Witt JC, Clark JW. Experiments in artificial psychology: conditioning of asynchronous neural network models. Math Biosci 1990;99:77-104. [PMID: 2134515 DOI: 10.1016/0025-5564(90)90140-t] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Indexed: 12/30/2022]

Conditioning and the Cerebellum. 1989. [DOI: 10.1007/978-1-4612-4536-0_16] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0]

Kleinfeld D, Sompolinsky H. Associative neural network model for the generation of temporal patterns. Theory and application to central pattern generators. Biophys J 1988;54:1039-51. [PMID: 3233265 PMCID: PMC1330416 DOI: 10.1016/s0006-3495(88)83041-8] [Citation(s) in RCA: 119] [Impact Index Per Article: 3.3] [Indexed: 01/04/2023] [Open Access]

Temporal primacy overrides prior training in serial compound conditioning of the rabbit’s nictitating membrane response. 1987. [DOI: 10.3758/bf03205056] [Citation(s) in RCA: 49] [Impact Index Per Article: 1.3] [Indexed: 11/08/2022]

Grossberg S, Levine DS. Neural dynamics of attentionally modulated Pavlovian conditioning: blocking, interstimulus interval, and secondary reinforcement. Appl Opt 1987;26:5015-5030. [PMID: 20523481 DOI: 10.1364/ao.26.005015] [Citation(s) in RCA: 140] [Impact Index Per Article: 3.8] [Indexed: 05/29/2023]

Moore JW, Desmond JE, Berthier NE, Blazis DE, Sutton RS, Barto AG. Simulation of the classically conditioned nictitating membrane response by a neuron-like adaptive element: response topography, neuronal firing, and interstimulus intervals. Behav Brain Res 1986;21:143-54. [PMID: 3755947 DOI: 10.1016/0166-4328(86)90092-6] [Citation(s) in RCA: 72] [Impact Index Per Article: 1.9] [Indexed: 01/07/2023]

Neural population modeling and psychology: A review. Math Biosci 1983. [DOI: 10.1016/0025-5564(83)90077-9] [Citation(s) in RCA: 65] [Impact Index Per Article: 1.6] [Indexed: 11/23/2022]