51
Yi R, Landes RD, Bickel WK. Novel Models of Intertemporal Valuation: Past and Future Outcomes. J Neurosci Psychol Econ 2009; 2:102. [PMID: 20157625] [DOI: 10.1037/a0017571]
Abstract
Temporal discounting refers to the reduction in the present subjective value of an outcome as a function of the temporal distance to that outcome. Though a number of mathematical models have been proposed to describe this time/value relationship, this search has largely excluded insights from the literature on memory decay. This study examines the utility of memory decay models by comparing the fits of four of these models to fits from established temporal discounting models using past and future temporal discounting data. These results (1) suggest that a single model describes valuation of both future and past outcomes, (2) indicate the exponential-power model, from memory decay literature, is statistically superior in fitting discounting data from both past and future outcomes, and (3) support the advancing perspective of the psychological interconnectedness of the future and past.
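The model classes being compared can be sketched as simple value functions: two standard discounting forms (exponential and hyperbolic) alongside the exponential-power form borrowed from the memory-decay literature. The parameter names and values below are arbitrary illustrations, not the paper's notation or fitted estimates:

```python
import math

# Candidate discounting functions mapping amount A and delay d to present value.
def exponential(A, d, k):
    return A * math.exp(-k * d)

def hyperbolic(A, d, k):
    return A / (1.0 + k * d)

def exponential_power(A, d, k, s):
    # Exponential-power form: decay is exponential in d**s rather than in d,
    # which lets the curve fall steeply at first and flatten at long delays.
    return A * math.exp(-k * d ** s)

A, k, s = 100.0, 0.05, 0.6
for d in (0, 1, 10, 100):
    print(d, round(exponential(A, d, k), 1),
          round(hyperbolic(A, d, k), 1),
          round(exponential_power(A, d, k, s), 1))
```

All three functions return the full amount at zero delay and decline monotonically; model comparison then turns on which curvature best fits observed past- and future-discounting data.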
Affiliation(s)
- Richard Yi
- University of Arkansas for Medical Sciences

52
Han CE, Arbib MA, Schweighofer N. Stroke rehabilitation reaches a threshold. PLoS Comput Biol 2008; 4:e1000133. [PMID: 18769588] [PMCID: PMC2527783] [DOI: 10.1371/journal.pcbi.1000133]
Abstract
Motor training with the upper limb affected by stroke partially reverses the loss of cortical representation after lesion and has been proposed to increase spontaneous arm use. Moreover, repeated attempts to use the affected hand in daily activities create a form of practice that can potentially lead to further improvement in motor performance. We thus hypothesized that if motor retraining after stroke increases spontaneous arm use sufficiently, then the patient will enter a virtuous circle in which spontaneous arm use and motor performance reinforce each other. In contrast, if the dose of therapy is not sufficient to bring spontaneous use above threshold, then performance will not increase and the patient will further develop compensatory strategies with the less affected hand. To refine this hypothesis, we developed a computational model of bilateral hand use in arm reaching to study the interactions between adaptive decision making and motor relearning after motor cortex lesion. The model contains a left and a right motor cortex, each controlling the opposite arm, and a single action choice module. The action choice module learns, via reinforcement learning, the value of using each arm for reaching in specific directions. Each motor cortex uses a neural population code to specify the initial direction along which the contralateral hand moves towards a target. The motor cortex learns to minimize directional errors and to maximize neuronal activity for each movement. The derived learning rule accounts for the reversal of the loss of cortical representation after rehabilitation and the increase of this loss after stroke with insufficient rehabilitation. Further, our model exhibits nonlinear and bistable behavior: if natural recovery, motor training, or both, brings performance above a certain threshold, then training can be stopped, as the repeated spontaneous arm use provides a form of motor learning that further bootstraps performance and spontaneous use. 
Below this threshold, motor training is "in vain": there is little spontaneous arm use after training, the model exhibits learned nonuse, and compensatory movements with the less affected hand are reinforced. By exploring the nonlinear dynamics of stroke recovery using a biologically plausible neural model that accounts for reversal of the loss of motor cortex representation following rehabilitation or the lack thereof, we can explain previously hard-to-reconcile data on spontaneous arm use in stroke recovery. Further, our threshold prediction could be tested with an adaptive train-wait-train paradigm: if spontaneous arm use has increased in the "wait" period, then the threshold has been reached, and rehabilitation can be stopped. If spontaneous arm use is still low or has decreased, then another bout of rehabilitation is to be provided.

Stroke often leaves patients with predominantly unilateral functional limitations of the arm and hand. Although recovery of function after stroke is often achieved by compensatory use of the less affected limb, improving use of the more affected limb has been associated with increased quality of life. Here, we developed a biologically plausible model of bilateral reaching movements to investigate the mechanisms and conditions leading to effective rehabilitation. Our motor cortex model accounts for the experimental observation that motor training can reverse the loss of cortical representation due to lesion. Further, our model predicts that if spontaneous arm use is above a certain threshold, then training can be stopped, as the repeated spontaneous use provides a form of motor learning that further improves performance and spontaneous use. Below this threshold, training is "in vain," and compensatory movements with the less affected hand are reinforced. Our model is a first step in the development of adaptive and cost-effective rehabilitation methods tailored to individuals poststroke.
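The interaction the model describes, in which arm-choice values and motor performance reinforce each other, can be caricatured as a small reinforcement-learning loop. This is a sketch under stated assumptions (softmax action selection and scalar success probabilities standing in for the paper's neural populations), not the authors' implementation:

```python
import math
import random

random.seed(0)

# Action values learned by the choice module; the stroke-affected arm
# starts out both undervalued and less skilled.
value = {"affected": 0.2, "unaffected": 0.8}
performance = {"affected": 0.3, "unaffected": 0.9}  # probability of task success
alpha, beta = 0.1, 5.0  # learning rate, softmax inverse temperature

def choose():
    """Softmax selection over the two arms' action values."""
    weights = {arm: math.exp(beta * v) for arm, v in value.items()}
    r = random.random() * sum(weights.values())
    for arm, w in weights.items():
        r -= w
        if r <= 0:
            return arm
    return arm  # numerical edge case: fall back to the last arm

for _ in range(2000):
    arm = choose()
    reward = 1.0 if random.random() < performance[arm] else 0.0
    value[arm] += alpha * (reward - value[arm])  # value update toward outcome
    if arm == "affected":
        # Use-dependent plasticity: practice slowly improves performance,
        # a scalar stand-in for the paper's cortical relearning rule.
        performance["affected"] = min(1.0, performance["affected"] + 0.001)
```

With these starting values the affected arm is rarely selected, so its performance barely improves: the learned-nonuse regime. Raising its initial performance above the model's threshold would instead let spontaneous use and performance bootstrap each other.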
Affiliation(s)
- Cheol E. Han
- Department of Computer Science, University of Southern California, Los Angeles, California, United States of America
- USC Brain Project, University of Southern California, Los Angeles, California, United States of America
- Michael A. Arbib
- USC Brain Project, University of Southern California, Los Angeles, California, United States of America
- Department of Computer Science, University of Southern California, Los Angeles, California, United States of America
- Department of Neuroscience, University of Southern California, Los Angeles, California, United States of America
- Nicolas Schweighofer
- Department of Biokinesiology and Physical Therapy, University of Southern California, Los Angeles, California, United States of America

53
Reinforcement learning: the good, the bad and the ugly. Curr Opin Neurobiol 2008; 18:185-96. [PMID: 18708140] [DOI: 10.1016/j.conb.2008.08.003]
Abstract
Reinforcement learning provides both qualitative and quantitative frameworks for understanding and modeling adaptive decision-making in the face of rewards and punishments. Here we review the latest dispatches from the forefront of this field, and map out some of the territories where lie monsters.
54
Kim S, Hwang J, Lee D. Prefrontal coding of temporally discounted values during intertemporal choice. Neuron 2008; 59:161-72. [PMID: 18614037] [DOI: 10.1016/j.neuron.2008.05.010]
Abstract
Reward from a particular action is seldom immediate, and the influence of such delayed outcome on choice decreases with delay. It has been postulated that when faced with immediate and delayed rewards, decision makers choose the option with maximum temporally discounted value. We examined the preference of monkeys for delayed reward in an intertemporal choice task and the neural basis for real-time computation of temporally discounted values in the dorsolateral prefrontal cortex. During this task, the locations of the targets associated with small or large rewards and their corresponding delays were randomly varied. We found that prefrontal neurons often encoded the temporally discounted value of reward expected from a particular option. Furthermore, activity tended to increase with discounted values for targets presented in the neuron's preferred direction, suggesting that activity related to temporally discounted values in the prefrontal cortex might determine the animal's behavior during intertemporal choice.
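The decision rule under test, choosing the option with the maximum temporally discounted value, is easy to state in code. A hyperbolic discount function and the value of k are assumed here purely for illustration:

```python
def discounted_value(amount, delay, k=0.25):
    # Hyperbolic discounting: value falls off as 1 / (1 + k * delay).
    return amount / (1.0 + k * delay)

def choose(small, large, k=0.25):
    # Each option is an (amount, delay) pair; pick the larger discounted value.
    return "small" if discounted_value(*small, k) > discounted_value(*large, k) else "large"

# A short delay leaves the large reward more valuable...
print(choose((1, 0), (2, 2)))
# ...but a long enough delay reverses the preference toward the
# small immediate reward.
print(choose((1, 0), (2, 8)))
```

The paper's finding is that dorsolateral prefrontal activity tracks a quantity like `discounted_value` for the option in a neuron's preferred direction, computed trial by trial as amounts and delays vary.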
Affiliation(s)
- Soyoun Kim
- Department of Neurobiology, Yale University School of Medicine, New Haven, CT 06510, USA

55
La Camera G, Richmond BJ. Modeling the violation of reward maximization and invariance in reinforcement schedules. PLoS Comput Biol 2008; 4:e1000131. [PMID: 18688266] [PMCID: PMC2453237] [DOI: 10.1371/journal.pcbi.1000131]
Abstract
It is often assumed that animals and people adjust their behavior to maximize reward acquisition. In visually cued reinforcement schedules, monkeys make errors in trials that are not immediately rewarded, despite having to repeat error trials. Here we show that error rates are typically smaller in trials equally distant from reward but belonging to longer schedules (referred to as "schedule length effect"). This violates the principles of reward maximization and invariance and cannot be predicted by the standard methods of Reinforcement Learning, such as the method of temporal differences. We develop a heuristic model that accounts for all of the properties of the behavior in the reinforcement schedule task but whose predictions are not different from those of the standard temporal difference model in choice tasks. In the modification of temporal difference learning introduced here, the effect of schedule length emerges spontaneously from the sensitivity to the immediately preceding trial. We also introduce a policy for general Markov Decision Processes, where the decision made at each node is conditioned on the motivation to perform an instrumental action, and show that the application of our model to the reinforcement schedule task and the choice task are special cases of this general theoretical framework. Within this framework, Reinforcement Learning can approach contextual learning with the mixture of empirical findings and principled assumptions that seem to coexist in the best descriptions of animal behavior. As examples, we discuss two phenomena observed in humans that often derive from the violation of the principle of invariance: "framing," wherein equivalent options are treated differently depending on the context in which they are presented, and the "sunk cost" effect, the greater tendency to continue an endeavor once an investment in money, effort, or time has been made. The schedule length effect might be a manifestation of these phenomena in monkeys.
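For reference, the "standard methods of Reinforcement Learning" against which the schedule-length effect is contrasted can be sketched as a tabular TD(0) update over a chain of schedule states (a generic textbook version, not the authors' modified model):

```python
# Tabular TD(0) over a chain of schedule states leading to reward.
states = ["s1", "s2", "s3", "reward"]
V = {s: 0.0 for s in states}
alpha, gamma = 0.1, 0.9  # learning rate and discount factor

def td_episode():
    # Sweep the chain once, updating each state toward its successor.
    for i, s in enumerate(states[:-1]):
        nxt = states[i + 1]
        r = 1.0 if nxt == "reward" else 0.0
        delta = r + gamma * V[nxt] - V[s]  # TD error
        V[s] += alpha * delta

for _ in range(500):
    td_episode()

print({s: round(v, 2) for s, v in V.items()})
```

In this baseline, value depends only on distance to reward, so states equally distant from reward in short and long schedules receive identical values; the schedule-length effect reported in the monkeys is precisely what such a model cannot produce.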
Affiliation(s)
- Giancarlo La Camera
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Department of Health and Human Services, Bethesda, Maryland, United States of America
- Barry J. Richmond
- Laboratory of Neuropsychology, National Institute of Mental Health, National Institutes of Health, Department of Health and Human Services, Bethesda, Maryland, United States of America

56
Abstract
Previous animal experiments have shown that serotonin is involved in the control of impulsive choice, as characterized by a high preference for small immediate rewards over larger delayed rewards. Previous human studies under serotonin manipulation, however, have been either inconclusive on the effect on impulsivity or have shown an effect on the speed of action-reward learning or the optimality of action choice. Here, we manipulated central serotonergic levels of healthy volunteers by dietary tryptophan depletion and loading. Subjects performed a "dynamic" delayed reward choice task that required a continuous update of the reward value estimates to maximize total gain. By using a computational model of delayed reward choice learning, we estimated the parameters governing the subjects' reward choices in low-, normal-, and high-serotonin conditions. We found an increase in the proportion of small reward choices, together with an increase in the rate of discounting of delayed rewards, in the low-serotonin condition compared with the control and high-serotonin conditions. There were no significant differences between conditions in the speed of learning of the estimated delayed reward values or in the variability of reward choice. Therefore, in line with previous animal experiments, our results show that low serotonin levels steepen delayed reward discounting in humans. The combined results of our previous and current studies suggest that serotonin may adjust the rate of delayed reward discounting via the modulation of specific loops in parallel corticobasal ganglia circuits.
57
Abstract
Pavlovian predictions of future aversive outcomes lead to behavioral inhibition, suppression, and withdrawal. There is considerable evidence for the involvement of serotonin in both the learning of these predictions and the inhibitory consequences that ensue, although less for a causal relationship between the two. In the context of a highly simplified model of chains of affectively charged thoughts, we interpret the combined effects of serotonin in terms of pruning a tree of possible decisions (i.e., eliminating those choices that have low or negative expected outcomes). We show how a drop in behavioral inhibition, putatively resulting from an experimentally or psychiatrically influenced drop in serotonin, could result in unexpectedly large negative prediction errors and a significant aversive shift in reinforcement statistics. We suggest an interpretation of this finding that helps dissolve an apparent contradiction: inhibition of serotonin reuptake is the first-line treatment of depression, although serotonin itself is most strongly linked with aversive rather than appetitive outcomes and predictions. Serotonin is an evolutionarily ancient neuromodulator, probably best known for its role in psychiatric disorders. However, that role has long appeared contradictory to its role in normal function, and indeed its various roles in normal affective behaviors have been hard to reconcile. Here, we model two predominant functions of normal serotonin in a highly simplified reinforcement learning model and show how these may explain some of its complex roles in depression and anxiety.
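The pruning mechanism can be illustrated with a toy tree search in which any branch whose immediate outcome falls below an inhibition threshold is abandoned before its subtree is evaluated, the threshold standing in for serotonergic behavioral inhibition. The tree, payoffs, and threshold values below are invented for illustration:

```python
# Each node is (immediate_outcome, children); leaves have no children.
tree = (0, [
    (-2, [(10, []), (-1, [])]),  # aversive first step hiding a large payoff
    (1,  [(1, []),  (2, [])]),   # safe branch with modest payoffs
])

def best_value(node, threshold):
    outcome, children = node
    if outcome < threshold:
        # Pruned: the chain of thought is inhibited here, so the
        # subtree's true value is never discovered.
        return outcome
    if not children:
        return outcome
    return outcome + max(best_value(c, threshold) for c in children)

# Strong inhibition (high threshold) prunes the aversive branch and
# settles for the safe one; weak inhibition searches through the
# aversive step and finds the large payoff behind it.
print(best_value(tree, threshold=-1))
print(best_value(tree, threshold=-5))
```

Lowering the threshold also exposes the agent to branches with genuinely bad outcomes it would otherwise never sample, which is the route by which reduced inhibition can shift the experienced reinforcement statistics aversively.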
Affiliation(s)
- Peter Dayan
- Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom
- Quentin J. M. Huys
- Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom
- Center for Theoretical Neuroscience, Columbia University, New York, New York, United States of America

58
Kalenscher T, Pennartz CM. Is a bird in the hand worth two in the future? The neuroeconomics of intertemporal decision-making. Prog Neurobiol 2008; 84:284-315. [DOI: 10.1016/j.pneurobio.2007.11.004]
59
Tanaka SC, Schweighofer N, Asahi S, Shishida K, Okamoto Y, Yamawaki S, Doya K. Serotonin differentially regulates short- and long-term prediction of rewards in the ventral and dorsal striatum. PLoS One 2007; 2:e1333. [PMID: 18091999] [PMCID: PMC2129114] [DOI: 10.1371/journal.pone.0001333]
Abstract
Background: The ability to select an action by considering both delays and amount of reward outcome is critical for maximizing long-term benefits. Although previous animal experiments on impulsivity have suggested a role of serotonin in behaviors requiring prediction of delayed rewards, the underlying neural mechanism is unclear.

Methodology/Principal Findings: To elucidate the role of serotonin in the evaluation of delayed rewards, we performed a functional brain imaging experiment in which subjects chose small-immediate or large-delayed liquid rewards under dietary regulation of tryptophan, a precursor of serotonin. A model-based analysis revealed that the activity of the ventral part of the striatum was correlated with reward prediction at shorter time scales, and this correlated activity was stronger at low serotonin levels. By contrast, the activity of the dorsal part of the striatum was correlated with reward prediction at longer time scales, and this correlated activity was stronger at high serotonin levels.

Conclusions/Significance: Our results suggest that serotonin controls the time scale of reward prediction by differentially regulating activities within the striatum.
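The "time scale of reward prediction" maps onto the discount factor of an exponentially discounted reward sum: a small gamma makes the prediction short-sighted, while a gamma near one integrates rewards far into the future. The gamma values here are illustrative only, not the study's estimates:

```python
def predicted_value(rewards, gamma):
    # V = sum over t of gamma**t * r_t: gamma near 0 weights only imminent
    # rewards, gamma near 1 integrates rewards far into the future.
    return sum(gamma ** t * r for t, r in enumerate(rewards))

# One unit of reward delivered after 10 steps.
delayed = [0.0] * 10 + [1.0]

short_sighted = predicted_value(delayed, gamma=0.3)  # nearly worthless now
far_sighted = predicted_value(delayed, gamma=0.99)   # retains most of its value

print(round(short_sighted, 4), round(far_sighted, 4))
```

In the study's model-based analysis, ventral striatal activity tracked predictions computed with small gammas and dorsal striatal activity those with large gammas, with serotonin levels modulating the two regimes in opposite directions.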
Affiliation(s)
- Saori C. Tanaka
- Department of Computational Neurobiology, ATR Computational Neuroscience Laboratories, Seika, Souraku, Kyoto, Japan
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Seika, Souraku, Kyoto, Japan
- Nicolas Schweighofer
- Department of Computational Neurobiology, ATR Computational Neuroscience Laboratories, Seika, Souraku, Kyoto, Japan
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Seika, Souraku, Kyoto, Japan
- Department of Biokinesiology and Physical Therapy, University of Southern California, Los Angeles, California, United States of America
- Shuji Asahi
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Seika, Souraku, Kyoto, Japan
- Department of Psychiatry and Neurosciences, Hiroshima University, Minamiku, Hiroshima, Japan
- Kazuhiro Shishida
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Seika, Souraku, Kyoto, Japan
- Department of Psychiatry and Neurosciences, Hiroshima University, Minamiku, Hiroshima, Japan
- Yasumasa Okamoto
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Seika, Souraku, Kyoto, Japan
- Department of Psychiatry and Neurosciences, Hiroshima University, Minamiku, Hiroshima, Japan
- Shigeto Yamawaki
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Seika, Souraku, Kyoto, Japan
- Department of Psychiatry and Neurosciences, Hiroshima University, Minamiku, Hiroshima, Japan
- Kenji Doya
- Department of Computational Neurobiology, ATR Computational Neuroscience Laboratories, Seika, Souraku, Kyoto, Japan
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Seika, Souraku, Kyoto, Japan
- Neural Computational Unit, Okinawa Institute of Science and Technology, Suzaki, Uruma, Okinawa, Japan

60
Rosati AG, Stevens JR, Hare B, Hauser MD. The evolutionary origins of human patience: temporal preferences in chimpanzees, bonobos, and human adults. Curr Biol 2007; 17:1663-8. [PMID: 17900899] [DOI: 10.1016/j.cub.2007.08.033]
Abstract
To make adaptive choices, individuals must sometimes exhibit patience, forgoing immediate benefits to acquire more valuable future rewards [1-3]. Although humans account for future consequences when making temporal decisions [4], many animal species wait only a few seconds for delayed benefits [5-10]. Current research thus suggests a phylogenetic gap between patient humans and impulsive, present-oriented animals [9, 11], a distinction with implications for our understanding of economic decision making [12] and the origins of human cooperation [13]. On the basis of a series of experimental results, we reject this conclusion. First, bonobos (Pan paniscus) and chimpanzees (Pan troglodytes) exhibit a degree of patience not seen in other animals tested thus far. Second, humans are less willing to wait for food rewards than are chimpanzees. Third, humans are more willing to wait for monetary rewards than for food, and show the highest degree of patience only in response to decisions about money involving low opportunity costs. These findings suggest that core components of the capacity for future-oriented decisions evolved before the human lineage diverged from apes. Moreover, the different levels of patience that humans exhibit might be driven by fundamental differences in the mechanisms representing biological versus abstract rewards.
Affiliation(s)
- Alexandra G Rosati
- Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, Leipzig D-04103, Germany

61
Vardavas R, Breban R, Blower S. Can influenza epidemics be prevented by voluntary vaccination? PLoS Comput Biol 2007; 3:e85. [PMID: 17480117] [PMCID: PMC1864996] [DOI: 10.1371/journal.pcbi.0030085]
Abstract
Previous modeling studies have identified the vaccination coverage level necessary for preventing influenza epidemics, but have not shown whether this critical coverage can be reached. Here we use computational modeling to determine, for the first time, whether the critical coverage for influenza can be achieved by voluntary vaccination. We construct a novel individual-level model of human cognition and behavior; individuals are characterized by two biological attributes (memory and adaptability) that they use when making vaccination decisions. We couple this model with a population-level model of influenza that includes vaccination dynamics. The coupled models allow individual-level decisions to influence influenza epidemiology and, conversely, influenza epidemiology to influence individual-level decisions. By including the effects of adaptive decision-making within an epidemic model, we can reproduce two essential characteristics of influenza epidemiology: annual variation in epidemic severity and sporadic occurrence of severe epidemics. We suggest that individual-level adaptive decision-making may be an important (previously overlooked) causal factor in driving influenza epidemiology. We find that severe epidemics cannot be prevented unless vaccination programs offer incentives. Frequency of severe epidemics could be reduced if programs provide, as an incentive to be vaccinated, several years of free vaccines to individuals who pay for one year of vaccination. The magnitude of epidemic amelioration will be determined by the number of years of free vaccination, individuals' adaptability in decision-making, and their memory. This type of incentive program could control epidemics if individuals are very adaptable and have long-term memories. However, incentive-based programs that provide free vaccination for families could increase the frequency of severe epidemics.
We conclude that incentive-based vaccination programs are necessary to control influenza, but some may be detrimental. Surprisingly, we find that individuals' memories and flexibility in adaptive decision-making can be extremely important factors in determining the success of influenza vaccination programs. Finally, we discuss the implication of our results for controlling pandemics.
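A minimal caricature of such an individual-level model: each season, a fraction of individuals (set by an adaptability parameter) revise a memory-weighted propensity to vaccinate according to whether an epidemic occurred. The update rule and all parameter values below are invented for illustration and are not the authors' formulation:

```python
import random

random.seed(1)

N = 1000                  # population size
critical = 0.6            # vaccination coverage needed to prevent an epidemic
memory = 0.7              # weight on an individual's past propensity
adaptability = 0.3        # chance an individual revises their strategy each year

propensity = [0.5] * N    # initial probability of choosing vaccination
history = []              # coverage achieved each season

for season in range(50):
    vaccinated = [random.random() < p for p in propensity]
    coverage = sum(vaccinated) / N
    history.append(coverage)
    epidemic = coverage < critical
    # Inductive update: an epidemic year makes vaccination look worthwhile,
    # while an epidemic-free year invites free-riding.
    target = 1.0 if epidemic else 0.0
    for i in range(N):
        if random.random() < adaptability:
            propensity[i] = memory * propensity[i] + (1 - memory) * target

print([round(c, 2) for c in history[-5:]])
```

Because free-riding is rewarded whenever coverage clears the threshold, coverage in this toy dynamic hovers around the critical level rather than settling safely above it, which is the qualitative reason voluntary vaccination alone fails to prevent severe epidemics.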
Affiliation(s)
- Raffaele Vardavas
- Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California, United States of America
- Romulus Breban
- Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California, United States of America
- Sally Blower
- Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California, United States of America