101
|
Rode AN, Moghaddam B, Morrison SE. Increased Goal Tracking in Adolescent Rats Is Goal-Directed and Not Habit-Like. Front Behav Neurosci 2020; 13:291. [PMID: 31992975 PMCID: PMC6971099 DOI: 10.3389/fnbeh.2019.00291] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 12/23/2019] [Indexed: 12/14/2022] Open
Abstract
When a cue is paired with reward in a different location, some animals will approach the site of reward during the cue, a behavior called goal tracking, while other animals will approach and interact with the cue itself: a behavior called sign tracking. Sign tracking is thought to reflect a tendency to transfer incentive salience from the reward to the cue. Adolescence is a time of heightened sensitivity to rewards, including environmental cues that have been associated with rewards, which may account for increased impulsivity and vulnerability to drug abuse. Surprisingly, however, studies have shown that adolescents are actually less likely to interact with the cue (i.e., sign track) than adult animals. We reasoned that adolescents might show decreased sign tracking, accompanied by increased apparent goal tracking, because they tend to attribute incentive salience to a more reward-proximal "cue": the food magazine. On the other hand, adolescence is also a time of enhanced exploratory behavior, novelty-seeking, and behavioral flexibility. Therefore, adolescents might truly express more goal-directed reward-seeking and less inflexible habit-like approach to a reward-associated cue. Using a reward devaluation procedure to distinguish between these two hypotheses, we found that adolescents indeed exhibit more goal tracking, and less sign tracking, than a comparable group of adults. Moreover, adolescents' goal tracking behavior is highly sensitive to reward devaluation and therefore goal-directed and not habit-like.
Collapse
Affiliation(s)
| | | | - Sara E. Morrison
- Department of Neuroscience, University of Pittsburgh, Pittsburgh, PA, United States
| |
Collapse
|
102
|
Brown VM, Chen J, Gillan CM, Price RB. Improving the Reliability of Computational Analyses: Model-Based Planning and Its Relationship With Compulsivity. BIOLOGICAL PSYCHIATRY: COGNITIVE NEUROSCIENCE AND NEUROIMAGING 2020; 5:601-609. [PMID: 32249207 DOI: 10.1016/j.bpsc.2019.12.019] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Revised: 12/30/2019] [Accepted: 12/30/2019] [Indexed: 12/25/2022]
Abstract
BACKGROUND Computational models show great promise in mapping latent decision-making processes onto dissociable neural substrates and clinical phenotypes. One prominent example in reinforcement learning is model-based planning, which specifically relates to transdiagnostic compulsivity. However, the reliability of computational model-derived measures such as model-based planning is unclear. Establishing reliability is necessary to ensure that such models measure stable, traitlike processes, as assumed in computational psychiatry. Although analysis approaches affect validity of reinforcement learning models and reliability of other task-based measures, their effect on reliability of reinforcement learning models of empirical data has not been systematically studied. METHODS We first assessed within- and across-session reliability and effects of analysis approaches (model estimation, parameterization, and data cleaning) of measures of model-based planning in patients with compulsive disorders (n = 38). The analysis approaches affecting test-retest reliability were tested in 3 large generalization samples (healthy participants: n = 541 and 111; people with a range of compulsivity: n = 1413). RESULTS Analysis approaches greatly influenced reliability: reliability of model-based planning measures ranged from 0 (no concordance) to above 0.9 (acceptable for clinical applications). The largest influence on reliability was whether model-estimation approaches were robust and accounted for the hierarchical structure of estimated parameters. Improvements in reliability generalized to other datasets and greatly reduced the sample size needed to find a relationship between model-based planning and compulsivity in an independent dataset. CONCLUSIONS These results indicate that computational psychiatry measures such as model-based planning can reliably measure latent decision-making processes, but when doing so must assess the ability of methods to estimate complex models from limited data.
Collapse
Affiliation(s)
- Vanessa M Brown
- Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA.
| | - Jiazhou Chen
- Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA
| | - Claire M Gillan
- Department of Psychology, Trinity College Dublin, Dublin, Ireland
| | - Rebecca B Price
- Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA.
| |
Collapse
|
103
|
Marzuki AA, Pereira de Souza AMFL, Sahakian BJ, Robbins TW. Are candidate neurocognitive endophenotypes of OCD present in paediatric patients? A systematic review. Neurosci Biobehav Rev 2019; 108:617-645. [PMID: 31821834 DOI: 10.1016/j.neubiorev.2019.12.010] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Revised: 12/01/2019] [Accepted: 12/06/2019] [Indexed: 01/03/2023]
Abstract
To-date it has been difficult to ascertain the exact cognitive profile of childhood OCD as studies report variable results. Adult OCD research lately utilises the endophenotype approach; studying cognitive traits that are present in both patients and their unaffected first-degree relatives, and are thought to lie closer to the genotype than the full-blown disorder. By observing whether candidate endopenotypes of adult OCD are present in child patients, we can determine whether the two subtypes show cognitive overlap. We conducted a systematic review of the paediatric OCD literature focussing on proposed neurocognitive endophenotypes of OCD: cognitive flexibility, response inhibition, memory, planning, decision-making, action monitoring, and reversal learning. We found that paediatric patients present robust increases in brain error related negativity associated with abnormal action monitoring, impaired decision-making under uncertainty, planning, and visual working memory, but there is less evidence for deficits in other cognitive domains. This implies that children with OCD show some cognitive similarities with adult patients, but other dysfunctions may only manifest later in the disorder trajectory.
Collapse
Affiliation(s)
- Aleya A Marzuki
- Behavioural and Clinical Neuroscience Institute, University of Cambridge, CB2 3EL, Cambridge, UK; Department of Psychology, Downing Site, University of Cambridge, CB2 3EB, Cambridge, UK.
| | - Ana Maria Frota Lisboa Pereira de Souza
- Behavioural and Clinical Neuroscience Institute, University of Cambridge, CB2 3EL, Cambridge, UK; Department of Psychology, Downing Site, University of Cambridge, CB2 3EB, Cambridge, UK.
| | - Barbara J Sahakian
- Herchel Smith Building, Department of Psychiatry, University of Cambridge, CB2 0SZ, Cambridge, UK.
| | - Trevor W Robbins
- Behavioural and Clinical Neuroscience Institute, University of Cambridge, CB2 3EL, Cambridge, UK; Department of Psychology, Downing Site, University of Cambridge, CB2 3EB, Cambridge, UK.
| |
Collapse
|
104
|
Jacquey L, Baldassarre G, Santucci VG, O’Regan JK. Sensorimotor Contingencies as a Key Drive of Development: From Babies to Robots. Front Neurorobot 2019; 13:98. [PMID: 31866848 PMCID: PMC6904889 DOI: 10.3389/fnbot.2019.00098] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Accepted: 11/06/2019] [Indexed: 01/22/2023] Open
Abstract
Much current work in robotics focuses on the development of robots capable of autonomous unsupervised learning. An essential prerequisite for such learning to be possible is that the agent should be sensitive to the link between its actions and the consequences of its actions, called sensorimotor contingencies. This sensitivity, and more particularly its role as a key drive of development, has been widely studied by developmental psychologists. However, the results of these studies may not necessarily be accessible or intelligible to roboticians. In this paper, we review the main experimental data demonstrating the role of sensitivity to sensorimotor contingencies in infants' acquisition of four fundamental motor and cognitive abilities: body knowledge, memory, generalization, and goal-directedness. We relate this data from developmental psychology to work in robotics, highlighting the links between these two domains of research. In the last part of the article we present a blueprint architecture demonstrating how exploitation of sensitivity to sensorimotor contingencies, combined with the notion of "goal," allows an agent to develop new sensorimotor skills. This architecture can be used to guide the design of specific computational models, and also to possibly envisage new empirical experiments.
Collapse
Affiliation(s)
- Lisa Jacquey
- Integrative Neuroscience and Cognition Center, UMR 8002, CNRS, Université Paris Descartes, Paris, France
- Laboratoire Ethologie Cognition Développement, Université Paris Nanterre, Nanterre, France
| | - Gianluca Baldassarre
- Laboratory of Computational Embodied Neuroscience, Istituto di Scienze e Tecnologie della Cognizione, Consiglio Nazionale delle Ricerche, Rome, Italy
| | - Vieri Giuliano Santucci
- Laboratory of Computational Embodied Neuroscience, Istituto di Scienze e Tecnologie della Cognizione, Consiglio Nazionale delle Ricerche, Rome, Italy
| | - J. Kevin O’Regan
- Integrative Neuroscience and Cognition Center, UMR 8002, CNRS, Université Paris Descartes, Paris, France
| |
Collapse
|
105
|
Nussenbaum K, Hartley CA. Reinforcement learning across development: What insights can we draw from a decade of research? Dev Cogn Neurosci 2019; 40:100733. [PMID: 31770715 PMCID: PMC6974916 DOI: 10.1016/j.dcn.2019.100733] [Citation(s) in RCA: 72] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Revised: 10/24/2019] [Accepted: 11/04/2019] [Indexed: 01/02/2023] Open
Abstract
The past decade has seen the emergence of the use of reinforcement learning models to study developmental change in value-based learning. It is unclear, however, whether these computational modeling studies, which have employed a wide variety of tasks and model variants, have reached convergent conclusions. In this review, we examine whether the tuning of model parameters that govern different aspects of learning and decision-making processes vary consistently as a function of age, and what neurocognitive developmental changes may account for differences in these parameter estimates across development. We explore whether patterns of developmental change in these estimates are better described by differences in the extent to which individuals adapt their learning processes to the statistics of different environments, or by more static learning biases that emerge across varied contexts. We focus specifically on learning rates and inverse temperature parameter estimates, and find evidence that from childhood to adulthood, individuals become better at optimally weighting recent outcomes during learning across diverse contexts and less exploratory in their value-based decision-making. We provide recommendations for how these two possibilities - and potential alternative accounts - can be tested more directly to build a cohesive body of research that yields greater insight into the development of core learning processes.
Collapse
|
106
|
Schulz E, Wu CM, Ruggeri A, Meder B. Searching for Rewards Like a Child Means Less Generalization and More Directed Exploration. Psychol Sci 2019; 30:1561-1572. [PMID: 31652093 DOI: 10.1177/0956797619863663] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
How do children and adults differ in their search for rewards? We considered three different hypotheses that attribute developmental differences to (a) children's increased random sampling, (b) more directed exploration toward uncertain options, or (c) narrower generalization. Using a search task in which noisy rewards were spatially correlated on a grid, we compared the ability of 55 younger children (ages 7 and 8 years), 55 older children (ages 9-11 years), and 50 adults (ages 19-55 years) to successfully generalize about unobserved outcomes and balance the exploration-exploitation dilemma. Our results show that children explore more eagerly than adults but obtain lower rewards. We built a predictive model of search to disentangle the unique contributions of the three hypotheses of developmental differences and found robust and recoverable parameter estimates indicating that children generalize less and rely on directed exploration more than adults. We did not, however, find reliable differences in terms of random sampling.
Collapse
Affiliation(s)
- Eric Schulz
- Department of Psychology, Harvard University
| | - Charley M Wu
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany
| | - Azzurra Ruggeri
- Max Planck Research Group iSearch, Max Planck Institute for Human Development, Berlin, Germany.,School of Education, Technical University Munich
| | - Björn Meder
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany.,Max Planck Research Group iSearch, Max Planck Institute for Human Development, Berlin, Germany.,Department of Psychology, University of Erfurt
| |
Collapse
|
107
|
Garnaat SL, Conelea CA, McLaughlin NCR, Benito K. Pediatric OCD in the era of RDoC. J Obsessive Compuls Relat Disord 2019; 23:10.1016/j.jocrd.2018.03.002. [PMID: 32042574 PMCID: PMC7010312 DOI: 10.1016/j.jocrd.2018.03.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
The NIMH Research Domain Criteria (RDoC) initiative was established with the goal of developing an alternative research classification to further research efforts in mental health. While RDoC acknowledges that constructs should be considered within a developmental framework, developmental considerations have not yet been well integrated within the existing RDoC matrix. In this paper, we consider RDoC in relation to pediatric OCD, a paradigmatic example of a neuropsychiatric disorder that often has onset in childhood but is also present across the lifespan. We discuss three RDoC subdomains with relevance to OCD as exemplars, providing for each construct a brief review of normative developmental changes, the state of construct-relevant research in pediatric OCD, and challenges and limitations related to developmental considerations within each subdomain. Finally, we conclude with a brief discussion of how RDoC may continue to evolve with regard to developmental considerations in order to further research in pediatric OCD.
Collapse
Affiliation(s)
- Sarah L. Garnaat
- Butler Hospital, Providence, Rhode Island
- Alpert Medical School of Brown University, Department of Psychiatry and Human Behavior, Providence, Rhode Island
| | | | - Nicole C. R. McLaughlin
- Butler Hospital, Providence, Rhode Island
- Alpert Medical School of Brown University, Department of Psychiatry and Human Behavior, Providence, Rhode Island
| | - Kristen Benito
- Alpert Medical School of Brown University, Department of Psychiatry and Human Behavior, Providence, Rhode Island
- Bradley Hospital, Providence, Rhode Island
| |
Collapse
|
108
|
Bolenz F, Kool W, Reiter AM, Eppinger B. Metacontrol of decision-making strategies in human aging. eLife 2019; 8:49154. [PMID: 31397670 PMCID: PMC6703898 DOI: 10.7554/elife.49154] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2019] [Accepted: 08/08/2019] [Indexed: 12/29/2022] Open
Abstract
Humans employ different strategies when making decisions. Previous research has reported reduced reliance on model-based strategies with aging, but it remains unclear whether this is due to cognitive or motivational factors. Moreover, it is not clear how aging affects the metacontrol of decision making, that is the dynamic adaptation of decision-making strategies to varying situational demands. In this cross-sectional study, we tested younger and older adults in a sequential decision-making task that dissociates model-free and model-based strategies. In contrast to previous research, model-based strategies led to higher payoffs. Moreover, we manipulated the costs and benefits of model-based strategies by varying reward magnitude and the stability of the task structure. Compared to younger adults, older adults showed reduced model-based decision making and less adaptation of decision-making strategies. Our findings suggest that aging affects the metacontrol of decision-making strategies and that reduced model-based strategies in older adults are due to limited cognitive abilities.
Collapse
Affiliation(s)
- Florian Bolenz
- Chair of Lifespan Developmental Neuroscience, Faculty of Psychology, Technische Universität Dresden, Dresden, Germany
| | - Wouter Kool
- Department of Psychology, Harvard University, Cambridge, United States.,Department of Psychological & Brain Sciences, Washington University in St. Louis, St. Louis, United States
| | - Andrea Mf Reiter
- Chair of Lifespan Developmental Neuroscience, Faculty of Psychology, Technische Universität Dresden, Dresden, Germany.,Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom.,Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom
| | - Ben Eppinger
- Chair of Lifespan Developmental Neuroscience, Faculty of Psychology, Technische Universität Dresden, Dresden, Germany.,Department of Psychology, Concordia University, Montreal, Canada.,PERFORM centre, Concordia University, Montreal, Canada
| |
Collapse
|
109
|
Credit assignment to state-independent task representations and its relationship with model-based decision making. Proc Natl Acad Sci U S A 2019; 116:15871-15876. [PMID: 31320592 PMCID: PMC6689934 DOI: 10.1073/pnas.1821647116] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
It is widely accepted that agents learn action values based on experience in a “model-free” manner (i.e., without holding a model of the environment). Environments usually embody many features, where a subset is considered relevant for model-free outcome learning. In this study, we show that a putative model-free system assigns credit to outcome-irrelevant task representations, regardless of stimulus features. The degree of impact of these associations is strongly linked to deployment of model-based strategies. Our findings motivate a reconsideration of how model-free representations are formed and regulated according to the structure of the environment. Model-free learning enables an agent to make better decisions based on prior experience while representing only minimal knowledge about an environment’s structure. It is generally assumed that model-free state representations are based on outcome-relevant features of the environment. Here, we challenge this assumption by providing evidence that a putative model-free system assigns credit to task representations that are irrelevant to an outcome. We examined data from 769 individuals performing a well-described 2-step reward decision task where stimulus identity but not spatial-motor aspects of the task predicted reward. We show that participants assigned value to spatial-motor representations despite it being outcome irrelevant. Strikingly, spatial-motor value associations affected behavior across all outcome-relevant features and stages of the task, consistent with credit assignment to low-level state-independent task representations. Individual difference analyses suggested that the impact of spatial-motor value formation was attenuated for individuals who showed greater deployment of goal-directed (model-based) strategies. Our findings highlight a need for a reconsideration of how model-free representations are formed and regulated according to the structure of the environment.
Collapse
|
110
|
Toyama A, Katahira K, Ohira H. Reinforcement Learning With Parsimonious Computation and a Forgetting Process. Front Hum Neurosci 2019; 13:153. [PMID: 31143107 PMCID: PMC6520826 DOI: 10.3389/fnhum.2019.00153] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2019] [Accepted: 04/23/2019] [Indexed: 12/03/2022] Open
Abstract
Decision-making is assumed to be supported by model-free and model-based systems: the model-free system is based purely on experience, while the model-based system uses a cognitive map of the environment and is more accurate. The recently developed multistep decision-making task and its computational model can dissociate the contributions of the two systems and have been used widely. This study used this task and model to understand our value-based learning process and tested alternative algorithms for the model-free and model-based learning systems. The task used in this study had a deterministic transition structure, and the degree of use of this structure in learning is estimated as the relative contribution of the model-based system to choices. We obtained data from 29 participants and fitted them with various computational models that differ in the model-free and model-based assumptions. The results of model comparison and parameter estimation showed that the participants update the value of action sequences and not each action. Additionally, the model fit was improved substantially by assuming that the learning mechanism includes a forgetting process, where the values of unselected options change to a certain default value over time. We also examined the relationships between the estimated parameters and psychopathology and other traits measured by self-reported questionnaires, and the results suggested that the difference in model assumptions can change the conclusion. In particular, inclusion of the forgetting process in the computational models had a strong impact on estimation of the weighting parameter of the model-free and model-based systems.
Collapse
Affiliation(s)
- Asako Toyama
- Department of Psychology, Graduate School of Informatics, Nagoya University, Nagoya, Japan
| | - Kentaro Katahira
- Department of Psychology, Graduate School of Informatics, Nagoya University, Nagoya, Japan
| | - Hideki Ohira
- Department of Psychology, Graduate School of Informatics, Nagoya University, Nagoya, Japan
| |
Collapse
|
111
|
Hauser TU, Will GJ, Dubois M, Dolan RJ. Annual Research Review: Developmental computational psychiatry. J Child Psychol Psychiatry 2019; 60:412-426. [PMID: 30252127 DOI: 10.1111/jcpp.12964] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/03/2018] [Indexed: 11/29/2022]
Abstract
Most psychiatric disorders emerge during childhood and adolescence. This is also a period that coincides with the brain undergoing substantial growth and reorganisation. However, it remains unclear how a heightened vulnerability to psychiatric disorder relates to this brain maturation. Here, we propose 'developmental computational psychiatry' as a framework for linking brain maturation to cognitive development. We argue that through modelling some of the brain's fundamental cognitive computations, and relating them to brain development, we can bridge the gap between brain and cognitive development. This in turn can lead to a richer understanding of the ontogeny of psychiatric disorders. We illustrate this perspective with examples from reinforcement learning and dopamine function. Specifically, we show how computational modelling deepens an understanding of how cognitive processes, such as reward learning, effort learning, and social learning might go awry in psychiatric disorders. Finally, we sketch the promises and limitations of a developmental computational psychiatry.
Collapse
Affiliation(s)
- Tobias U Hauser
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, UK.,Wellcome Centre for Human Neuroimaging, University College London, London, UK
| | - Geert-Jan Will
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, UK.,Institute of Psychology, Leiden University, Leiden, The Netherlands
| | - Magda Dubois
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, UK
| | - Raymond J Dolan
- Max Planck University College London Centre for Computational Psychiatry and Ageing Research, London, UK.,Wellcome Centre for Human Neuroimaging, University College London, London, UK
| |
Collapse
|
112
|
Vikbladh OM, Meager MR, King J, Blackmon K, Devinsky O, Shohamy D, Burgess N, Daw ND. Hippocampal Contributions to Model-Based Planning and Spatial Memory. Neuron 2019; 102:683-693.e4. [PMID: 30871859 DOI: 10.1016/j.neuron.2019.02.014] [Citation(s) in RCA: 75] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Revised: 12/21/2018] [Accepted: 02/08/2019] [Indexed: 12/30/2022]
Abstract
Little is known about the neural mechanisms that allow humans and animals to plan actions using knowledge of task contingencies. Emerging theories hypothesize that it involves the same hippocampal mechanisms that support self-localization and memory for locations. Yet limited direct evidence supports the link between planning and the hippocampal place map. We addressed this by investigating model-based planning and place memory in healthy controls and epilepsy patients treated using unilateral anterior temporal lobectomy with hippocampal resection. Both functions were impaired in the patient group. Specifically, the planning impairment was related to right hippocampal lesion size, controlling for overall lesion size. Furthermore, although planning and boundary-driven place memory covaried in the control group, this relationship was attenuated in patients, consistent with both functions relying on the same structure in the healthy brain. These findings clarify both the neural mechanism of model-based planning and the scope of hippocampal contributions to behavior.
Collapse
Affiliation(s)
- Oliver M Vikbladh
- Center for Neural Science, New York University School of Arts and Science, New York, NY 10003, USA.
| | - Michael R Meager
- Department of Psychology, New York University School of Arts and Science, New York, NY 10003, USA; Department of Neurology, New York University School of Medicine, New York, NY 10016, USA
| | - John King
- Division of Psychology & Language Sciences, Department of Clinical, Educational & Health Psychology, University College London, London WC1H 0AP, UK
| | - Karen Blackmon
- Department of Physiology, Neuroscience, and Behavioral Sciences, St. George's University School of Medicine, St. George, Grenada, West Indies
| | - Orrin Devinsky
- Department of Neurology, New York University School of Medicine, New York, NY 10016, USA; Department of Neurosurgery, New York University School of Medicine, New York, NY 10016, USA; Department of Psychiatry, New York University School of Medicine, New York, NY 10016, USA
| | - Daphna Shohamy
- Department of Psychology, Columbia University, New York, NY 10027, USA; Zuckerman Mind Brain Behavior Institute and Kavli Institute for Brain Science, Columbia University, New York, NY 10027, USA
| | - Neil Burgess
- Institute of Cognitive Neuroscience, University College London, London WC1N 3AZ, UK; Institute of Neurology, University College London, London WC1N 3BG, UK
| | - Nathaniel D Daw
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08540, USA; Department of Psychology, Princeton University, Princeton, NJ 08540, USA.
| |
Collapse
|
113
|
Rosenbaum GM, Hartley CA. Developmental perspectives on risky and impulsive choice. Philos Trans R Soc Lond B Biol Sci 2019; 374:20180133. [PMID: 30966918 PMCID: PMC6335462 DOI: 10.1098/rstb.2018.0133] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/20/2018] [Indexed: 12/28/2022] Open
Abstract
Epidemiological data suggest that risk taking in the real world increases from childhood into adolescence and declines into adulthood. However, developmental patterns of behaviour in laboratory assays of risk taking and impulsive choice are inconsistent. In this article, we review a growing literature using behavioural economic approaches to understand developmental changes in risk taking and impulsivity. We present findings that have begun to elucidate both the cognitive and neural processes that contribute to risky and impulsive choice, as well as how age-related changes in these neurocognitive processes give rise to shifts in choice behaviour. We highlight how variability in task parameters can be used to identify specific aspects of decision contexts that may differentially influence risky and impulsive choice behaviour across development. This article is part of the theme issue 'Risk taking and impulsive behaviour: fundamental discoveries, theoretical perspectives and clinical implications'.
Collapse
Affiliation(s)
- Gail M. Rosenbaum
- Department of Psychology, New York University, New York, NY 10003, USA
| | - Catherine A. Hartley
- Department of Psychology, New York University, New York, NY 10003, USA
- Center for Neural Science, New York University, New York, NY 10003, USA
| |
Collapse
|
114
|
Neural Variability Limits Adolescent Skill Learning. J Neurosci 2019; 39:2889-2902. [PMID: 30755494 DOI: 10.1523/jneurosci.2878-18.2019] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Revised: 01/24/2019] [Accepted: 01/26/2019] [Indexed: 12/31/2022] Open
Abstract
Skill learning is fundamental to the acquisition of many complex behaviors that emerge during development. For example, years of practice give rise to perceptual improvements that contribute to mature speech and language skills. While fully honed learning skills might be thought to offer an advantage during the juvenile period, the ability to learn actually continues to develop through childhood and adolescence, suggesting that the neural mechanisms that support skill learning are slow to mature. To address this issue, we asked whether the rate and magnitude of perceptual learning varies as a function of age as male and female gerbils trained on an auditory task. Adolescents displayed a slower rate of perceptual learning compared with their young and mature counterparts. We recorded auditory cortical neuron activity from a subset of adolescent and adult gerbils as they underwent perceptual training. While training enhanced the sensitivity of most adult units, the sensitivity of many adolescent units remained unchanged, or even declined across training days. Therefore, the average rate of cortical improvement was significantly slower in adolescents compared with adults. Both smaller differences between sound-evoked response magnitudes and greater trial-to-trial response fluctuations contributed to the poorer sensitivity of individual adolescent neurons. Together, these findings suggest that elevated sensory neural variability limits adolescent skill learning.SIGNIFICANCE STATEMENT The ability to learn new skills emerges gradually as children age. This prolonged development, often lasting well into adolescence, suggests that children, teens, and adults may rely on distinct neural strategies to improve their sensory and motor capabilities. Here, we found that practice-based improvement on a sound detection task is slower in adolescent gerbils than in younger or older animals. Neural recordings made during training revealed that practice enhanced the sound sensitivity of adult cortical neurons, but had a weaker effect in adolescents. This latter finding was partially explained by the fact that adolescent neural responses were more variable than in adults. Our results suggest that one mechanistic basis of adult-like skill learning is a reduction in neural response variability.
Collapse
|
115
|
Shahar N, Hauser TU, Moutoussis M, Moran R, Keramati M, Dolan RJ. Improving the reliability of model-based decision-making estimates in the two-stage decision task with reaction-times and drift-diffusion modeling. PLoS Comput Biol 2019; 15:e1006803. [PMID: 30759077 PMCID: PMC6391008 DOI: 10.1371/journal.pcbi.1006803] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Revised: 02/26/2019] [Accepted: 01/17/2019] [Indexed: 01/10/2023] Open
Abstract
A well-established notion in cognitive neuroscience proposes that multiple brain systems contribute to choice behaviour. These include: (1) a model-free system that uses values cached from the outcome history of alternative actions, and (2) a model-based system that considers action outcomes and the transition structure of the environment. The widespread use of this distinction, across a range of applications, renders it important to index their distinct influences with high reliability. Here we consider the two-stage task, widely considered as a gold standard measure for the contribution of model-based and model-free systems to human choice. We tested the internal/temporal stability of measures from this task, including those estimated via an established computational model, as well as an extended model using drift-diffusion. Drift-diffusion modeling suggested that both choice in the first stage, and RTs in the second stage, are directly affected by a model-based/free trade-off parameter. Both parameter recovery and the stability of model-based estimates were poor but improved substantially when both choice and RT were used (compared to choice only), and when more trials (than conventionally used in research practice) were included in our analysis. The findings have implications for interpretation of past and future studies based on the use of the two-stage task, as well as for characterising the contribution of model-based processes to choice behaviour.
Collapse
Affiliation(s)
- Nitzan Shahar
- Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom
| | - Tobias U. Hauser
- Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom
| | - Michael Moutoussis
- Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom
| | - Rani Moran
- Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom
| | - Mehdi Keramati
- Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom
| | | | - Raymond J. Dolan
- Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom
| |
Collapse
|
116
|
|
117
|
The computational basis of following advice in adolescents. J Exp Child Psychol 2019; 180:39-54. [PMID: 30611112 DOI: 10.1016/j.jecp.2018.11.019] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Revised: 11/13/2018] [Accepted: 11/24/2018] [Indexed: 12/20/2022]
Abstract
Advice taking helps one to quickly acquire knowledge and make decisions. This age-comparative study (in children [8- to 10-year-olds], adolescents [13- to 15-year-olds], and adults [18- to 22-year-olds]) investigated developmental differences in how advice, experience, and exploration influence learning. The results showed that adolescents were initially easily swayed to follow peer advice but also switched more rapidly to exploring alternatives like children. Whereas adults stayed with the advice over the task, adolescents put more weight on their own experience compared with adults. A social learning model showed that although social influence most strongly affects adolescents' initial expectations (i.e., their priors), adolescents showed higher exploration and discovered the other good option in the current task. Thus, our model resolved the apparently conflicting findings of adolescents being more and less sensitive to peer influence and provides novel insights into the dynamic interaction between social and individual learning.
Collapse
|
118
|
Gee DG, Bath KG, Johnson CM, Meyer HC, Murty VP, van den Bos W, Hartley CA. Neurocognitive Development of Motivated Behavior: Dynamic Changes across Childhood and Adolescence. J Neurosci 2018; 38:9433-9445. [PMID: 30381435 PMCID: PMC6209847 DOI: 10.1523/jneurosci.1674-18.2018] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Revised: 09/23/2018] [Accepted: 09/24/2018] [Indexed: 12/12/2022] Open
Abstract
The ability to anticipate and respond appropriately to the challenges and opportunities present in our environments is critical for adaptive behavior. Recent methodological innovations have led to substantial advances in our understanding of the neurocircuitry supporting such motivated behavior in adulthood. However, the neural circuits and cognitive processes that enable threat- and reward-motivated behavior undergo substantive changes over the course of development, and these changes are less well understood. In this article, we highlight recent research in human and animal models demonstrating how developmental changes in prefrontal-subcortical neural circuits give rise to corresponding changes in the processing of threats and rewards from infancy to adulthood. We discuss how these developmental trajectories are altered by experiential factors, such as early-life stress, and highlight the relevance of this research for understanding the developmental onset and treatment of psychiatric disorders characterized by dysregulation of motivated behavior.
Collapse
Affiliation(s)
- Dylan G Gee
- Department of Psychology, Yale University, New Haven, CT 06520,
| | - Kevin G Bath
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI 02912
| | - Carolyn M Johnson
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138
| | - Heidi C Meyer
- Department of Psychiatry, Weill Cornell Medicine, New York, NY 10065
| | - Vishnu P Murty
- Department of Psychology, Temple University, Philadelphia, PA 19122
| | - Wouter van den Bos
- Department of Developmental Psychology, University of Amsterdam, Amsterdam, Netherlands, and
| | | |
Collapse
|
119
|
Computational Phenotyping: Using Models to Understand Individual Differences in Personality, Development, and Mental Illness. PERSONALITY NEUROSCIENCE 2018; 1:e18. [PMID: 32435735 PMCID: PMC7219680 DOI: 10.1017/pen.2018.14] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Accepted: 06/06/2018] [Indexed: 12/19/2022]
Abstract
This paper reviews progress in the application of computational models to
personality, developmental, and clinical neuroscience. We first describe the
concept of a computational phenotype, a collection of parameters derived from
computational models fit to behavioral and neural data. This approach represents
individuals as points in a continuous parameter space, complementing traditional
trait and symptom measures. One key advantage of this representation is that it
is mechanistic: The parameters have interpretations in terms of cognitive
processes, which can be translated into quantitative predictions about future
behavior and brain activity. We illustrate with several examples how this
approach has led to new scientific insights into individual differences,
developmental trajectories, and psychopathology. We then survey some of the
challenges that lay ahead.
Collapse
|
120
|
van den Bos W, Bruckner R, Nassar MR, Mata R, Eppinger B. Computational neuroscience across the lifespan: Promises and pitfalls. Dev Cogn Neurosci 2018; 33:42-53. [PMID: 29066078 PMCID: PMC5916502 DOI: 10.1016/j.dcn.2017.09.008] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2017] [Revised: 08/19/2017] [Accepted: 09/29/2017] [Indexed: 11/26/2022] Open
Abstract
In recent years, the application of computational modeling in studies on age-related changes in decision making and learning has gained in popularity. One advantage of computational models is that they provide access to latent variables that cannot be directly observed from behavior. In combination with experimental manipulations, these latent variables can help to test hypotheses about age-related changes in behavioral and neurobiological measures at a level of specificity that is not achievable with descriptive analysis approaches alone. This level of specificity can in turn be beneficial to establish the identity of the corresponding behavioral and neurobiological mechanisms. In this paper, we will illustrate applications of computational methods using examples of lifespan research on risk taking, strategy selection and reinforcement learning. We will elaborate on problems that can occur when computational neuroscience methods are applied to data of different age groups. Finally, we will discuss potential targets for future applications and outline general shortcomings of computational neuroscience methods for research on human lifespan development.
Collapse
Affiliation(s)
- Wouter van den Bos
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany; Department of Psychology, University of Amsterdam, Amsterdam, The Netherlands; International Max Planck Research School LIFE, Berlin, Germany.
| | - Rasmus Bruckner
- Department of Education and Psychology, Freie Universität Berlin, Berlin, Germany; International Max Planck Research School LIFE, Berlin, Germany
| | - Matthew R Nassar
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, USA
| | - Rui Mata
- Center for Cognitive and Decision Sciences, Department of Psychology, University of Basel, Basel, Switzerland
| | - Ben Eppinger
- Department of Psychology, Concordia University, Montreal, Canada; Department of Psychology, TU Dresden, Dresden, Germany.
| |
Collapse
|
121
|
Heller AS, Ezie CEC, Otto AR, Timpano KR. Model-based learning and individual differences in depression: The moderating role of stress. Behav Res Ther 2018; 111:19-26. [PMID: 30273768 DOI: 10.1016/j.brat.2018.09.007] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2017] [Revised: 09/19/2018] [Accepted: 09/24/2018] [Indexed: 11/19/2022]
Abstract
Inflexible decision-making has been proposed as a transdiagnostic risk factor for mood disorders. Evidence suggests that inflexible decision-making may emerge only when individuals are experiencing increased negative affect or stress. 151 participants completed symptom measures of depression and anxiety, followed by a two-stage decision-making task that distinguishes between habitual and goal-directed choice. An experimental manipulation to induce stress was introduced halfway through the task. Individuals with higher depression levels became less model-based after the manipulation than those with lower depression levels. There was no relationship between trait anxiety and the impact of the manipulation on decision-making. Controlling for main effects of anxiety did not attenuate the association between depression and impact of stress. Anhedonia was associated with the impact of the manipulation on model-based decision-making. These results suggest that risk for depression is associated with reflexive decision-making, but these effects may only emerge under conditions of stress.
Collapse
Affiliation(s)
- Aaron S Heller
- Department of Psychology, University of Miami, Coral Gables, FL, 33146, USA.
| | - C E Chiemeka Ezie
- Department of Psychology, University of Miami, Coral Gables, FL, 33146, USA
| | - A Ross Otto
- Department of Psychology, McGill University, 2001 McGill College Avenue, Montréal, QC, H3A 1G1, Canada
| | - Kiara R Timpano
- Department of Psychology, University of Miami, Coral Gables, FL, 33146, USA
| |
Collapse
|
122
|
Hasz BM, Redish AD. Deliberation and Procedural Automation on a Two-Step Task for Rats. Front Integr Neurosci 2018; 12:30. [PMID: 30123115 PMCID: PMC6085996 DOI: 10.3389/fnint.2018.00030] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2017] [Accepted: 07/02/2018] [Indexed: 11/25/2022] Open
Abstract
Current theories suggest that decision-making arises from multiple, competing action-selection systems. Rodent studies dissociate deliberation and procedural behavior, and find a transition from procedural to deliberative behavior with experience. However, it remains unknown how this transition from deliberative to procedural control evolves within single trials, or within blocks of repeated choices. We adapted for rats a two-step task which has been used to dissociate model-based from model-free decisions in humans. We found that a mixture of model-based and model-free algorithms was more likely to explain rat choice strategies on the task than either model-based or model-free algorithms alone. This task contained two choices per trial, which provides a more complex and non-discrete per-trial choice structure. This task structure enabled us to evaluate how deliberative and procedural behavior evolved within-trial and within blocks of repeated choice sequences. We found that vicarious trial and error (VTE), a behavioral correlate of deliberation in rodents, was correlated between the two choice points on a given lap. We also found that behavioral stereotypy, a correlate of procedural automation, increased with the number of repeated choices. While VTE at the first choice point decreased [corrected] with the number of repeated choices, VTE at the second choice point did not, and only increased after unexpected transitions within the task. This suggests that deliberation at the beginning of trials may correspond to changes in choice patterns, while mid-trial deliberation may correspond to an interruption of a procedural process.
Collapse
Affiliation(s)
- Brendan M. Hasz
- Graduate Program in Neuroscience, University of Minnesota Twin CitiesMinneapolis, MN, United States
| | - A. David Redish
- Department of Neuroscience, University of Minnesota Twin CitiesMinneapolis, MN, United States
| |
Collapse
|
123
|
Kasanova Z, Ceccarini J, Frank MJ, van Amelsvoort T, Booij J, van Duin E, Steinhart H, Vaessen T, Heinzel A, Mottaghy F, Myin-Germeys I. Intact striatal dopaminergic modulation of reward learning and daily-life reward-oriented behavior in first-degree relatives of individuals with psychotic disorder. Psychol Med 2018; 48:1909-1914. [PMID: 29233195 DOI: 10.1017/s0033291717003476] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
BACKGROUND Abnormalities in reward learning in psychotic disorders have been proposed to be linked to dysregulated subcortical dopaminergic (DA) neurotransmission, which in turn is a suspected mechanism for predisposition to psychosis. We therefore explored the striatal dopaminergic modulation of reward processing and its behavioral correlates in individuals at familial risk for psychosis. METHODS We performed a DA D2/3 receptor [18F]fallypride positron emission tomography scan during a probabilistic reinforcement learning task in 16 healthy first-degree relatives of patients with psychosis and 16 healthy volunteers, followed by a 6-day ecological momentary assessment study capturing reward-oriented behavior in the everyday life. RESULTS We detected significant reward-induced DA release in bilateral caudate, putamen and ventral striatum of both groups, with no group differences in its magnitude nor spatial extent. In both groups alike, greater extent of reward-induced DA release in all regions of interest was associated with better performance in the task, as well as in greater tendency to be engaged in reward-oriented behavior in the daily life. CONCLUSIONS These findings suggest intact striatal dopaminergic modulation of reinforcement learning and reward-oriented behavior in individuals with familial predisposition to psychosis. Furthermore, this study points towards a key link between striatal reward-related DA release and pursuit of ecologically relevant rewards.
Collapse
Affiliation(s)
- Zuzana Kasanova
- Center for Contextual Psychiatry, Department of Neurosciences, KU Leuven - Leuven University, Leuven, Belgium
| | - Jenny Ceccarini
- Division of Nuclear Medicine and Molecular Imaging, Department of Imaging & Pathology, University Hospitals Leuven, Leuven, Belgium
| | - Michael J Frank
- Department of Cognitive, Linguistic and Psychological Sciences, Brown University, Providence, USA
| | - Thérèse van Amelsvoort
- Department of Psychiatry and Neuropsychology, Maastricht University, Maastricht, The Netherlands
| | - Jan Booij
- Department of Nuclear Medicine, Academic Medical Center, University of Amsterdam, Amsterdam, The Netherlands
| | - Esther van Duin
- Department of Psychiatry and Neuropsychology, Maastricht University, Maastricht, The Netherlands
| | - Henrietta Steinhart
- Center for Contextual Psychiatry, Department of Neurosciences, KU Leuven - Leuven University, Leuven, Belgium
- Department of Psychiatry and Neuropsychology, Maastricht University, Maastricht, The Netherlands
| | - Thomas Vaessen
- Center for Contextual Psychiatry, Department of Neurosciences, KU Leuven - Leuven University, Leuven, Belgium
- Department of Psychiatry and Neuropsychology, Maastricht University, Maastricht, The Netherlands
| | - Alexander Heinzel
- Department of Nuclear Medicine, University Hospital RWTH Aachen University, Aachen, Germany
| | - Felix Mottaghy
- Department of Nuclear Medicine, University Hospital RWTH Aachen University, Aachen, Germany
| | - Inez Myin-Germeys
- Center for Contextual Psychiatry, Department of Neurosciences, KU Leuven - Leuven University, Leuven, Belgium
| |
Collapse
|
124
|
Developmental differences in the neural dynamics of observational learning. Neuropsychologia 2018; 119:12-23. [PMID: 30036542 DOI: 10.1016/j.neuropsychologia.2018.07.022] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Revised: 07/16/2018] [Accepted: 07/19/2018] [Indexed: 11/22/2022]
Abstract
Learning from vicarious experience is central for educational practice, but not well understood with respect to its ontogenetic development and underlying neural dynamics. In this age-comparative study we compared behavioral and electrophysiological markers of learning from vicarious and one's own experience in children (age 8-10) and young adults. Behaviorally both groups benefitted from integrating vicarious experience into their own choices however, adults learned much faster from social information than children. The electrophysiological results show learning-related changes in the P300 to experienced and observed rewards in adults, but not in children, indicating that adults were more efficient in integrating observed and experienced information during learning. In comparison to adults, children showed an enhanced FRN for observed and experienced feedback, indicating that they focus more on valence information than adults. Taken together, children compared to adults seem to be less able to rapidly assess the informational value of observed and experienced feedback during learning and consequently up-regulate their response to both, observed and experienced (particularly negative) feedback. When transferring the current findings to an applied context, educational practice should strengthen children's ability to use feedback information for learning and be particularly cautious with negative social feedback during supervised learning.
Collapse
|
125
|
Adolescent Development of Value-Guided Goal Pursuit. Trends Cogn Sci 2018; 22:725-736. [PMID: 29880333 DOI: 10.1016/j.tics.2018.05.003] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Revised: 05/12/2018] [Accepted: 05/16/2018] [Indexed: 12/21/2022]
Abstract
Adolescents are challenged to orchestrate goal-directed actions in increasingly independent and consequential ways. In doing so, it is advantageous to use information about value to select which goals to pursue and how much effort to devote to them. Here, we examine age-related changes in how individuals use value signals to orchestrate goal-directed behavior. Drawing on emerging literature on value-guided cognitive control and reinforcement learning, we demonstrate how value and task difficulty modulate the execution of goal-directed action in complex ways across development from childhood to adulthood. We propose that the scope of value-guided goal pursuit expands with age to include increasingly challenging cognitive demands, and scaffolds on the emergence of functional integration within brain networks supporting valuation, cognition, and action.
Collapse
|
126
|
|
127
|
LeDoux J, Daw ND. Surviving threats: neural circuit and computational implications of a new taxonomy of defensive behaviour. Nat Rev Neurosci 2018; 19:269-282. [PMID: 29593300 DOI: 10.1038/nrn.2018.22] [Citation(s) in RCA: 201] [Impact Index Per Article: 33.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Research on defensive behaviour in mammals has in recent years focused on elicited reactions; however, organisms also make active choices when responding to danger. We propose a hierarchical taxonomy of defensive behaviour on the basis of known psychological processes. Included are three categories of reactions (reflexes, fixed reactions and habits) and three categories of goal-directed actions (direct action-outcome behaviours and actions based on implicit or explicit forecasting of outcomes). We then use this taxonomy to guide a summary of findings regarding the underlying neural circuits.
Collapse
Affiliation(s)
- Joseph LeDoux
- Center for Neural Science and Department of Psychology, New York University, New York, NY, USA.,Department of Psychiatry and Department of Child and Adolescent Psychiatry, New York University Langone Medical School, New York, NY, USA.,Nathan Kline Institute for Psychiatry Research, Orangeburg, NY, USA
| | - Nathaniel D Daw
- Princeton Neuroscience Institute and Department of Psychology, Princeton University, Princeton, NJ, USA
| |
Collapse
|
128
|
Fakhari P, Khodadadi A, Busemeyer JR. The detour problem in a stochastic environment: Tolman revisited. Cogn Psychol 2018; 101:29-49. [PMID: 29294373 DOI: 10.1016/j.cogpsych.2017.12.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Revised: 12/22/2017] [Accepted: 12/23/2017] [Indexed: 10/18/2022]
Abstract
We designed a grid world task to study human planning and re-planning behavior in an unknown stochastic environment. In our grid world, participants were asked to travel from a random starting point to a random goal position while maximizing their reward. Because they were not familiar with the environment, they needed to learn its characteristics from experience to plan optimally. Later in the task, we randomly blocked the optimal path to investigate whether and how people adjust their original plans to find a detour. To this end, we developed and compared 12 different models. These models were different on how they learned and represented the environment and how they planned to catch the goal. The majority of our participants were able to plan optimally. We also showed that people were capable of revising their plans when an unexpected event occurred. The result from the model comparison showed that the model-based reinforcement learning approach provided the best account for the data and outperformed heuristics in explaining the behavioral data in the re-planning trials.
Collapse
Affiliation(s)
- Pegah Fakhari
- Indiana University, Department of Psychological and Brain Sciences, Bloomington, IN, United States.
| | - Arash Khodadadi
- Indiana University, Department of Psychological and Brain Sciences, Bloomington, IN, United States
| | - Jerome R Busemeyer
- Indiana University, Department of Psychological and Brain Sciences, Bloomington, IN, United States
| |
Collapse
|
129
|
Bolenz F, Reiter AMF, Eppinger B. Developmental Changes in Learning: Computational Mechanisms and Social Influences. Front Psychol 2017; 8:2048. [PMID: 29250006 PMCID: PMC5715389 DOI: 10.3389/fpsyg.2017.02048] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2017] [Accepted: 11/09/2017] [Indexed: 11/13/2022] Open
Abstract
Our ability to learn from the outcomes of our actions and to adapt our decisions accordingly changes over the course of the human lifespan. In recent years, there has been an increasing interest in using computational models to understand developmental changes in learning and decision-making. Moreover, extensions of these models are currently applied to study socio-emotional influences on learning in different age groups, a topic that is of great relevance for applications in education and health psychology. In this article, we aim to provide an introduction to basic ideas underlying computational models of reinforcement learning and focus on parameters and model variants that might be of interest to developmental scientists. We then highlight recent attempts to use reinforcement learning models to study the influence of social information on learning across development. The aim of this review is to illustrate how computational models can be applied in developmental science, what they can add to our understanding of developmental mechanisms and how they can be used to bridge the gap between psychological and neurobiological theories of development.
Collapse
Affiliation(s)
- Florian Bolenz
- Chair of Lifespan Developmental Neuroscience, Department of Psychology, Technische Universität Dresden, Dresden, Germany
| | - Andrea M F Reiter
- Chair of Lifespan Developmental Neuroscience, Department of Psychology, Technische Universität Dresden, Dresden, Germany.,Department of Neurology, Max-Planck-Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| | - Ben Eppinger
- Chair of Lifespan Developmental Neuroscience, Department of Psychology, Technische Universität Dresden, Dresden, Germany.,Department of Psychology, Concordia University, Montreal, QC, Canada.,PERFORM Centre, Concordia University, Montreal, QC, Canada
| |
Collapse
|
130
|
Voon V, Reiter A, Sebold M, Groman S. Model-Based Control in Dimensional Psychiatry. Biol Psychiatry 2017; 82:391-400. [PMID: 28599832 DOI: 10.1016/j.biopsych.2017.04.006] [Citation(s) in RCA: 71] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Revised: 04/11/2017] [Accepted: 04/12/2017] [Indexed: 01/13/2023]
Abstract
We use parallel interacting goal-directed and habitual strategies to make our daily decisions. The arbitration between these strategies is relevant to inflexible repetitive behaviors in psychiatric disorders. Goal-directed control, also known as model-based control, is based on an affective outcome relying on a learned internal model to prospectively make decisions. In contrast, habit control, also known as model-free control, is based on an integration of previous reinforced learning autonomous of the current outcome value and is implicit and more efficient but at the cost of greater inflexibility. The concept of model-based control can be further extended into pavlovian processes. Here we describe and compare tasks that tap into these constructs and emphasize the clinical relevance and translation of these tasks in psychiatric disorders. Together, these findings highlight a role for model-based control as a transdiagnostic impairment underlying compulsive behaviors and representing a promising therapeutic target.
Collapse
Affiliation(s)
- Valerie Voon
- Department of Psychiatry, University of Cambridge, Cambridge, United Kingdom; Cambridgeshire and Peterborough NHS Foundation Trust, Cambridge, United Kingdom.
| | - Andrea Reiter
- Lifespan Developmental Neuroscience, Department of Psychology, Dresden, Germany; Department of Neurology, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| | - Miriam Sebold
- Department of Psychiatry and Psychotherapy, Charite-Universitatsmedizin Berlin, Berlin, Germany
| | - Stephanie Groman
- Department of Psychiatry, Yale University, New Haven, Connecticut
| |
Collapse
|
131
|
Sul S, Güroğlu B, Crone EA, Chang LJ. Medial prefrontal cortical thinning mediates shifts in other-regarding preferences during adolescence. Sci Rep 2017; 7:8510. [PMID: 28819107 PMCID: PMC5561198 DOI: 10.1038/s41598-017-08692-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2017] [Accepted: 07/12/2017] [Indexed: 11/24/2022] Open
Abstract
Adolescence is a time of significant cortical changes in the ‘social brain’, a set of brain regions involved in sophisticated social inference. However, there is limited evidence linking the structural changes in social brain to development of social behavior. The present study investigated how cortical development of the social brain relates to other-regarding behavior, in the context of fairness concerns. Participants aged between 9 to 23 years old responded to multiple rounds of ultimatum game proposals. The degree to which each participant considers fairness of intention (i.e., intention-based reciprocity) vs. outcome (i.e., egalitarianism) was quantified using economic utility models. We observed a gradual shift in other-regarding preferences from simple rule-based egalitarianism to complex intention-based reciprocity from early childhood to young adulthood. The preference shift was associated with cortical thinning of the dorsomedial prefrontal cortex and posterior temporal cortex. Meta-analytic reverse-inference analysis showed that these regions were involved in social inference. Importantly, the other-regarding preference shift was statistically mediated by cortical thinning in the dorsomedial prefrontal cortex. Together these findings suggest that development of the ability to perform sophisticated other-regarding social inference is associated with the structural changes of specific social brain regions in late adolescence.
Collapse
Affiliation(s)
- Sunhae Sul
- Department of Psychology, Pusan National University, Busan, Republic of Korea
| | - Berna Güroğlu
- Developmental and Educational Psychology Unit, Leiden University, Leiden, The Netherlands.,Leiden Institute for Brain and Cognition, Leiden University Medical Centre, Leiden, The Netherlands
| | - Eveline A Crone
- Developmental and Educational Psychology Unit, Leiden University, Leiden, The Netherlands.,Leiden Institute for Brain and Cognition, Leiden University Medical Centre, Leiden, The Netherlands
| | - Luke J Chang
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA.
| |
Collapse
|
132
|
Bang D, Frith CD. Making better decisions in groups. ROYAL SOCIETY OPEN SCIENCE 2017; 4:170193. [PMID: 28878973 PMCID: PMC5579088 DOI: 10.1098/rsos.170193] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Accepted: 07/10/2017] [Indexed: 06/07/2023]
Abstract
We review the literature to identify common problems of decision-making in individuals and groups. We are guided by a Bayesian framework to explain the interplay between past experience and new evidence, and the problem of exploring the space of hypotheses about all the possible states that the world could be in and all the possible actions that one could take. There are strong biases, hidden from awareness, that enter into these psychological processes. While biases increase the efficiency of information processing, they often do not lead to the most appropriate action. We highlight the advantages of group decision-making in overcoming biases and searching the hypothesis space for good models of the world and good solutions to problems. Diversity of group members can facilitate these achievements, but diverse groups also face their own problems. We discuss means of managing these pitfalls and make some recommendations on how to make better group decisions.
Collapse
Affiliation(s)
- Dan Bang
- Wellcome Trust Centre for Neuroimaging, University College London, London WC1N 3BG, UK
- Interacting Minds Centre, Aarhus University, 8000 Aarhus, Denmark
| | - Chris D. Frith
- Wellcome Trust Centre for Neuroimaging, University College London, London WC1N 3BG, UK
- Institute of Philosophy, University of London, London WC1E 7HU, UK
| |
Collapse
|
133
|
Kool W, Gershman SJ, Cushman FA. Cost-Benefit Arbitration Between Multiple Reinforcement-Learning Systems. Psychol Sci 2017; 28:1321-1333. [DOI: 10.1177/0956797617708288] [Citation(s) in RCA: 109] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Human behavior is sometimes determined by habit and other times by goal-directed planning. Modern reinforcement-learning theories formalize this distinction as a competition between a computationally cheap but inaccurate model-free system that gives rise to habits and a computationally expensive but accurate model-based system that implements planning. It is unclear, however, how people choose to allocate control between these systems. Here, we propose that arbitration occurs by comparing each system’s task-specific costs and benefits. To investigate this proposal, we conducted two experiments showing that people increase model-based control when it achieves greater accuracy than model-free control, and especially when the rewards of accurate performance are amplified. In contrast, they are insensitive to reward amplification when model-based and model-free control yield equivalent accuracy. This suggests that humans adaptively balance habitual and planned action through on-line cost-benefit analysis.
Collapse
Affiliation(s)
- Wouter Kool
- Department of Psychology, Harvard University
| | - Samuel J. Gershman
- Department of Psychology, Harvard University
- Center for Brain Science, Harvard University
| | | |
Collapse
|
134
|
Vahey NA, Bennett M, Whelan R. Conceptual advances in the cognitive neuroscience of learning: Implications for relational frame theory. JOURNAL OF CONTEXTUAL BEHAVIORAL SCIENCE 2017. [DOI: 10.1016/j.jcbs.2017.04.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
135
|
Friedel E, Sebold M, Kuitunen-Paul S, Nebe S, Veer IM, Zimmermann US, Schlagenhauf F, Smolka MN, Rapp M, Walter H, Heinz A. How Accumulated Real Life Stress Experience and Cognitive Speed Interact on Decision-Making Processes. Front Hum Neurosci 2017; 11:302. [PMID: 28642696 PMCID: PMC5462964 DOI: 10.3389/fnhum.2017.00302] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Accepted: 05/26/2017] [Indexed: 12/20/2022] Open
Abstract
Rationale: Advances in neurocomputational modeling suggest that valuation systems for goal-directed (deliberative) on one side, and habitual (automatic) decision-making on the other side may rely on distinct computational strategies for reinforcement learning, namely model-free vs. model-based learning. As a key theoretical difference, the model-based system strongly demands cognitive functions to plan actions prospectively based on an internal cognitive model of the environment, whereas valuation in the model-free system relies on rather simple learning rules from operant conditioning to retrospectively associate actions with their outcomes and is thus cognitively less demanding. Acute stress reactivity is known to impair model-based but not model-free choice behavior, with higher working memory capacity protecting the model-based system from acute stress. However, it is not clear which impact accumulated real life stress has on model-free and model-based decision systems and how this influence interacts with cognitive abilities. Methods: We used a sequential decision-making task distinguishing relative contributions of both learning strategies to choice behavior, the Social Readjustment Rating Scale questionnaire to assess accumulated real life stress, and the Digit Symbol Substitution Test to test cognitive speed in 95 healthy subjects. Results: Individuals reporting high stress exposure who had low cognitive speed showed reduced model-based but increased model-free behavioral control. In contrast, subjects exposed to accumulated real life stress with high cognitive speed displayed increased model-based performance but reduced model-free control. Conclusion: These findings suggest that accumulated real life stress exposure can enhance reliance on cognitive speed for model-based computations, which may ultimately protect the model-based system from the detrimental influences of accumulated real life stress. The combination of accumulated real life stress exposure and slower information processing capacities, however, might favor model-free strategies. Thus, the valence and preference of either system strongly depends on stressful experiences and individual cognitive capacities.
Collapse
Affiliation(s)
- Eva Friedel
- Department of Psychiatry and Psychotherapy, Charité Campus Mitte (CCM), Charité-Universitätsmedizin BerlinBerlin, Germany
- Biomedial Innovation Academy, Berlin Institute of HealthBerlin, Germany
| | - Miriam Sebold
- Department of Psychiatry and Psychotherapy, Charité Campus Mitte (CCM), Charité-Universitätsmedizin BerlinBerlin, Germany
- Department for Social and Preventive Medicine, University of PotsdamPotsdam, Germany
| | - Sören Kuitunen-Paul
- Department of Psychiatry and Psychotherapy, Institute of Clinical Psychology and Psychotherapy, Technische Universität DresdenDresden, Germany
| | - Stephan Nebe
- Department of Psychiatry and Psychotherapy, Institute of Clinical Psychology and Psychotherapy, Technische Universität DresdenDresden, Germany
- Neuroimaging Center, Technische Universität DresdenDresden, Germany
| | - Ilya M. Veer
- Department of Psychiatry and Psychotherapy, Charité Campus Mitte (CCM), Charité-Universitätsmedizin BerlinBerlin, Germany
| | - Ulrich S. Zimmermann
- Department of Psychiatry and Psychotherapy, Institute of Clinical Psychology and Psychotherapy, Technische Universität DresdenDresden, Germany
| | - Florian Schlagenhauf
- Department of Psychiatry and Psychotherapy, Charité Campus Mitte (CCM), Charité-Universitätsmedizin BerlinBerlin, Germany
- Department for Human Cognitive and Brain Sciences, Max Planck Institute LeipzigLeipzig, Germany
| | - Michael N. Smolka
- Department of Psychiatry and Psychotherapy, Institute of Clinical Psychology and Psychotherapy, Technische Universität DresdenDresden, Germany
- Neuroimaging Center, Technische Universität DresdenDresden, Germany
| | - Michael Rapp
- Department for Social and Preventive Medicine, University of PotsdamPotsdam, Germany
| | - Henrik Walter
- Department of Psychiatry and Psychotherapy, Charité Campus Mitte (CCM), Charité-Universitätsmedizin BerlinBerlin, Germany
- Biomedial Innovation Academy, Berlin Institute of HealthBerlin, Germany
| | - Andreas Heinz
- Department of Psychiatry and Psychotherapy, Charité Campus Mitte (CCM), Charité-Universitätsmedizin BerlinBerlin, Germany
- Biomedial Innovation Academy, Berlin Institute of HealthBerlin, Germany
| |
Collapse
|
136
|
Potter TCS, Bryce NV, Hartley CA. Cognitive components underpinning the development of model-based learning. Dev Cogn Neurosci 2016; 25:272-280. [PMID: 27825732 PMCID: PMC5410189 DOI: 10.1016/j.dcn.2016.10.005] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Revised: 08/09/2016] [Accepted: 10/20/2016] [Indexed: 11/07/2022] Open
Abstract
Reinforcement learning theory distinguishes “model-free” learning, which fosters reflexive repetition of previously rewarded actions, from “model-based” learning, which recruits a mental model of the environment to flexibly select goal-directed actions. Whereas model-free learning is evident across development, recruitment of model-based learning appears to increase with age. However, the cognitive processes underlying the development of model-based learning remain poorly characterized. Here, we examined whether age-related differences in cognitive processes underlying the construction and flexible recruitment of mental models predict developmental increases in model-based choice. In a cohort of participants aged 9–25, we examined whether the abilities to infer sequential regularities in the environment (“statistical learning”), maintain information in an active state (“working memory”) and integrate distant concepts to solve problems (“fluid reasoning”) predicted age-related improvements in model-based choice. We found that age-related improvements in statistical learning performance did not mediate the relationship between age and model-based choice. Ceiling performance on our working memory assay prevented examination of its contribution to model-based learning. However, age-related improvements in fluid reasoning statistically mediated the developmental increase in the recruitment of a model-based strategy. These findings suggest that gradual development of fluid reasoning may be a critical component process underlying the emergence of model-based learning.
Collapse
Affiliation(s)
- Tracey C S Potter
- Sackler Institute for Developmental Psychobiology, Weill Cornell Medicine, New York, NY 10065, United States; New York University, Department of Psychology, New York, NY 10003, United States
| | - Nessa V Bryce
- Sackler Institute for Developmental Psychobiology, Weill Cornell Medicine, New York, NY 10065, United States; New York University, Department of Psychology, New York, NY 10003, United States
| | - Catherine A Hartley
- Sackler Institute for Developmental Psychobiology, Weill Cornell Medicine, New York, NY 10065, United States; New York University, Department of Psychology, New York, NY 10003, United States.
| |
Collapse
|
137
|
Boehme R, Lorenz RC, Gleich T, Romund L, Pelz P, Golde S, Flemming E, Wold A, Deserno L, Behr J, Raufelder D, Heinz A, Beck A. Reversal learning strategy in adolescence is associated with prefrontal cortex activation. Eur J Neurosci 2016; 45:129-137. [PMID: 27628616 DOI: 10.1111/ejn.13401] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2016] [Revised: 08/24/2016] [Accepted: 09/12/2016] [Indexed: 01/14/2023]
Abstract
Adolescence is a critical maturation period for human cognitive control and executive function. In this study, a large sample of adolescents (n = 85) performed a reversal learning task during functional magnetic resonance imaging. We analyzed behavioral data using a reinforcement learning model to provide individually fitted parameters and imaging data with regard to reward prediction errors (PE). Following a model-based approach, we formed two groups depending on whether individuals tended to update expectations predominantly for the chosen stimulus or also for the unchosen one. These groups significantly differed in their problem behavior score obtained using the child behavior checklist (CBCL) and in a measure of their developmental stage. Imaging results showed that dorsolateral striatal areas covaried with PE. Participants who relied less on learning based on task structure showed less prefrontal activation compared with participants who relied more on task structure. An exploratory analysis revealed that PE-related activity was associated with pubertal development in prefrontal areas, insula and anterior cingulate. These findings support the hypothesis that the prefrontal cortex is implicated in mediating flexible goal-directed behavioral control.
Collapse
Affiliation(s)
- Rebecca Boehme
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Campus Mitte, Berlin, 10117, Germany.,Center for Social and Affective Neuroscience, Linköping University, Linköping, 58245, Sweden
| | - Robert C Lorenz
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Campus Mitte, Berlin, 10117, Germany.,Center for Adaptive Rationality, Max Planck Institute for Human Development, Berlin, Germany
| | - Tobias Gleich
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Campus Mitte, Berlin, 10117, Germany.,NeuroCure Excellence Cluster/Medical Neuroscience, Berlin, Germany
| | - Lydia Romund
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Campus Mitte, Berlin, 10117, Germany
| | - Patricia Pelz
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Campus Mitte, Berlin, 10117, Germany
| | - Sabrina Golde
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Campus Mitte, Berlin, 10117, Germany
| | - Eva Flemming
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Campus Mitte, Berlin, 10117, Germany
| | - Andrew Wold
- Center for Social and Affective Neuroscience, Linköping University, Linköping, 58245, Sweden
| | - Lorenz Deserno
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Campus Mitte, Berlin, 10117, Germany.,Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany.,Department of Neurology, Otto-von-Guericke University, Magdeburg, Germany
| | - Joachim Behr
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Campus Mitte, Berlin, 10117, Germany.,Institute of Neurophysiology, Charité - Universitätsmedizin Berlin, Berlin, Germany.,Department of Psychiatry, Psychotherapy and Psychosomatics, Medical School Brandenburg - Campus Neuruppin, Neuruppin, Germany
| | - Diana Raufelder
- Department of Educational Science and Psychology, Freie Universität, Berlin, Germany
| | - Andreas Heinz
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Campus Mitte, Berlin, 10117, Germany
| | - Anne Beck
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Campus Mitte, Berlin, 10117, Germany
| |
Collapse
|
138
|
|
139
|
Abstract
Many accounts of decision making and reinforcement learning posit the existence of two distinct systems that control choice: a fast, automatic system and a slow, deliberative system. Recent research formalizes this distinction by mapping these systems to “model-free” and “model-based” strategies in reinforcement learning. Model-free strategies are computationally cheap, but sometimes inaccurate, because action values can be accessed by inspecting a look-up table constructed through trial-and-error. In contrast, model-based strategies compute action values through planning in a causal model of the environment, which is more accurate but also more cognitively demanding. It is assumed that this trade-off between accuracy and computational demand plays an important role in the arbitration between the two strategies, but we show that the hallmark task for dissociating model-free and model-based strategies, as well as several related variants, do not embody such a trade-off. We describe five factors that reduce the effectiveness of the model-based strategy on these tasks by reducing its accuracy in estimating reward outcomes and decreasing the importance of its choices. Based on these observations, we describe a version of the task that formally and empirically obtains an accuracy-demand trade-off between model-free and model-based strategies. Moreover, we show that human participants spontaneously increase their reliance on model-based control on this task, compared to the original paradigm. Our novel task and our computational analyses may prove important in subsequent empirical investigations of how humans balance accuracy and demand. When you make a choice about what groceries to get for dinner, you can rely on two different strategies. You can make your choice by relying on habit, simply buying the items you need to make a meal that is second nature to you. However, you can also plan your actions in a more deliberative way, realizing that the friend who will join you is a vegetarian, and therefore you should not make the burgers that have become a staple in your cooking. These two strategies differ in how computationally demanding and accurate they are. While the habitual strategy is less computationally demanding (costs less effort and time), the deliberative strategy is more accurate. Scientists have been able to study the distinction between these strategies using a task that allows them to measure how much people rely on habit and planning strategies. Interestingly, we have discovered that in this task, the deliberative strategy does not increase performance accuracy, and hence does not induce a trade-off between accuracy and demand. We describe why this happens, and improve the task so that it embodies an accuracy-demand trade-off, providing evidence for theories of cost-based arbitration between cognitive strategies.
Collapse
|
140
|
Murty VP, Calabro F, Luna B. The role of experience in adolescent cognitive development: Integration of executive, memory, and mesolimbic systems. Neurosci Biobehav Rev 2016; 70:46-58. [PMID: 27477444 DOI: 10.1016/j.neubiorev.2016.07.034] [Citation(s) in RCA: 75] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2016] [Revised: 07/25/2016] [Accepted: 07/27/2016] [Indexed: 01/14/2023]
Abstract
Adolescence marks a time of unique neurocognitive development, in which executive functions reach adult levels of maturation. While many core facets of executive function may reach maturation in childhood, these processes continue to be refined and stabilized during adolescence. We propose that this is mediated, in part, by interactions between the hippocampus and prefrontal cortex. Specifically, we propose that development of this circuit refines adolescents' ability to extract relevant information from prior experience to support task-relevant behavior. In support of this model, we review evidence for protracted structural and functional development both within and across the hippocampus and prefrontal cortex. We describe emerging research demonstrating the refinement of adolescents' ability to integrate prior experiences to support goal-oriented behavior, which parallel hippocampal-prefrontal integration. Finally, we speculate that the development of this circuit is mediated by increases in dopaminergic neuromodulation present in adolescence, which may underlie memory processing, plasticity, and circuit integration. This model provides a novel characterization of how memory and executive systems integrate throughout adolescence to support adaptive behavior.
Collapse
Affiliation(s)
- Vishnu P Murty
- Psychiatry Departments, University of Pittsburgh, United States.
| | | | - Beatriz Luna
- Psychiatry Departments, University of Pittsburgh, United States; Psychology Departments, University of Pittsburgh, United States
| |
Collapse
|