1
|
Yoo AH, Keglovits H, Collins AGE. Lowered inter-stimulus discriminability hurts incremental contributions to learning. COGNITIVE, AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2023; 23:1346-1364. [PMID: 37656373 PMCID: PMC10545593 DOI: 10.3758/s13415-023-01104-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 04/13/2023] [Indexed: 09/02/2023]
Abstract
How does the similarity between stimuli affect our ability to learn appropriate response associations for them? In typical laboratory experiments learning is investigated under somewhat ideal circumstances, where stimuli are easily discriminable. This is not representative of most real-life learning, where overlapping "stimuli" can result in different "rewards" and may be learned simultaneously (e.g., you may learn over repeated interactions that a specific dog is friendly, but that a very similar looking one isn't). With two experiments, we test how humans learn in three stimulus conditions: one "best case" condition in which stimuli have idealized and highly discriminable visual and semantic representations, and two in which stimuli have overlapping representations, making them less discriminable. We find that, unsurprisingly, decreasing stimuli discriminability decreases performance. We develop computational models to test different hypotheses about how reinforcement learning (RL) and working memory (WM) processes are affected by different stimulus conditions. Our results replicate earlier studies demonstrating the importance of both processes to capture behavior. However, our results extend previous studies by demonstrating that RL, and not WM, is affected by stimulus distinctness: people learn slower and have higher across-stimulus value confusion at decision when stimuli are more similar to each other. These results illustrate strong effects of stimulus type on learning and demonstrate the importance of considering parallel contributions of different cognitive processes when studying behavior.
Collapse
Affiliation(s)
- Aspen H Yoo
- Department of Psychology, University of California, Berkeley, USA
- Helen Wills Neuroscience Institute, University of California, Berkeley, USA
| | - Haley Keglovits
- Department of Cognitive, Linguistic and Psychological Sciences, Brown University, Providence, USA
| | - Anne G E Collins
- Department of Psychology, University of California, Berkeley, USA.
- Helen Wills Neuroscience Institute, University of California, Berkeley, USA.
| |
Collapse
|
2
|
Song M, Baah PA, Cai MB, Niv Y. Humans combine value learning and hypothesis testing strategically in multi-dimensional probabilistic reward learning. PLoS Comput Biol 2022; 18:e1010699. [PMID: 36417419 PMCID: PMC9683628 DOI: 10.1371/journal.pcbi.1010699] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Accepted: 10/31/2022] [Indexed: 11/24/2022] Open
Abstract
Realistic and complex decision tasks often allow for many possible solutions. How do we find the correct one? Introspection suggests a process of trying out solutions one after the other until success. However, such methodical serial testing may be too slow, especially in environments with noisy feedback. Alternatively, the underlying learning process may involve implicit reinforcement learning that learns about many possibilities in parallel. Here we designed a multi-dimensional probabilistic active-learning task tailored to study how people learn to solve such complex problems. Participants configured three-dimensional stimuli by selecting features for each dimension and received probabilistic reward feedback. We manipulated task complexity by changing how many feature dimensions were relevant to maximizing reward, as well as whether this information was provided to the participants. To investigate how participants learn the task, we examined models of serial hypothesis testing, feature-based reinforcement learning, and combinations of the two strategies. Model comparison revealed evidence for hypothesis testing that relies on reinforcement-learning when selecting what hypothesis to test. The extent to which participants engaged in hypothesis testing depended on the instructed task complexity: people tended to serially test hypotheses when instructed that there were fewer relevant dimensions, and relied more on gradual and parallel learning of feature values when the task was more complex. This demonstrates a strategic use of task information to balance the costs and benefits of the two methods of learning.
Collapse
Affiliation(s)
- Mingyu Song
- Princeton Neuroscience Institute, Princeton University, Princeton, New Jersey, United States of America
- * E-mail:
| | - Persis A. Baah
- Department of Psychology, Princeton University, Princeton, New Jersey, United States of America
| | - Ming Bo Cai
- International Research Center for Neurointelligence (WPI-IRCN), The University of Tokyo, Tokyo, Japan
| | - Yael Niv
- Princeton Neuroscience Institute, Princeton University, Princeton, New Jersey, United States of America
- Department of Psychology, Princeton University, Princeton, New Jersey, United States of America
| |
Collapse
|
3
|
Adámek P, Langová V, Horáček J. Early-stage visual perception impairment in schizophrenia, bottom-up and back again. SCHIZOPHRENIA (HEIDELBERG, GERMANY) 2022; 8:27. [PMID: 35314712 PMCID: PMC8938488 DOI: 10.1038/s41537-022-00237-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 02/17/2022] [Indexed: 01/01/2023]
Abstract
Visual perception is one of the basic tools for exploring the world. However, in schizophrenia, this modality is disrupted. So far, there has been no clear answer as to whether the disruption occurs primarily within the brain or in the precortical areas of visual perception (the retina, visual pathways, and lateral geniculate nucleus [LGN]). A web-based comprehensive search of peer-reviewed journals was conducted based on various keyword combinations including schizophrenia, saliency, visual cognition, visual pathways, retina, and LGN. Articles were chosen with respect to topic relevance. Searched databases included Google Scholar, PubMed, and Web of Science. This review describes the precortical circuit and the key changes in biochemistry and pathophysiology that affect the creation and characteristics of the retinal signal as well as its subsequent modulation and processing in other parts of this circuit. Changes in the characteristics of the signal and the misinterpretation of visual stimuli associated with them may, as a result, contribute to the development of schizophrenic disease.
Collapse
Affiliation(s)
- Petr Adámek
- Third Faculty of Medicine, Charles University, Prague, Czech Republic. .,Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic.
| | - Veronika Langová
- Third Faculty of Medicine, Charles University, Prague, Czech Republic.,Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic
| | - Jiří Horáček
- Third Faculty of Medicine, Charles University, Prague, Czech Republic.,Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic
| |
Collapse
|
4
|
Farashahi S, Soltani A. Computational mechanisms of distributed value representations and mixed learning strategies. Nat Commun 2021; 12:7191. [PMID: 34893597 PMCID: PMC8664930 DOI: 10.1038/s41467-021-27413-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 11/16/2021] [Indexed: 11/25/2022] Open
Abstract
Learning appropriate representations of the reward environment is challenging in the real world where there are many options, each with multiple attributes or features. Despite existence of alternative solutions for this challenge, neural mechanisms underlying emergence and adoption of value representations and learning strategies remain unknown. To address this, we measure learning and choice during a multi-dimensional probabilistic learning task in humans and trained recurrent neural networks (RNNs) to capture our experimental observations. We find that human participants estimate stimulus-outcome associations by learning and combining estimates of reward probabilities associated with the informative feature followed by those of informative conjunctions. Through analyzing representations, connectivity, and lesioning of the RNNs, we demonstrate this mixed learning strategy relies on a distributed neural code and opponency between excitatory and inhibitory neurons through value-dependent disinhibition. Together, our results suggest computational and neural mechanisms underlying emergence of complex learning strategies in naturalistic settings.
Collapse
Affiliation(s)
- Shiva Farashahi
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA.
- Center for Computational Neuroscience, Flatiron Institute, Simons Foundation, New York, NY, USA.
| | - Alireza Soltani
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA.
| |
Collapse
|
5
|
Farashahi S, Xu J, Wu SW, Soltani A. Learning arbitrary stimulus-reward associations for naturalistic stimuli involves transition from learning about features to learning about objects. Cognition 2020; 205:104425. [PMID: 32958287 DOI: 10.1016/j.cognition.2020.104425] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2019] [Revised: 07/29/2020] [Accepted: 07/31/2020] [Indexed: 10/23/2022]
Abstract
Most cognitive processes are studied using abstract or synthetic stimuli with specific features to fully control what is presented to subjects. However, recent studies have revealed enhancements of cognitive capacities (such as working memory) when processing naturalistic versus abstract stimuli. Using abstract stimuli constructed from distinct visual features (e.g., color and shape), we have recently shown that human subjects can learn multidimensional stimulus-reward associations via initially estimating reward value of individual features (feature-based learning) before gradually switching to learning about reward value of individual stimuli (object-based learning). Here, we examined whether similar strategies are adopted during learning about naturalistic stimuli that are clearly perceived as objects (instead of a combination of features) and contain both task-relevant and irrelevant features. We found that similar to learning about abstract stimuli, subjects initially adopted feature-based learning more strongly before transitioning to object-based learning. However, there were three key differences between learning about naturalistic and abstract stimuli. First, compared with abstract stimuli, the initial learning strategy was less feature-based for naturalistic stimuli. Second, subjects transitioned to object-based learning faster for naturalistic stimuli. Third, unexpectedly, subjects were more likely to adopt feature-based learning for naturalistic stimuli, both at the steady state and overall. These results suggest that despite the stronger tendency to perceive naturalistic stimuli as objects, which leads to greater likelihood of using object-based learning as the initial strategy and a faster transition to object-based learning, the influence of individual features on learning is stronger for these stimuli such that ultimately the object-based strategy is adopted less. Overall, our findings suggest that feature-based learning is a general initial strategy for learning about reward value of all types of multi-dimensional stimuli.
Collapse
Affiliation(s)
- Shiva Farashahi
- Department of Psychological and Brain Sciences, Dartmouth College, NH 03755, United States of America; Flatiron Institute, Simons Foundation, New York, NY 10010, United States of America
| | - Jane Xu
- Department of Psychological and Brain Sciences, Dartmouth College, NH 03755, United States of America
| | - Shih-Wei Wu
- Institute of Neuroscience, National Yang-Ming University, Taipei, Taiwan; Brain Research Center, National Yang-Ming University, Taipei, Taiwan
| | - Alireza Soltani
- Department of Psychological and Brain Sciences, Dartmouth College, NH 03755, United States of America.
| |
Collapse
|
6
|
Radulescu A, Niv Y, Ballard I. Holistic Reinforcement Learning: The Role of Structure and Attention. Trends Cogn Sci 2019; 23:278-292. [PMID: 30824227 PMCID: PMC6472955 DOI: 10.1016/j.tics.2019.01.010] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Revised: 01/20/2019] [Accepted: 01/24/2019] [Indexed: 10/27/2022]
Abstract
Compact representations of the environment allow humans to behave efficiently in a complex world. Reinforcement learning models capture many behavioral and neural effects but do not explain recent findings showing that structure in the environment influences learning. In parallel, Bayesian cognitive models predict how humans learn structured knowledge but do not have a clear neurobiological implementation. We propose an integration of these two model classes in which structured knowledge learned via approximate Bayesian inference acts as a source of selective attention. In turn, selective attention biases reinforcement learning towards relevant dimensions of the environment. An understanding of structure learning will help to resolve the fundamental challenge in decision science: explaining why people make the decisions they do.
Collapse
Affiliation(s)
- Angela Radulescu
- Psychology Department, Princeton University, Princeton, NJ, USA; Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Yael Niv
- Psychology Department, Princeton University, Princeton, NJ, USA; Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Ian Ballard
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, USA.
| |
Collapse
|
7
|
Marković D, Reiter AMF, Kiebel SJ. Predicting change: Approximate inference under explicit representation of temporal structure in changing environments. PLoS Comput Biol 2019; 15:e1006707. [PMID: 30703108 PMCID: PMC6372216 DOI: 10.1371/journal.pcbi.1006707] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2018] [Revised: 02/12/2019] [Accepted: 12/11/2018] [Indexed: 11/18/2022] Open
Abstract
In our daily lives timing of our actions plays an essential role when we navigate the complex everyday environment. It is an open question though how the representations of the temporal structure of the world influence our behavior. Here we propose a probabilistic model with an explicit representation of state durations which may provide novel insights in how the brain predicts upcoming changes. We illustrate several properties of the behavioral model using a standard reversal learning design and compare its task performance to standard reinforcement learning models. Furthermore, using experimental data, we demonstrate how the model can be applied to identify participants' beliefs about the latent temporal task structure. We found that roughly one quarter of participants seem to have learned the latent temporal structure and used it to anticipate changes, whereas the remaining participants' behavior did not show signs of anticipatory responses, suggesting a lack of precise temporal expectations. We expect that the introduced behavioral model will allow, in future studies, for a systematic investigation of how participants learn the underlying temporal structure of task environments and how these representations shape behavior.
Collapse
Affiliation(s)
- Dimitrije Marković
- Department of Psychology, Technische Universität Dresden, Dresden, Germany
| | | | - Stefan J. Kiebel
- Department of Psychology, Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
8
|
Katthagen T, Mathys C, Deserno L, Walter H, Kathmann N, Heinz A, Schlagenhauf F. Modeling subjective relevance in schizophrenia and its relation to aberrant salience. PLoS Comput Biol 2018; 14:e1006319. [PMID: 30096179 PMCID: PMC6105009 DOI: 10.1371/journal.pcbi.1006319] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2017] [Revised: 08/22/2018] [Accepted: 06/20/2018] [Indexed: 01/09/2023] Open
Abstract
In schizophrenia, increased aberrant salience to irrelevant events and reduced learning of relevant information may relate to an underlying deficit in relevance detection. So far, subjective estimates of relevance have not been probed in schizophrenia patients. The mechanisms underlying belief formation about relevance and their translation into decisions are unclear. Using novel computational methods, we investigated relevance detection during implicit learning in 42 schizophrenia patients and 42 healthy individuals. Participants underwent functional magnetic resonance imaging while detecting the outcomes in a learning task. These were preceded by cues differing in color and shape, which were either relevant or irrelevant for outcome prediction. We provided a novel definition of relevance based on Bayesian precision and modeled reaction times as a function of relevance weighted unsigned prediction errors (UPE). For aberrant salience, we assessed responses to subjectively irrelevant cue manifestations. Participants learned the contingencies and slowed down their responses following unexpected events. Model selection revealed that individuals inferred the relevance of cue features and used it for behavioral adaption to the relevant cue feature. Relevance weighted UPEs correlated with dorsal anterior cingulate cortex activation and hippocampus deactivation. In patients, the aberrant salience bias to subjectively task-irrelevant information was increased and correlated with decreased striatal UPE activation and increased negative symptoms. This study shows that relevance estimates based on Bayesian precision can be inferred from observed behavior. This underscores the importance of relevance detection as an underlying mechanism for behavioral adaptation in complex environments and enhances the understanding of aberrant salience in schizophrenia. Schizophrenia patients display deficits in the appropriate attribution of meaningfulness to stimuli; such as aberrantly increased processing of irrelevant and insufficient processing of relevant information. We aimed to investigate the subjective nature of relevance detection and its deficit in schizophrenia and developed an implicit learning paradigm that allowed for parallel learning from relevant and irrelevant information. Based on the idea that subjective relevance might be captured by Bayesian precision we set up different computational models of how subjective relevance guides learning and behavioral adaptation. We found that subjects use Bayesian precision to estimate stimulus relevance in order to integrate multidimensional information and adapt more to the subjectively relevant stimuli. This relevance weighted adaptation correlated with brain activation within the salience network. Further, schizophrenia patients displayed an increased aberrant tendency to irrelevant events which related to decreased striatal coding of the relevant learning signal. To conclude, our findings demonstrate how individual beliefs about relevance can be inferred from computational models. Furthermore, we suggest that aberrant salience observed in patients with schizophrenia reflects an idiosyncratic bias in states of high subjective uncertainty.
Collapse
Affiliation(s)
- Teresa Katthagen
- Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Department of Psychiatry and Psychotherapy CCM, Berlin, Germany
- Berlin School of Mind and Brain, Humboldt-Universität zu Berlin, Berlin, Germany
- * E-mail:
| | - Christoph Mathys
- Scuola Internazionale Superiore di Studi Avanzati (SISSA), Trieste, Italy
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, United Kingdom
- Wellcome Trust Centre for Neuroimaging at UCL, London, United Kingdom
| | - Lorenz Deserno
- Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Department of Psychiatry and Psychotherapy CCM, Berlin, Germany
- Department of Child and Adolescent Psychiatry, Psychotherapy and Psychosomatics, University of Leipzig, Leipzig, Germany
- Max-Planck-Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| | - Henrik Walter
- Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Department of Psychiatry and Psychotherapy CCM, Berlin, Germany
| | - Norbert Kathmann
- Department of Psychology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Andreas Heinz
- Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Department of Psychiatry and Psychotherapy CCM, Berlin, Germany
| | - Florian Schlagenhauf
- Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Department of Psychiatry and Psychotherapy CCM, Berlin, Germany
- Max-Planck-Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| |
Collapse
|
9
|
Feature-based learning improves adaptability without compromising precision. Nat Commun 2017; 8:1768. [PMID: 29170381 PMCID: PMC5700946 DOI: 10.1038/s41467-017-01874-w] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2017] [Accepted: 10/18/2017] [Indexed: 12/14/2022] Open
Abstract
Learning from reward feedback is essential for survival but can become extremely challenging with myriad choice options. Here, we propose that learning reward values of individual features can provide a heuristic for estimating reward values of choice options in dynamic, multi-dimensional environments. We hypothesize that this feature-based learning occurs not just because it can reduce dimensionality, but more importantly because it can increase adaptability without compromising precision of learning. We experimentally test this hypothesis and find that in dynamic environments, human subjects adopt feature-based learning even when this approach does not reduce dimensionality. Even in static, low-dimensional environments, subjects initially adopt feature-based learning and gradually switch to learning reward values of individual options, depending on how accurately objects’ values can be predicted by combining feature values. Our computational models reproduce these results and highlight the importance of neurons coding feature values for parallel learning of values for features and objects. Learning about a rewarded outcome is complicated by the fact that a choice often incorporates multiple features with differing association with the reward. Here the authors demonstrate that feature-based learning is an efficient and adaptive strategy in dynamically changing environments.
Collapse
|
10
|
Dynamic Interaction between Reinforcement Learning and Attention in Multidimensional Environments. Neuron 2017; 93:451-463. [PMID: 28103483 DOI: 10.1016/j.neuron.2016.12.040] [Citation(s) in RCA: 157] [Impact Index Per Article: 22.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Revised: 11/03/2016] [Accepted: 12/28/2016] [Indexed: 11/24/2022]
Abstract
Little is known about the relationship between attention and learning during decision making. Using eye tracking and multivariate pattern analysis of fMRI data, we measured participants' dimensional attention as they performed a trial-and-error learning task in which only one of three stimulus dimensions was relevant for reward at any given time. Analysis of participants' choices revealed that attention biased both value computation during choice and value update during learning. Value signals in the ventromedial prefrontal cortex and prediction errors in the striatum were similarly biased by attention. In turn, participants' focus of attention was dynamically modulated by ongoing learning. Attentional switches across dimensions correlated with activity in a frontoparietal attention network, which showed enhanced connectivity with the ventromedial prefrontal cortex between switches. Our results suggest a bidirectional interaction between attention and learning: attention constrains learning to relevant dimensions of the environment, while we learn what to attend to via trial and error.
Collapse
|
11
|
Barahona-Corrêa JB, Camacho M, Castro-Rodrigues P, Costa R, Oliveira-Maia AJ. From Thought to Action: How the Interplay Between Neuroscience and Phenomenology Changed Our Understanding of Obsessive-Compulsive Disorder. Front Psychol 2015; 6:1798. [PMID: 26635696 PMCID: PMC4655583 DOI: 10.3389/fpsyg.2015.01798] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2015] [Accepted: 11/07/2015] [Indexed: 01/25/2023] Open
Abstract
The understanding of obsessive-compulsive disorder (OCD) has evolved with the knowledge of behavior, the brain, and their relationship. Modern views of OCD as a neuropsychiatric disorder originated from early lesion studies, with more recent models incorporating detailed neuropsychological findings, such as perseveration in set-shifting tasks, and findings of altered brain structure and function, namely of orbitofrontal corticostriatal circuits and their limbic connections. Interestingly, as neurobiological models of OCD evolved from cortical and cognitive to sub-cortical and behavioral, the focus of OCD phenomenology also moved from thought control and contents to new concepts rooted in animal models of action control. Most recently, the proposed analogy between habitual action control and compulsive behavior has led to the hypothesis that individuals suffering from OCD may be predisposed to rely excessively on habitual rather than on goal-directed behavioral strategies. Alternatively, compulsions have been proposed to result either from hyper-valuation of certain actions and/or their outcomes, or from excessive uncertainty in the monitoring of action performance, both leading to perseveration in prepotent actions such as washing or checking. In short, the last decades have witnessed a formidable renovation in the pathophysiology, phenomenology, and even semantics, of OCD. Nevertheless, such progress is challenged by several caveats, not least psychopathological oversimplification and overgeneralization of animal to human extrapolations. Here we present an historical overview of the understanding of OCD, highlighting converging studies and trends in neuroscience, psychiatry and neuropsychology, and how they influenced current perspectives on the nosology and phenomenology of this disorder.
Collapse
Affiliation(s)
- J Bernardo Barahona-Corrêa
- Department of Psychiatry and Mental Health, Faculdade de Ciências Médicas, Nova Medical School , Lisbon, Portugal ; Department of Psychiatry and Mental Health, Centro Hospitalar de Lisboa Ocidental , Lisbon, Portugal ; Champalimaud Clinical Centre, Champalimaud Centre for the Unknown , Lisbon, Portugal ; Centro de Apoio ao Desenvolvimento Infantil , Cascais, Portugal
| | - Marta Camacho
- Champalimaud Clinical Centre, Champalimaud Centre for the Unknown , Lisbon, Portugal
| | - Pedro Castro-Rodrigues
- Champalimaud Clinical Centre, Champalimaud Centre for the Unknown , Lisbon, Portugal ; Centro Hospitalar Psiquiátrico de Lisboa , Lisbon, Portugal
| | - Rui Costa
- Champalimaud Research, Champalimaud Centre for the Unknown , Lisbon, Portugal
| | - Albino J Oliveira-Maia
- Department of Psychiatry and Mental Health, Centro Hospitalar de Lisboa Ocidental , Lisbon, Portugal ; Champalimaud Clinical Centre, Champalimaud Centre for the Unknown , Lisbon, Portugal ; Champalimaud Research, Champalimaud Centre for the Unknown , Lisbon, Portugal
| |
Collapse
|
12
|
O’Doherty JP, Lee SW, McNamee D. The structure of reinforcement-learning mechanisms in the human brain. Curr Opin Behav Sci 2015. [DOI: 10.1016/j.cobeha.2014.10.004] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
13
|
Haber SN, Behrens TEJ. The neural network underlying incentive-based learning: implications for interpreting circuit disruptions in psychiatric disorders. Neuron 2014; 83:1019-39. [PMID: 25189208 PMCID: PMC4255982 DOI: 10.1016/j.neuron.2014.08.031] [Citation(s) in RCA: 143] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/18/2014] [Indexed: 02/03/2023]
Abstract
Coupling stimuli and actions with positive or negative outcomes facilitates the selection of appropriate actions. Several brain regions are involved in the development of goal-directed behaviors and habit formation during incentive-based learning. This Review focuses on higher cognitive control of decision making and the cortical and subcortical structures and connections that attribute value to stimuli, associate that value with choices, and select an action plan. Delineating the connectivity between these areas is fundamental for understanding how brain regions work together to evaluate stimuli, develop actions plans, and modify behavior, as well as for elucidating the pathophysiology of psychiatric diseases.
Collapse
Affiliation(s)
- Suzanne N Haber
- Department of Pharmacology and Physiology, University of Rochester School of Medicine, Rochester, NY 14642, USA.
| | - Timothy E J Behrens
- FMRIB Centre, University of Oxford, Oxford, OX3 9DU, UK; Wellcome Trust Centre for Neuroimaging, University College London, London, WC1N 3BG, UK
| |
Collapse
|
14
|
Markou A, Salamone JD, Bussey TJ, Mar AC, Brunner D, Gilmour G, Balsam P. Measuring reinforcement learning and motivation constructs in experimental animals: relevance to the negative symptoms of schizophrenia. Neurosci Biobehav Rev 2013; 37:2149-65. [PMID: 23994273 PMCID: PMC3849135 DOI: 10.1016/j.neubiorev.2013.08.007] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2013] [Revised: 08/12/2013] [Accepted: 08/16/2013] [Indexed: 10/26/2022]
Abstract
The present review article summarizes and expands upon the discussions that were initiated during a meeting of the Cognitive Neuroscience Treatment Research to Improve Cognition in Schizophrenia (CNTRICS; http://cntrics.ucdavis.edu) meeting. A major goal of the CNTRICS meeting was to identify experimental procedures and measures that can be used in laboratory animals to assess psychological constructs that are related to the psychopathology of schizophrenia. The issues discussed in this review reflect the deliberations of the Motivation Working Group of the CNTRICS meeting, which included most of the authors of this article as well as additional participants. After receiving task nominations from the general research community, this working group was asked to identify experimental procedures in laboratory animals that can assess aspects of reinforcement learning and motivation that may be relevant for research on the negative symptoms of schizophrenia, as well as other disorders characterized by deficits in reinforcement learning and motivation. The tasks described here that assess reinforcement learning are the Autoshaping Task, Probabilistic Reward Learning Tasks, and the Response Bias Probabilistic Reward Task. The tasks described here that assess motivation are Outcome Devaluation and Contingency Degradation Tasks and Effort-Based Tasks. In addition to describing such methods and procedures, the present article provides a working vocabulary for research and theory in this field, as well as an industry perspective about how such tasks may be used in drug discovery. It is hoped that this review can aid investigators who are conducting research in this complex area, promote translational studies by highlighting shared research goals and fostering a common vocabulary across basic and clinical fields, and facilitate the development of medications for the treatment of symptoms mediated by reinforcement learning and motivational deficits.
Collapse
Affiliation(s)
- Athina Markou
- Department of Psychiatry, School of Medicine, University of California San Diego, 9500 Gilman Drive, M/C0603, La Jolla, CA 92093-0603, USA.
| | | | | | | | | | | | | |
Collapse
|
15
|
Sánchez-Lara K, Arrieta O, Pasaye E, Laviano A, Mercadillo RE, Sosa-Sánchez R, Méndez-Sánchez N. Brain activity correlated with food preferences: A functional study comparing advanced non-small cell lung cancer patients with and without anorexia. Nutrition 2013; 29:1013-9. [DOI: 10.1016/j.nut.2013.01.020] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2012] [Revised: 01/16/2013] [Accepted: 01/30/2013] [Indexed: 11/30/2022]
|
16
|
Abstract
Rewards in real life are rarely received without incurring costs and successful reward harvesting often involves weighing and minimizing different types of costs. In the natural environment, such costs often include the physical effort required to obtain rewards and potential risks attached to them. Costs may also include potential risks. In this study, we applied fMRI to explore the neural coding of physical effort costs as opposed to costs associated with risky rewards. Using an incentive-compatible valuation mechanism, we separately measured the subjective costs associated with effortful and risky options. As expected, subjective costs of options increased with both increasing effort and increasing risk. Despite the similar nature of behavioral discounting of effort and risk, distinct regions of the brain coded these two cost types separately, with anterior insula primarily processing risk costs and midcingulate and supplementary motor area (SMA) processing effort costs. To investigate integration of the two cost types, we also presented participants with options that combined effortful and risky elements. We found that the frontal pole integrates effort and risk costs through functional coupling with the SMA and insula. The degree to which the latter two regions influenced frontal pole activity correlated with participant-specific behavioral sensitivity to effort and risk costs. These data support the notion that, although physical effort costs may appear to be behaviorally similar to other types of costs, such as risk, they are treated separately at the neural level and are integrated only if there is a need to do so.
Collapse
|
17
|
Insights from the application of computational neuroimaging to social neuroscience. Curr Opin Neurobiol 2013; 23:387-92. [PMID: 23518140 DOI: 10.1016/j.conb.2013.02.007] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2012] [Revised: 01/17/2013] [Accepted: 02/15/2013] [Indexed: 11/22/2022]
Abstract
A recent approach in social neuroscience has been the application of formal computational models for a particular social-cognitive process to neuroimaging data. Here we review preliminary findings from this nascent subfield, focusing on observational learning and strategic interactions. We present evidence consistent with the existence of three distinct learning systems that may contribute to social cognition: an observational-reward-learning system involved in updating expectations of future reward based on observing rewards obtained by others, an action-observational learning system involved in learning about the action tendencies of others, and a third system engaged when it is necessary to learn about the hidden mental-states or traits of another. These three systems appear to map onto distinct neuroanatomical substrates, and depend on unique computational signals.
Collapse
|
18
|
Gluth S, Rieskamp J, Büchel C. Neural evidence for adaptive strategy selection in value-based decision-making. Cereb Cortex 2013; 24:2009-21. [PMID: 23476024 DOI: 10.1093/cercor/bht049] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In everyday life, humans often encounter complex environments in which multiple sources of information can influence their decisions. We propose that in such situations, people select and apply different strategies representing different cognitive models of the decision problem. Learning advances by evaluating the success of using a strategy and eventually by switching between strategies. To test our strategy selection model, we investigated how humans solve a dynamic learning task with complex auditory and visual information, and assessed the underlying neural mechanisms with functional magnetic resonance imaging. Using the model, we were able to capture participants' choices and to successfully attribute expected values and reward prediction errors to activations in the dopaminoceptive system (e.g., ventral striatum [VS]) as well as decision conflict to signals in the anterior cingulate cortex. The model outperformed an alternative approach that did not update decision strategies, but the relevance of information itself. Activation of sensory areas depended on whether the selected strategy made use of the respective source of information. Selection of a strategy also determined how value-related information influenced effective connectivity between sensory systems and the VS. Our results suggest that humans can structure their search for and use of relevant information by adaptively selecting between decision strategies.
Collapse
Affiliation(s)
- Sebastian Gluth
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Hamburg D-20246, Germany and
| | - Jörg Rieskamp
- Department of Psychology, University of Basel, Basel CH-4055, Switzerland
| | - Christian Büchel
- Department of Systems Neuroscience, University Medical Center Hamburg-Eppendorf, Hamburg D-20246, Germany and
| |
Collapse
|
19
|
Simpson EH, Waltz JA, Kellendonk C, Balsam PD. Schizophrenia in translation: dissecting motivation in schizophrenia and rodents. Schizophr Bull 2012; 38:1111-7. [PMID: 23015686 PMCID: PMC3494038 DOI: 10.1093/schbul/sbs114] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/13/2012] [Accepted: 08/22/2012] [Indexed: 11/14/2022]
Abstract
The negative symptoms of schizophrenia include deficits in motivation, for which there is currently no treatment available. Animal models provide a powerful tool for identifying the potential pathophysiological mechanisms underlying the motivation deficits of schizophrenia with the aim of discovering novel treatment targets. The success of such an approach critically depends on meticulously detailed analysis of motivational phenotypes in patients and in animal models. Here, we review the results of recent human behavioral and imaging studies of motivation, and we relate those findings to the results from animal studies, including a mouse model of striatal dopamine D2 receptor hyperfunction. The motivational deficit in patients with schizophrenia is not due to an inability to experience pleasure in the moment as hedonic reaction appears intact in patients. Instead, the motivation deficit represents a reduced capacity for anticipating future pleasure resulting from goal-directed action. The diminished anticipation appears to be a consequence of an inability to accurately represent the expected reward values of actions. A strikingly similar phenotype in incentive motivation has also been described in mice with striatal dopamine D2 receptor hyperfunction. These convergent findings identify potential pathophysiological mechanisms that underlie the deficit in anticipatory motivation, and importantly, the mouse model provides a tool for investigating novel treatment strategies, which we discuss here.
Collapse
|
20
|
|
21
|
Wilson RC, Niv Y. Inferring relevance in a changing world. Front Hum Neurosci 2012; 5:189. [PMID: 22291631 PMCID: PMC3264902 DOI: 10.3389/fnhum.2011.00189] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2010] [Accepted: 12/29/2011] [Indexed: 11/15/2022] Open
Abstract
Reinforcement learning models of human and animal learning usually concentrate on how we learn the relationship between different stimuli or actions and rewards. However, in real-world situations “stimuli” are ill-defined. On the one hand, our immediate environment is extremely multidimensional. On the other hand, in every decision making scenario only a few aspects of the environment are relevant for obtaining reward, while most are irrelevant. Thus a key question is how do we learn these relevant dimensions, that is, how do we learn what to learn about? We investigated this process of “representation learning” experimentally, using a task in which one stimulus dimension was relevant for determining reward at each point in time. As in real life situations, in our task the relevant dimension can change without warning, adding ever-present uncertainty engendered by a constantly changing environment. We show that human performance on this task is better described by a suboptimal strategy based on selective attention and serial-hypothesis-testing rather than a normative strategy based on probabilistic inference. From this, we conjecture that the problem of inferring relevance in general scenarios is too computationally demanding for the brain to solve optimally. As a result the brain utilizes approximations, employing these even in simplified scenarios in which optimal representation learning is tractable, such as the one in our experiment.
Collapse
Affiliation(s)
- Robert C Wilson
- Department of Psychology, Neuroscience Institute, Princeton University Princeton, NJ, USA
| | | |
Collapse
|