Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Lloyd K, Leslie DS. Context-dependent decision-making: a simple Bayesian model. J R Soc Interface 2013;10:20130069. [PMID: 23427101 PMCID: PMC3627089 DOI: 10.1098/rsif.2013.0069] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2013] [Accepted: 01/29/2013] [Indexed: 11/12/2022] Open

For:	Lloyd K, Leslie DS. Context-dependent decision-making: a simple Bayesian model. J R Soc Interface 2013;10:20130069. [PMID: 23427101 PMCID: PMC3627089 DOI: 10.1098/rsif.2013.0069] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2013] [Accepted: 01/29/2013] [Indexed: 11/12/2022] Open

Number

Cited by Other Article(s)

Kang P, Tobler PN, Dayan P. Bayesian reinforcement learning: A basic overview. Neurobiol Learn Mem 2024;211:107924. [PMID: 38579896 DOI: 10.1016/j.nlm.2024.107924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 03/21/2024] [Accepted: 04/02/2024] [Indexed: 04/07/2024]

Sandhu TR, Xiao B, Lawson RP. Transdiagnostic computations of uncertainty: towards a new lens on intolerance of uncertainty. Neurosci Biobehav Rev 2023;148:105123. [PMID: 36914079 DOI: 10.1016/j.neubiorev.2023.105123] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Revised: 02/21/2023] [Accepted: 03/08/2023] [Indexed: 03/13/2023]

Fields C, Friston K, Glazebrook JF, Levin M. A free energy principle for generic quantum systems. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2022;173:36-59. [PMID: 35618044 DOI: 10.1016/j.pbiomolbio.2022.05.006] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2022] [Revised: 05/04/2022] [Accepted: 05/18/2022] [Indexed: 01/17/2023]

Yu LQ, Wilson RC, Nassar MR. Adaptive learning is structure learning in time. Neurosci Biobehav Rev 2021;128:270-281. [PMID: 34144114 PMCID: PMC8422504 DOI: 10.1016/j.neubiorev.2021.06.024] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 04/19/2021] [Accepted: 06/11/2021] [Indexed: 10/21/2022]

Sajid N, Ball PJ, Parr T, Friston KJ. Active Inference: Demystified and Compared. Neural Comput 2021;33:674-712. [PMID: 33400903 DOI: 10.1162/neco_a_01357] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]

Abstract

Active inference is a first principle account of how autonomous agents operate in dynamic, nonstationary environments. This problem is also considered in reinforcement learning, but limited work exists on comparing the two approaches on the same discrete-state environments. In this letter, we provide (1) an accessible overview of the discrete-state formulation of active inference, highlighting natural behaviors in active inference that are generally engineered in reinforcement learning, and (2) an explicit discrete-state comparison between active inference and reinforcement learning on an OpenAI gym baseline. We begin by providing a condensed overview of the active inference literature, in particular viewing the various natural behaviors of active inference agents through the lens of reinforcement learning. We show that by operating in a pure belief-based setting, active inference agents can carry out epistemic exploration-and account for uncertainty about their environment-in a Bayes-optimal fashion. Furthermore, we show that the reliance on an explicit reward signal in reinforcement learning is removed in active inference, where reward can simply be treated as another observation we have a preference over; even in the total absence of rewards, agent behaviors are learned through preference learning. We make these properties explicit by showing two scenarios in which active inference agents can infer behaviors in reward-free environments compared to both Q-learning and Bayesian model-based reinforcement learning agents and by placing zero prior preferences over rewards and learning the prior preferences over the observations corresponding to reward. We conclude by noting that this formalism can be applied to more complex settings (e.g., robotic arm movement, Atari games) if appropriate generative models can be formulated. In short, we aim to demystify the behavior of active inference agents by presenting an accessible discrete state-space and time formulation and demonstrate these behaviors in a OpenAI gym environment, alongside reinforcement learning agents.

Collapse

Hämäläinen L, Thorogood R. The signal detection problem of aposematic prey revisited: integrating prior social and personal experience. Philos Trans R Soc Lond B Biol Sci 2020;375:20190473. [PMID: 32420858 PMCID: PMC7331014 DOI: 10.1098/rstb.2019.0473] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/28/2020] [Indexed: 11/12/2022] Open

Schulz E, Franklin NT, Gershman SJ. Finding structure in multi-armed bandits. Cogn Psychol 2020;119:101261. [PMID: 32059133 DOI: 10.1016/j.cogpsych.2019.101261] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2018] [Revised: 11/10/2019] [Accepted: 12/02/2019] [Indexed: 12/24/2022]

Li Q, Tu Y, Chen J, Shan J, Yung PSH, Ling SKK, Hua Y. Reverse anterolateral drawer test is more sensitive and accurate for diagnosing chronic anterior talofibular ligament injury. Knee Surg Sports Traumatol Arthrosc 2020;28:55-62. [PMID: 31559464 DOI: 10.1007/s00167-019-05705-x] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Accepted: 09/11/2019] [Indexed: 12/26/2022]

Parr T, Friston KJ. Generalised free energy and active inference. BIOLOGICAL CYBERNETICS 2019;113:495-513. [PMID: 31562544 PMCID: PMC6848054 DOI: 10.1007/s00422-019-00805-w] [Citation(s) in RCA: 70] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2017] [Accepted: 09/13/2019] [Indexed: 05/30/2023]

Abstract

Active inference is an approach to understanding behaviour that rests upon the idea that the brain uses an internal generative model to predict incoming sensory data. The fit between this model and data may be improved in two ways. The brain could optimise probabilistic beliefs about the variables in the generative model (i.e. perceptual inference). Alternatively, by acting on the world, it could change the sensory data, such that they are more consistent with the model. This implies a common objective function (variational free energy) for action and perception that scores the fit between an internal model and the world. We compare two free energy functionals for active inference in the framework of Markov decision processes. One of these is a functional of beliefs (i.e. probability distributions) about states and policies, but a function of observations, while the second is a functional of beliefs about all three. In the former (expected free energy), prior beliefs about outcomes are not part of the generative model (because they are absorbed into the prior over policies). Conversely, in the second (generalised free energy), priors over outcomes become an explicit component of the generative model. When using the free energy function, which is blind to future observations, we equip the generative model with a prior over policies that ensure preferred (i.e. priors over) outcomes are realised. In other words, if we expect to encounter a particular kind of outcome, this lends plausibility to those policies for which this outcome is a consequence. In addition, this formulation ensures that selected policies minimise uncertainty about future outcomes by minimising the free energy expected in the future. When using the free energy functional-that effectively treats future observations as hidden states-we show that policies are inferred or selected that realise prior preferences by minimising the free energy of future expectations. Interestingly, the form of posterior beliefs about policies (and associated belief updating) turns out to be identical under both formulations, but the quantities used to compute them are not.

Collapse

Stamps JA, Krishnan V. Age-dependent changes in behavioural plasticity: insights from Bayesian models of development. Anim Behav 2017. [DOI: 10.1016/j.anbehav.2017.01.013] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Qian T, Jaeger TF, Aslin RN. Incremental implicit learning of bundles of statistical patterns. Cognition 2016;157:156-173. [PMID: 27639552 PMCID: PMC5181648 DOI: 10.1016/j.cognition.2016.09.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2014] [Revised: 09/02/2016] [Accepted: 09/08/2016] [Indexed: 11/26/2022]

Li Y, Nakae K, Ishii S, Naoki H. Uncertainty-Dependent Extinction of Fear Memory in an Amygdala-mPFC Neural Circuit Model. PLoS Comput Biol 2016;12:e1005099. [PMID: 27617747 PMCID: PMC5019407 DOI: 10.1371/journal.pcbi.1005099] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2016] [Accepted: 08/11/2016] [Indexed: 11/29/2022] Open

Abstract

Uncertainty of fear conditioning is crucial for the acquisition and extinction of fear memory. Fear memory acquired through partial pairings of a conditioned stimulus (CS) and an unconditioned stimulus (US) is more resistant to extinction than that acquired through full pairings; this effect is known as the partial reinforcement extinction effect (PREE). Although the PREE has been explained by psychological theories, the neural mechanisms underlying the PREE remain largely unclear. Here, we developed a neural circuit model based on three distinct types of neurons (fear, persistent and extinction neurons) in the amygdala and medial prefrontal cortex (mPFC). In the model, the fear, persistent and extinction neurons encode predictions of net severity, of unconditioned stimulus (US) intensity, and of net safety, respectively. Our simulation successfully reproduces the PREE. We revealed that unpredictability of the US during extinction was represented by the combined responses of the three types of neurons, which are critical for the PREE. In addition, we extended the model to include amygdala subregions and the mPFC to address a recent finding that the ventral mPFC (vmPFC) is required for consolidating extinction memory but not for memory retrieval. Furthermore, model simulations led us to propose a novel procedure to enhance extinction learning through re-conditioning with a stronger US; strengthened fear memory up-regulates the extinction neuron, which, in turn, further inhibits the fear neuron during re-extinction. Thus, our models increased the understanding of the functional roles of the amygdala and vmPFC in the processing of uncertainty in fear conditioning and extinction.

Animals live in environments that contain uncertainty. To adapt to uncertain situations, they flexibly learn to associate environmental cues with rewards and punishments. Understanding how the brain processes uncertainty has remained an important issue in neuroscience. To address this question, we focused on neural processing in the amygdala and mPFC during fear conditioning and extinction. We developed a neural circuit model that incorporates distinct neural populations in the amygdala and mPFC. Our model first successfully reproduced uncertainty-dependent resistance to the extinction of fear memory. An extension of the model provided a possible explanation for observations made during optogenetic manipulation of the ventral mPFC. Finally, we proposed a procedure to accelerate the efficacy of subsequent extinction based on our model.

Collapse

Context-dependent learning and causal structure. Psychon Bull Rev 2016;24:557-565. [DOI: 10.3758/s13423-016-1110-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Stamps JA, Frankenhuis WE. Bayesian Models of Development. Trends Ecol Evol 2016;31:260-268. [PMID: 26896042 DOI: 10.1016/j.tree.2016.01.012] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2015] [Revised: 01/19/2016] [Accepted: 01/20/2016] [Indexed: 10/22/2022]

Iigaya K. Adaptive learning and decision-making under uncertainty by metaplastic synapses guided by a surprise detection system. eLife 2016;5:e18073. [PMID: 27504806 PMCID: PMC5008908 DOI: 10.7554/elife.18073] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2016] [Accepted: 08/08/2016] [Indexed: 01/27/2023] Open

Gershman SJ, Norman KA, Niv Y. Discovering latent causes in reinforcement learning. Curr Opin Behav Sci 2015. [DOI: 10.1016/j.cobeha.2015.07.007] [Citation(s) in RCA: 81] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]

Stamps JA. Individual differences in behavioural plasticities. Biol Rev Camb Philos Soc 2015;91:534-67. [PMID: 25865135 DOI: 10.1111/brv.12186] [Citation(s) in RCA: 154] [Impact Index Per Article: 17.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Revised: 03/14/2015] [Accepted: 03/18/2015] [Indexed: 01/06/2023]

Abstract

Interest in individual differences in animal behavioural plasticities has surged in recent years, but research in this area has been hampered by semantic confusion as different investigators use the same terms (e.g. plasticity, flexibility, responsiveness) to refer to different phenomena. The first goal of this review is to suggest a framework for categorizing the many different types of behavioural plasticities, describe examples of each, and indicate why using reversibility as a criterion for categorizing behavioural plasticities is problematic. This framework is then used to address a number of timely questions about individual differences in behavioural plasticities. One set of questions concerns the experimental designs that can be used to study individual differences in various types of behavioural plasticities. Although within-individual designs are the default option for empirical studies of many types of behavioural plasticities, in some situations (e.g. when experience at an early age affects the behaviour expressed at subsequent ages), 'replicate individual' designs can provide useful insights into individual differences in behavioural plasticities. To date, researchers using within-individual and replicate individual designs have documented individual differences in all of the major categories of behavioural plasticities described herein. Another important question is whether and how different types of behavioural plasticities are related to one another. Currently there is empirical evidence that many behavioural plasticities [e.g. contextual plasticity, learning rates, IIV (intra-individual variability), endogenous plasticities, ontogenetic plasticities) can themselves vary as a function of experiences earlier in life, that is, many types of behavioural plasticity are themselves developmentally plastic. These findings support the assumption that differences among individuals in prior experiences may contribute to individual differences in behavioural plasticities observed at a given age. Several authors have predicted correlations across individuals between different types of behavioural plasticities, i.e. that some individuals will be generally more plastic than others. However, empirical support for most of these predictions, including indirect evidence from studies of relationships between personality traits and plasticities, is currently sparse and equivocal. The final section of this review suggests how an appreciation of the similarities and differences between different types of behavioural plasticities may help theoreticians formulate testable models to explain the evolution of individual differences in behavioural plasticities and the evolutionary and ecological consequences of individual differences in behavioural plasticities.

Collapse

Stamps JA, Krishnan VV. Individual differences in the potential and realized developmental plasticity of personality traits. Front Ecol Evol 2014. [DOI: 10.3389/fevo.2014.00069] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open

Learning bundles of stimuli renders stimulus order as a cue, not a confound. Proc Natl Acad Sci U S A 2014;111:14400-5. [PMID: 25246587 DOI: 10.1073/pnas.1416109111] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open