1
|
Blackwell KT, Doya K. Enhancing reinforcement learning models by including direct and indirect pathways improves performance on striatal dependent tasks. PLoS Comput Biol 2023; 19:e1011385. [PMID: 37594982 PMCID: PMC10479916 DOI: 10.1371/journal.pcbi.1011385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 09/05/2023] [Accepted: 07/25/2023] [Indexed: 08/20/2023] Open
Abstract
A major advance in understanding learning behavior stems from experiments showing that reward learning requires dopamine inputs to striatal neurons and arises from synaptic plasticity of cortico-striatal synapses. Numerous reinforcement learning models mimic this dopamine-dependent synaptic plasticity by using the reward prediction error, which resembles dopamine neuron firing, to learn the best action in response to a set of cues. Though these models can explain many facets of behavior, reproducing some types of goal-directed behavior, such as renewal and reversal, require additional model components. Here we present a reinforcement learning model, TD2Q, which better corresponds to the basal ganglia with two Q matrices, one representing direct pathway neurons (G) and another representing indirect pathway neurons (N). Unlike previous two-Q architectures, a novel and critical aspect of TD2Q is to update the G and N matrices utilizing the temporal difference reward prediction error. A best action is selected for N and G using a softmax with a reward-dependent adaptive exploration parameter, and then differences are resolved using a second selection step applied to the two action probabilities. The model is tested on a range of multi-step tasks including extinction, renewal, discrimination; switching reward probability learning; and sequence learning. Simulations show that TD2Q produces behaviors similar to rodents in choice and sequence learning tasks, and that use of the temporal difference reward prediction error is required to learn multi-step tasks. Blocking the update rule on the N matrix blocks discrimination learning, as observed experimentally. Performance in the sequence learning task is dramatically improved with two matrices. These results suggest that including additional aspects of basal ganglia physiology can improve the performance of reinforcement learning models, better reproduce animal behaviors, and provide insight as to the role of direct- and indirect-pathway striatal neurons.
Collapse
Affiliation(s)
- Kim T Blackwell
- Department of Bioengineering, Volgenau School of Engineering, George Mason University, Fairfax, Virginia, United States of America
| | - Kenji Doya
- Neural Computation Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| |
Collapse
|
2
|
Grillner S, Robertson B, Kotaleski JH. Basal Ganglia—A Motion Perspective. Compr Physiol 2020; 10:1241-1275. [DOI: 10.1002/cphy.c190045] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
3
|
Giordano N, Iemolo A, Mancini M, Cacace F, De Risi M, Latagliata EC, Ghiglieri V, Bellenchi GC, Puglisi-Allegra S, Calabresi P, Picconi B, De Leonibus E. Motor learning and metaplasticity in striatal neurons: relevance for Parkinson's disease. Brain 2019; 141:505-520. [PMID: 29281030 DOI: 10.1093/brain/awx351] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2017] [Accepted: 10/29/2017] [Indexed: 01/08/2023] Open
Abstract
Nigro-striatal dopamine transmission is central to a wide range of neuronal functions, including skill learning, which is disrupted in several pathologies such as Parkinson's disease. The synaptic plasticity mechanisms, by which initial motor learning is stored for long time periods in striatal neurons, to then be gradually optimized upon subsequent training, remain unexplored. Addressing this issue is crucial to identify the synaptic and molecular mechanisms involved in striatal-dependent learning impairment in Parkinson's disease. In this study, we took advantage of interindividual differences between outbred rodents in reaching plateau performance in the rotarod incremental motor learning protocol, to study striatal synaptic plasticity ex vivo. We then assessed how this process is modulated by dopamine receptors and the dopamine active transporter, and whether it is impaired by overexpression of human α-synuclein in the mesencephalon; the latter is a progressive animal model of Parkinson's disease. We found that the initial acquisition of motor learning induced a dopamine active transporter and D1 receptors mediated long-term potentiation, under a protocol of long-term depression in striatal medium spiny neurons. This effect disappeared in animals reaching performance plateau. Overexpression of human α-synuclein reduced striatal dopamine active transporter levels, impaired motor learning, and prevented the learning-induced long-term potentiation, before the appearance of dopamine neuronal loss. Our findings provide evidence of a reorganization of cellular plasticity within the dorsolateral striatum that is mediated by dopamine receptors and dopamine active transporter during the acquisition of a skill. This newly identified mechanism of cellular memory is a form of metaplasticity that is disrupted in the early stage of synucleinopathies, such as Parkinson's disease, and that might be relevant for other striatal pathologies, such as drug abuse.
Collapse
Affiliation(s)
- Nadia Giordano
- Institute of Genetics and Biophysics (IGB), National Research Council, Naples, Italy.,Telethon Institute of Genetics and Medicine, Telethon Foundation, Pozzuoli, Italy
| | - Attilio Iemolo
- Institute of Genetics and Biophysics (IGB), National Research Council, Naples, Italy
| | - Maria Mancini
- Laboratory of Neurophysiology, Santa Lucia Foundation, IRCCS, Rome, Italy
| | - Fabrizio Cacace
- Laboratory of Neurophysiology, Santa Lucia Foundation, IRCCS, Rome, Italy
| | - Maria De Risi
- Institute of Genetics and Biophysics (IGB), National Research Council, Naples, Italy.,Telethon Institute of Genetics and Medicine, Telethon Foundation, Pozzuoli, Italy
| | - Emanuele Claudio Latagliata
- Laboratory of Neurophysiology, Santa Lucia Foundation, IRCCS, Rome, Italy.,Department of Psychology, University of Rome La Sapienza, Rome, Italy
| | - Veronica Ghiglieri
- Laboratory of Neurophysiology, Santa Lucia Foundation, IRCCS, Rome, Italy.,Department of Philosophy, Human, Social and Educational Sciences, University of Perugia, Perugia, Italy
| | - Gian Carlo Bellenchi
- Institute of Genetics and Biophysics (IGB), National Research Council, Naples, Italy
| | - Stefano Puglisi-Allegra
- Laboratory of Neurophysiology, Santa Lucia Foundation, IRCCS, Rome, Italy.,Department of Psychology, University of Rome La Sapienza, Rome, Italy
| | - Paolo Calabresi
- Laboratory of Neurophysiology, Santa Lucia Foundation, IRCCS, Rome, Italy.,Department of Medicine, Neurology Unit, University of Perugia, S. Andrea delle Fratte, Perugia, Italy
| | - Barbara Picconi
- Laboratory of Neurophysiology, Santa Lucia Foundation, IRCCS, Rome, Italy
| | - Elvira De Leonibus
- Institute of Genetics and Biophysics (IGB), National Research Council, Naples, Italy.,Telethon Institute of Genetics and Medicine, Telethon Foundation, Pozzuoli, Italy
| |
Collapse
|
4
|
Flores-Hernández J, Garzón-Vázquez JA, Hernández-Carballo G, Nieto-Mendoza E, Ruíz-Luna EA, Hernández-Echeagaray E. Striatal Neurodegeneration that Mimics Huntington's Disease Modifies GABA-induced Currents. Brain Sci 2018; 8:E217. [PMID: 30563250 PMCID: PMC6316731 DOI: 10.3390/brainsci8120217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Revised: 11/20/2018] [Accepted: 12/04/2018] [Indexed: 11/17/2022] Open
Abstract
Huntington's Disease (HD) is a degenerative disease which produces cognitive and motor disturbances. Treatment with GABAergic agonists improves the behavior and activity of mitochondrial complexes in rodents treated with 3-nitropropionic acid to mimic HD symptomatology. Apparently, GABA receptors activity may protect striatal medium spiny neurons (MSNs) from excitotoxic damage. This study evaluates whether mitochondrial inhibition with 3-NP that mimics the early stages of HD, modifies the kinetics and pharmacology of GABA receptors in patch clamp recorded dissociated MSNs cells. The results show that MSNs from mice treated with 3-NP exhibited differences in GABA-induced dose-response currents and pharmacological responses that suggests the presence of GABAC receptors in MSNs. Furthermore, there was a reduction in the effect of the GABAC antagonist that demonstrates a lessening of this GABA receptor subtype activity as a result of mitochondria inhibition.
Collapse
Affiliation(s)
- Jorge Flores-Hernández
- Instituto de Fisiología, Benemérita Universidad Autónoma de Puebla, Puebla C.P.72570, México.
| | | | | | - Elizabeth Nieto-Mendoza
- Laboratorio de neurofisiología del desarrollo y la neurodegeneración, UBIMED, FES-Iztacala, Universidad Nacional Autónoma de México, México, FES-Iztacala, Av. de Los Barrios #1, Los Reyes Iztacala, Tlalnepantla C.P.54090, México.
| | - Evelyn A Ruíz-Luna
- Instituto de Fisiología, Benemérita Universidad Autónoma de Puebla, Puebla C.P.72570, México.
| | - Elizabeth Hernández-Echeagaray
- Laboratorio de neurofisiología del desarrollo y la neurodegeneración, UBIMED, FES-Iztacala, Universidad Nacional Autónoma de México, México, FES-Iztacala, Av. de Los Barrios #1, Los Reyes Iztacala, Tlalnepantla C.P.54090, México.
| |
Collapse
|
5
|
Lindroos R, Dorst MC, Du K, Filipović M, Keller D, Ketzef M, Kozlov AK, Kumar A, Lindahl M, Nair AG, Pérez-Fernández J, Grillner S, Silberberg G, Hellgren Kotaleski J. Basal Ganglia Neuromodulation Over Multiple Temporal and Structural Scales-Simulations of Direct Pathway MSNs Investigate the Fast Onset of Dopaminergic Effects and Predict the Role of Kv4.2. Front Neural Circuits 2018; 12:3. [PMID: 29467627 PMCID: PMC5808142 DOI: 10.3389/fncir.2018.00003] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Accepted: 01/09/2018] [Indexed: 12/16/2022] Open
Abstract
The basal ganglia are involved in the motivational and habitual control of motor and cognitive behaviors. Striatum, the largest basal ganglia input stage, integrates cortical and thalamic inputs in functionally segregated cortico-basal ganglia-thalamic loops, and in addition the basal ganglia output nuclei control targets in the brainstem. Striatal function depends on the balance between the direct pathway medium spiny neurons (D1-MSNs) that express D1 dopamine receptors and the indirect pathway MSNs that express D2 dopamine receptors. The striatal microstructure is also divided into striosomes and matrix compartments, based on the differential expression of several proteins. Dopaminergic afferents from the midbrain and local cholinergic interneurons play crucial roles for basal ganglia function, and striatal signaling via the striosomes in turn regulates the midbrain dopaminergic system directly and via the lateral habenula. Consequently, abnormal functions of the basal ganglia neuromodulatory system underlie many neurological and psychiatric disorders. Neuromodulation acts on multiple structural levels, ranging from the subcellular level to behavior, both in health and disease. For example, neuromodulation affects membrane excitability and controls synaptic plasticity and thus learning in the basal ganglia. However, it is not clear on what time scales these different effects are implemented. Phosphorylation of ion channels and the resulting membrane effects are typically studied over minutes while it has been shown that neuromodulation can affect behavior within a few hundred milliseconds. So how do these seemingly contradictory effects fit together? Here we first briefly review neuromodulation of the basal ganglia, with a focus on dopamine. We furthermore use biophysically detailed multi-compartmental models to integrate experimental data regarding dopaminergic effects on individual membrane conductances with the aim to explain the resulting cellular level dopaminergic effects. In particular we predict dopaminergic effects on Kv4.2 in D1-MSNs. Finally, we also explore dynamical aspects of the onset of neuromodulation effects in multi-scale computational models combining biochemical signaling cascades and multi-compartmental neuron models.
Collapse
Affiliation(s)
- Robert Lindroos
- Department of Neuroscience, Nobel Institute for Neurophysiology, Stockholm, Sweden
| | - Matthijs C. Dorst
- Department of Neuroscience, Nobel Institute for Neurophysiology, Stockholm, Sweden
| | - Kai Du
- Department of Neuroscience, Nobel Institute for Neurophysiology, Stockholm, Sweden
| | - Marko Filipović
- Bernstein Center Freiburg, University of Freiburg, Freiburg im Breisgau, Germany
| | - Daniel Keller
- Blue Brain Project, Ecole Polytechnique Fédérale de Lausanne, Geneva, Switzerland
| | - Maya Ketzef
- Department of Neuroscience, Nobel Institute for Neurophysiology, Stockholm, Sweden
| | - Alexander K. Kozlov
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Solna, Sweden
| | - Arvind Kumar
- Bernstein Center Freiburg, University of Freiburg, Freiburg im Breisgau, Germany
- Department Computational Science and Technology, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Mikael Lindahl
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Solna, Sweden
| | - Anu G. Nair
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Solna, Sweden
| | - Juan Pérez-Fernández
- Department of Neuroscience, Nobel Institute for Neurophysiology, Stockholm, Sweden
| | - Sten Grillner
- Department of Neuroscience, Nobel Institute for Neurophysiology, Stockholm, Sweden
| | - Gilad Silberberg
- Department of Neuroscience, Nobel Institute for Neurophysiology, Stockholm, Sweden
| | - Jeanette Hellgren Kotaleski
- Department of Neuroscience, Nobel Institute for Neurophysiology, Stockholm, Sweden
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Solna, Sweden
| |
Collapse
|
6
|
Corrigendum to "Dopaminergic Modulation of Striatal Inhibitory Transmission and Long-Term Plasticity". Neural Plast 2017; 2017:3143428. [PMID: 28352478 PMCID: PMC5352900 DOI: 10.1155/2017/3143428] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2016] [Accepted: 12/20/2016] [Indexed: 11/17/2022] Open
Abstract
[This corrects the article DOI: 10.1155/2015/789502.].
Collapse
|
7
|
Hernández-Echeagaray E. Dopamine regulation of striatal inhibitory transmission and plasticity: dopamine, low or high? Neural Regen Res 2016; 11:214. [PMID: 27073360 PMCID: PMC4810971 DOI: 10.4103/1673-5374.177715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/11/2015] [Indexed: 11/22/2022] Open
Affiliation(s)
- Elizabeth Hernández-Echeagaray
- Laboratorio de Neurofisiología del Desarrollo y la Neurodegeneración, Unidad de Biomedicina, FES-I, Universidad Nacional Autónoma de México, Los Reyes Iztacala, C.P., Tlalnepantla México
| |
Collapse
|