Action selection performance of a reconfigurable Basal Ganglia inspired model with Hebbian-Bayesian Go-NoGo connectivity

Several studies have shown a strong involvement of the basal ganglia (BG) in action selection and dopamine dependent learning. The dopaminergic signal to striatum, the input stage of the BG, has been commonly described as coding a reward prediction error (RPE), i.e. the difference between the predic...

Full description

Bibliographic Details
Main Authors:	Pierre eBerthet, Jeanette eHellgren Kotaleski, Anders eLansner
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2012-10-01
Series:	Frontiers in Behavioral Neuroscience
Subjects:	Basal Ganglia Dopamine reinforcement learning BCPNN behaviour selection direct-indirect pathway
Online Access:	http://journal.frontiersin.org/Journal/10.3389/fnbeh.2012.00065/full

id	doaj-ae2ce0d87ab9419f9cca5c741ae717d8
record_format	Article
spelling	doaj-ae2ce0d87ab9419f9cca5c741ae717d82020-11-24T21:34:00ZengFrontiers Media S.A.Frontiers in Behavioral Neuroscience1662-51532012-10-01610.3389/fnbeh.2012.0006521746Action selection performance of a reconfigurable Basal Ganglia inspired model with Hebbian-Bayesian Go-NoGo connectivityPierre eBerthet0Pierre eBerthet1Pierre eBerthet2Jeanette eHellgren Kotaleski3Jeanette eHellgren Kotaleski4Anders eLansner5Anders eLansner6Anders eLansner7Stockholms UniversitetKTH Royal Institute of TechnologyStockholm Brain InstituteKTH Royal Institute of TechnologyStockholm Brain InstituteStockholms UniversitetKTH Royal Institute of TechnologyStockholm Brain InstituteSeveral studies have shown a strong involvement of the basal ganglia (BG) in action selection and dopamine dependent learning. The dopaminergic signal to striatum, the input stage of the BG, has been commonly described as coding a reward prediction error (RPE), i.e. the difference between the predicted and actual reward. The RPE has been hypothesized to be critical in the modulation of the synaptic plasticity in cortico-striatal synapses in the direct and indirect pathway. We developed an abstract computational model of the BG, with a dual pathway structure functionally corresponding to the direct and indirect pathways, and compared its behaviour to biological data as well as other reinforcement learning models. The computations in our model are inspired by Bayesian inference, and the synaptic plasticity changes depend on a three factor Hebbian-Bayesian learning rule based on co-activation of pre- and post-synaptic units and on the value of the RPE. The model builds on a modified Actor-Critic architecture and implements the direct (Go) and the indirect (NoGo) pathway, as well as the reward prediction (RP) system, acting in a complementary fashion. We investigated the performance of the model system when different configurations of the Go, NoGo and RP system were utilized, e.g. using only the Go, NoGo, or RP system, or combinations of those. Learning performance was investigated in several types of learning paradigms, such as learning-relearning, successive learning, stochastic learning, reversal learning and a two-choice task. The RPE and the activity of the model during learning were similar to monkey electrophysiological and behavioural data. Our results, however, show that there is not a unique best way to configure this BG model to handle well all the learning paradigms tested. We thus suggest that an agent might dynamically configure its action selection mode, possibly depending on task characteristics and also on how much time is available.http://journal.frontiersin.org/Journal/10.3389/fnbeh.2012.00065/fullBasal GangliaDopaminereinforcement learningBCPNNbehaviour selectiondirect-indirect pathway
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Pierre eBerthet Pierre eBerthet Pierre eBerthet Jeanette eHellgren Kotaleski Jeanette eHellgren Kotaleski Anders eLansner Anders eLansner Anders eLansner
spellingShingle	Pierre eBerthet Pierre eBerthet Pierre eBerthet Jeanette eHellgren Kotaleski Jeanette eHellgren Kotaleski Anders eLansner Anders eLansner Anders eLansner Action selection performance of a reconfigurable Basal Ganglia inspired model with Hebbian-Bayesian Go-NoGo connectivity Frontiers in Behavioral Neuroscience Basal Ganglia Dopamine reinforcement learning BCPNN behaviour selection direct-indirect pathway
author_facet	Pierre eBerthet Pierre eBerthet Pierre eBerthet Jeanette eHellgren Kotaleski Jeanette eHellgren Kotaleski Anders eLansner Anders eLansner Anders eLansner
author_sort	Pierre eBerthet
title	Action selection performance of a reconfigurable Basal Ganglia inspired model with Hebbian-Bayesian Go-NoGo connectivity
title_short	Action selection performance of a reconfigurable Basal Ganglia inspired model with Hebbian-Bayesian Go-NoGo connectivity
title_full	Action selection performance of a reconfigurable Basal Ganglia inspired model with Hebbian-Bayesian Go-NoGo connectivity
title_fullStr	Action selection performance of a reconfigurable Basal Ganglia inspired model with Hebbian-Bayesian Go-NoGo connectivity
title_full_unstemmed	Action selection performance of a reconfigurable Basal Ganglia inspired model with Hebbian-Bayesian Go-NoGo connectivity
title_sort	action selection performance of a reconfigurable basal ganglia inspired model with hebbian-bayesian go-nogo connectivity
publisher	Frontiers Media S.A.
series	Frontiers in Behavioral Neuroscience
issn	1662-5153
publishDate	2012-10-01
description	Several studies have shown a strong involvement of the basal ganglia (BG) in action selection and dopamine dependent learning. The dopaminergic signal to striatum, the input stage of the BG, has been commonly described as coding a reward prediction error (RPE), i.e. the difference between the predicted and actual reward. The RPE has been hypothesized to be critical in the modulation of the synaptic plasticity in cortico-striatal synapses in the direct and indirect pathway. We developed an abstract computational model of the BG, with a dual pathway structure functionally corresponding to the direct and indirect pathways, and compared its behaviour to biological data as well as other reinforcement learning models. The computations in our model are inspired by Bayesian inference, and the synaptic plasticity changes depend on a three factor Hebbian-Bayesian learning rule based on co-activation of pre- and post-synaptic units and on the value of the RPE. The model builds on a modified Actor-Critic architecture and implements the direct (Go) and the indirect (NoGo) pathway, as well as the reward prediction (RP) system, acting in a complementary fashion. We investigated the performance of the model system when different configurations of the Go, NoGo and RP system were utilized, e.g. using only the Go, NoGo, or RP system, or combinations of those. Learning performance was investigated in several types of learning paradigms, such as learning-relearning, successive learning, stochastic learning, reversal learning and a two-choice task. The RPE and the activity of the model during learning were similar to monkey electrophysiological and behavioural data. Our results, however, show that there is not a unique best way to configure this BG model to handle well all the learning paradigms tested. We thus suggest that an agent might dynamically configure its action selection mode, possibly depending on task characteristics and also on how much time is available.
topic	Basal Ganglia Dopamine reinforcement learning BCPNN behaviour selection direct-indirect pathway
url	http://journal.frontiersin.org/Journal/10.3389/fnbeh.2012.00065/full
work_keys_str_mv	AT pierreeberthet actionselectionperformanceofareconfigurablebasalgangliainspiredmodelwithhebbianbayesiangonogoconnectivity AT pierreeberthet actionselectionperformanceofareconfigurablebasalgangliainspiredmodelwithhebbianbayesiangonogoconnectivity AT pierreeberthet actionselectionperformanceofareconfigurablebasalgangliainspiredmodelwithhebbianbayesiangonogoconnectivity AT jeanetteehellgrenkotaleski actionselectionperformanceofareconfigurablebasalgangliainspiredmodelwithhebbianbayesiangonogoconnectivity AT jeanetteehellgrenkotaleski actionselectionperformanceofareconfigurablebasalgangliainspiredmodelwithhebbianbayesiangonogoconnectivity AT anderselansner actionselectionperformanceofareconfigurablebasalgangliainspiredmodelwithhebbianbayesiangonogoconnectivity AT anderselansner actionselectionperformanceofareconfigurablebasalgangliainspiredmodelwithhebbianbayesiangonogoconnectivity AT anderselansner actionselectionperformanceofareconfigurablebasalgangliainspiredmodelwithhebbianbayesiangonogoconnectivity
_version_	1725950884297310208

Action selection performance of a reconfigurable Basal Ganglia inspired model with Hebbian-Bayesian Go-NoGo connectivity

Similar Items