Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention

Reinforcement learning describes the process by which during a series of trial-and-error attempts, actions that culminate in reward are reinforced, becoming more likely to be chosen in similar circumstances. When decisions are based on sensory stimuli, an association is formed between the stimulus,...

Full description

Bibliographic Details
Main Authors:	Flavia Aluisi, Anna Rubinchik, Genela Morris
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2018-06-01
Series:	Frontiers in Neuroscience
Subjects:	reinforcement learning attention set-shifting rule-learning animal behavior
Online Access:	https://www.frontiersin.org/article/10.3389/fnins.2018.00356/full

id	doaj-7177bb27824842f4bc1e0440f7aec794
record_format	Article
spelling	doaj-7177bb27824842f4bc1e0440f7aec7942020-11-24T21:11:34ZengFrontiers Media S.A.Frontiers in Neuroscience1662-453X2018-06-011210.3389/fnins.2018.00356367421Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of AttentionFlavia Aluisi0Anna Rubinchik1Genela Morris2Sagol Department of Neurobiology, University of Haifa, Haifa, IsraelDepartment of Economics, University of Haifa, Haifa, IsraelSagol Department of Neurobiology, University of Haifa, Haifa, IsraelReinforcement learning describes the process by which during a series of trial-and-error attempts, actions that culminate in reward are reinforced, becoming more likely to be chosen in similar circumstances. When decisions are based on sensory stimuli, an association is formed between the stimulus, the action and the reward. Computational, behavioral and neurobiological accounts of this process successfully explain simple learning of stimuli that differ in one aspect, or along a single stimulus dimension. However, when stimuli may vary across several dimensions, identifying which features are relevant for the reward is not trivial, and the underlying cognitive process is poorly understood. To study this we adapted an intra-dimensional/ extra-dimensional set-shifting paradigm to train rats on a multi-sensory discrimination task. In our setup, stimuli of different modalities (spatial, olfactory and visual) are combined into complex cues and manipulated independently. In each set, only a single stimulus dimension is relevant for reward. To distinguish between learning and decision-making we suggest a weighted attention model (WAM). Our model learns by assigning a separate learning rule for the values of features of each dimension (e.g., for each color), reinforced after every experience. Decisions are made by comparing weighted averages of the learnt values, factored by dimension specific weights. Based on the observed behavior of the rats we estimated the parameters of the WAM and demonstrated that it outperforms an alternative model, in which a learnt value is assigned to each combination of features. Estimated decision weights of the WAM reveal an experience-based bias in learning. In the first experimental set the weights associated with all dimensions were similar. The extra-dimensional shift rendered this dimension irrelevant. However, its decision weight remained high for the early learning stage in this last set, providing an explanation for the poor performance of the animals. Thus, estimated weights can be viewed as a possible way to quantify the experience-based bias.https://www.frontiersin.org/article/10.3389/fnins.2018.00356/fullreinforcement learningattentionset-shiftingrule-learninganimal behavior
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Flavia Aluisi Anna Rubinchik Genela Morris
spellingShingle	Flavia Aluisi Anna Rubinchik Genela Morris Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention Frontiers in Neuroscience reinforcement learning attention set-shifting rule-learning animal behavior
author_facet	Flavia Aluisi Anna Rubinchik Genela Morris
author_sort	Flavia Aluisi
title	Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention
title_short	Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention
title_full	Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention
title_fullStr	Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention
title_full_unstemmed	Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention
title_sort	animal learning in a multidimensional discrimination task as explained by dimension-specific allocation of attention
publisher	Frontiers Media S.A.
series	Frontiers in Neuroscience
issn	1662-453X
publishDate	2018-06-01
description	Reinforcement learning describes the process by which during a series of trial-and-error attempts, actions that culminate in reward are reinforced, becoming more likely to be chosen in similar circumstances. When decisions are based on sensory stimuli, an association is formed between the stimulus, the action and the reward. Computational, behavioral and neurobiological accounts of this process successfully explain simple learning of stimuli that differ in one aspect, or along a single stimulus dimension. However, when stimuli may vary across several dimensions, identifying which features are relevant for the reward is not trivial, and the underlying cognitive process is poorly understood. To study this we adapted an intra-dimensional/ extra-dimensional set-shifting paradigm to train rats on a multi-sensory discrimination task. In our setup, stimuli of different modalities (spatial, olfactory and visual) are combined into complex cues and manipulated independently. In each set, only a single stimulus dimension is relevant for reward. To distinguish between learning and decision-making we suggest a weighted attention model (WAM). Our model learns by assigning a separate learning rule for the values of features of each dimension (e.g., for each color), reinforced after every experience. Decisions are made by comparing weighted averages of the learnt values, factored by dimension specific weights. Based on the observed behavior of the rats we estimated the parameters of the WAM and demonstrated that it outperforms an alternative model, in which a learnt value is assigned to each combination of features. Estimated decision weights of the WAM reveal an experience-based bias in learning. In the first experimental set the weights associated with all dimensions were similar. The extra-dimensional shift rendered this dimension irrelevant. However, its decision weight remained high for the early learning stage in this last set, providing an explanation for the poor performance of the animals. Thus, estimated weights can be viewed as a possible way to quantify the experience-based bias.
topic	reinforcement learning attention set-shifting rule-learning animal behavior
url	https://www.frontiersin.org/article/10.3389/fnins.2018.00356/full
work_keys_str_mv	AT flaviaaluisi animallearninginamultidimensionaldiscriminationtaskasexplainedbydimensionspecificallocationofattention AT annarubinchik animallearninginamultidimensionaldiscriminationtaskasexplainedbydimensionspecificallocationofattention AT genelamorris animallearninginamultidimensionaldiscriminationtaskasexplainedbydimensionspecificallocationofattention
_version_	1716753034904600576

Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention

Similar Items