Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention

Reinforcement learning describes the process by which during a series of trial-and-error attempts, actions that culminate in reward are reinforced, becoming more likely to be chosen in similar circumstances. When decisions are based on sensory stimuli, an association is formed between the stimulus,...

Full description

Bibliographic Details
Main Authors: Flavia Aluisi, Anna Rubinchik, Genela Morris
Format: Article
Language:English
Published: Frontiers Media S.A. 2018-06-01
Series:Frontiers in Neuroscience
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fnins.2018.00356/full
id doaj-7177bb27824842f4bc1e0440f7aec794
record_format Article
spelling doaj-7177bb27824842f4bc1e0440f7aec7942020-11-24T21:11:34ZengFrontiers Media S.A.Frontiers in Neuroscience1662-453X2018-06-011210.3389/fnins.2018.00356367421Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of AttentionFlavia Aluisi0Anna Rubinchik1Genela Morris2Sagol Department of Neurobiology, University of Haifa, Haifa, IsraelDepartment of Economics, University of Haifa, Haifa, IsraelSagol Department of Neurobiology, University of Haifa, Haifa, IsraelReinforcement learning describes the process by which during a series of trial-and-error attempts, actions that culminate in reward are reinforced, becoming more likely to be chosen in similar circumstances. When decisions are based on sensory stimuli, an association is formed between the stimulus, the action and the reward. Computational, behavioral and neurobiological accounts of this process successfully explain simple learning of stimuli that differ in one aspect, or along a single stimulus dimension. However, when stimuli may vary across several dimensions, identifying which features are relevant for the reward is not trivial, and the underlying cognitive process is poorly understood. To study this we adapted an intra-dimensional/ extra-dimensional set-shifting paradigm to train rats on a multi-sensory discrimination task. In our setup, stimuli of different modalities (spatial, olfactory and visual) are combined into complex cues and manipulated independently. In each set, only a single stimulus dimension is relevant for reward. To distinguish between learning and decision-making we suggest a weighted attention model (WAM). Our model learns by assigning a separate learning rule for the values of features of each dimension (e.g., for each color), reinforced after every experience. Decisions are made by comparing weighted averages of the learnt values, factored by dimension specific weights. Based on the observed behavior of the rats we estimated the parameters of the WAM and demonstrated that it outperforms an alternative model, in which a learnt value is assigned to each combination of features. Estimated decision weights of the WAM reveal an experience-based bias in learning. In the first experimental set the weights associated with all dimensions were similar. The extra-dimensional shift rendered this dimension irrelevant. However, its decision weight remained high for the early learning stage in this last set, providing an explanation for the poor performance of the animals. Thus, estimated weights can be viewed as a possible way to quantify the experience-based bias.https://www.frontiersin.org/article/10.3389/fnins.2018.00356/fullreinforcement learningattentionset-shiftingrule-learninganimal behavior
collection DOAJ
language English
format Article
sources DOAJ
author Flavia Aluisi
Anna Rubinchik
Genela Morris
spellingShingle Flavia Aluisi
Anna Rubinchik
Genela Morris
Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention
Frontiers in Neuroscience
reinforcement learning
attention
set-shifting
rule-learning
animal behavior
author_facet Flavia Aluisi
Anna Rubinchik
Genela Morris
author_sort Flavia Aluisi
title Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention
title_short Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention
title_full Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention
title_fullStr Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention
title_full_unstemmed Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention
title_sort animal learning in a multidimensional discrimination task as explained by dimension-specific allocation of attention
publisher Frontiers Media S.A.
series Frontiers in Neuroscience
issn 1662-453X
publishDate 2018-06-01
description Reinforcement learning describes the process by which during a series of trial-and-error attempts, actions that culminate in reward are reinforced, becoming more likely to be chosen in similar circumstances. When decisions are based on sensory stimuli, an association is formed between the stimulus, the action and the reward. Computational, behavioral and neurobiological accounts of this process successfully explain simple learning of stimuli that differ in one aspect, or along a single stimulus dimension. However, when stimuli may vary across several dimensions, identifying which features are relevant for the reward is not trivial, and the underlying cognitive process is poorly understood. To study this we adapted an intra-dimensional/ extra-dimensional set-shifting paradigm to train rats on a multi-sensory discrimination task. In our setup, stimuli of different modalities (spatial, olfactory and visual) are combined into complex cues and manipulated independently. In each set, only a single stimulus dimension is relevant for reward. To distinguish between learning and decision-making we suggest a weighted attention model (WAM). Our model learns by assigning a separate learning rule for the values of features of each dimension (e.g., for each color), reinforced after every experience. Decisions are made by comparing weighted averages of the learnt values, factored by dimension specific weights. Based on the observed behavior of the rats we estimated the parameters of the WAM and demonstrated that it outperforms an alternative model, in which a learnt value is assigned to each combination of features. Estimated decision weights of the WAM reveal an experience-based bias in learning. In the first experimental set the weights associated with all dimensions were similar. The extra-dimensional shift rendered this dimension irrelevant. However, its decision weight remained high for the early learning stage in this last set, providing an explanation for the poor performance of the animals. Thus, estimated weights can be viewed as a possible way to quantify the experience-based bias.
topic reinforcement learning
attention
set-shifting
rule-learning
animal behavior
url https://www.frontiersin.org/article/10.3389/fnins.2018.00356/full
work_keys_str_mv AT flaviaaluisi animallearninginamultidimensionaldiscriminationtaskasexplainedbydimensionspecificallocationofattention
AT annarubinchik animallearninginamultidimensionaldiscriminationtaskasexplainedbydimensionspecificallocationofattention
AT genelamorris animallearninginamultidimensionaldiscriminationtaskasexplainedbydimensionspecificallocationofattention
_version_ 1716753034904600576