Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention
Reinforcement learning describes the process by which during a series of trial-and-error attempts, actions that culminate in reward are reinforced, becoming more likely to be chosen in similar circumstances. When decisions are based on sensory stimuli, an association is formed between the stimulus,...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2018-06-01
|
Series: | Frontiers in Neuroscience |
Subjects: | |
Online Access: | https://www.frontiersin.org/article/10.3389/fnins.2018.00356/full |
id |
doaj-7177bb27824842f4bc1e0440f7aec794 |
---|---|
record_format |
Article |
spelling |
doaj-7177bb27824842f4bc1e0440f7aec7942020-11-24T21:11:34ZengFrontiers Media S.A.Frontiers in Neuroscience1662-453X2018-06-011210.3389/fnins.2018.00356367421Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of AttentionFlavia Aluisi0Anna Rubinchik1Genela Morris2Sagol Department of Neurobiology, University of Haifa, Haifa, IsraelDepartment of Economics, University of Haifa, Haifa, IsraelSagol Department of Neurobiology, University of Haifa, Haifa, IsraelReinforcement learning describes the process by which during a series of trial-and-error attempts, actions that culminate in reward are reinforced, becoming more likely to be chosen in similar circumstances. When decisions are based on sensory stimuli, an association is formed between the stimulus, the action and the reward. Computational, behavioral and neurobiological accounts of this process successfully explain simple learning of stimuli that differ in one aspect, or along a single stimulus dimension. However, when stimuli may vary across several dimensions, identifying which features are relevant for the reward is not trivial, and the underlying cognitive process is poorly understood. To study this we adapted an intra-dimensional/ extra-dimensional set-shifting paradigm to train rats on a multi-sensory discrimination task. In our setup, stimuli of different modalities (spatial, olfactory and visual) are combined into complex cues and manipulated independently. In each set, only a single stimulus dimension is relevant for reward. To distinguish between learning and decision-making we suggest a weighted attention model (WAM). Our model learns by assigning a separate learning rule for the values of features of each dimension (e.g., for each color), reinforced after every experience. Decisions are made by comparing weighted averages of the learnt values, factored by dimension specific weights. Based on the observed behavior of the rats we estimated the parameters of the WAM and demonstrated that it outperforms an alternative model, in which a learnt value is assigned to each combination of features. Estimated decision weights of the WAM reveal an experience-based bias in learning. In the first experimental set the weights associated with all dimensions were similar. The extra-dimensional shift rendered this dimension irrelevant. However, its decision weight remained high for the early learning stage in this last set, providing an explanation for the poor performance of the animals. Thus, estimated weights can be viewed as a possible way to quantify the experience-based bias.https://www.frontiersin.org/article/10.3389/fnins.2018.00356/fullreinforcement learningattentionset-shiftingrule-learninganimal behavior |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Flavia Aluisi Anna Rubinchik Genela Morris |
spellingShingle |
Flavia Aluisi Anna Rubinchik Genela Morris Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention Frontiers in Neuroscience reinforcement learning attention set-shifting rule-learning animal behavior |
author_facet |
Flavia Aluisi Anna Rubinchik Genela Morris |
author_sort |
Flavia Aluisi |
title |
Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention |
title_short |
Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention |
title_full |
Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention |
title_fullStr |
Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention |
title_full_unstemmed |
Animal Learning in a Multidimensional Discrimination Task as Explained by Dimension-Specific Allocation of Attention |
title_sort |
animal learning in a multidimensional discrimination task as explained by dimension-specific allocation of attention |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Neuroscience |
issn |
1662-453X |
publishDate |
2018-06-01 |
description |
Reinforcement learning describes the process by which during a series of trial-and-error attempts, actions that culminate in reward are reinforced, becoming more likely to be chosen in similar circumstances. When decisions are based on sensory stimuli, an association is formed between the stimulus, the action and the reward. Computational, behavioral and neurobiological accounts of this process successfully explain simple learning of stimuli that differ in one aspect, or along a single stimulus dimension. However, when stimuli may vary across several dimensions, identifying which features are relevant for the reward is not trivial, and the underlying cognitive process is poorly understood. To study this we adapted an intra-dimensional/ extra-dimensional set-shifting paradigm to train rats on a multi-sensory discrimination task. In our setup, stimuli of different modalities (spatial, olfactory and visual) are combined into complex cues and manipulated independently. In each set, only a single stimulus dimension is relevant for reward. To distinguish between learning and decision-making we suggest a weighted attention model (WAM). Our model learns by assigning a separate learning rule for the values of features of each dimension (e.g., for each color), reinforced after every experience. Decisions are made by comparing weighted averages of the learnt values, factored by dimension specific weights. Based on the observed behavior of the rats we estimated the parameters of the WAM and demonstrated that it outperforms an alternative model, in which a learnt value is assigned to each combination of features. Estimated decision weights of the WAM reveal an experience-based bias in learning. In the first experimental set the weights associated with all dimensions were similar. The extra-dimensional shift rendered this dimension irrelevant. However, its decision weight remained high for the early learning stage in this last set, providing an explanation for the poor performance of the animals. Thus, estimated weights can be viewed as a possible way to quantify the experience-based bias. |
topic |
reinforcement learning attention set-shifting rule-learning animal behavior |
url |
https://www.frontiersin.org/article/10.3389/fnins.2018.00356/full |
work_keys_str_mv |
AT flaviaaluisi animallearninginamultidimensionaldiscriminationtaskasexplainedbydimensionspecificallocationofattention AT annarubinchik animallearninginamultidimensionaldiscriminationtaskasexplainedbydimensionspecificallocationofattention AT genelamorris animallearninginamultidimensionaldiscriminationtaskasexplainedbydimensionspecificallocationofattention |
_version_ |
1716753034904600576 |