What can associative learning do for planning?

There is a new associative learning paradox. The power of associative learning for producing flexible behaviour in non-human animals is downplayed or ignored by researchers in animal cognition, whereas artificial intelligence research shows that associative learning models can beat humans in chess....

Full description

Bibliographic Details
Main Author: Johan Lind
Format: Article
Language:English
Published: The Royal Society 2018-01-01
Series:Royal Society Open Science
Subjects:
Online Access:https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.180778
id doaj-53e6331bcc5d45568b228f2ef240145c
record_format Article
spelling doaj-53e6331bcc5d45568b228f2ef240145c2020-11-25T04:06:37ZengThe Royal SocietyRoyal Society Open Science2054-57032018-01-0151110.1098/rsos.180778180778What can associative learning do for planning?Johan LindThere is a new associative learning paradox. The power of associative learning for producing flexible behaviour in non-human animals is downplayed or ignored by researchers in animal cognition, whereas artificial intelligence research shows that associative learning models can beat humans in chess. One phenomenon in which associative learning often is ruled out as an explanation for animal behaviour is flexible planning. However, planning studies have been criticized and questions have been raised regarding both methodological validity and interpretations of results. Due to the power of associative learning and the uncertainty of what causes planning behaviour in non-human animals, I explored what associative learning can do for planning. A previously published sequence learning model which combines Pavlovian and instrumental conditioning was used to simulate two planning studies, namely Mulcahy & Call 2006 ‘Apes save tools for future use.’ Science 312, 1038–1040 and Kabadayi & Osvath 2017 ‘Ravens parallel great apes in flexible planning for tool-use and bartering.’ Science 357, 202–204. Simulations show that behaviour matching current definitions of flexible planning can emerge through associative learning. Through conditioned reinforcement, the learning model gives rise to planning behaviour by learning that a behaviour towards a current stimulus will produce high value food at a later stage; it can make decisions about future states not within current sensory scope. The simulations tracked key patterns both between and within studies. It is concluded that one cannot rule out that these studies of flexible planning in apes and corvids can be completely accounted for by associative learning. Future empirical studies of flexible planning in non-human animals can benefit from theoretical developments within artificial intelligence and animal learning.https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.180778planningassociative learningreinforcement learninganimal intelligenceflexible behaviour
collection DOAJ
language English
format Article
sources DOAJ
author Johan Lind
spellingShingle Johan Lind
What can associative learning do for planning?
Royal Society Open Science
planning
associative learning
reinforcement learning
animal intelligence
flexible behaviour
author_facet Johan Lind
author_sort Johan Lind
title What can associative learning do for planning?
title_short What can associative learning do for planning?
title_full What can associative learning do for planning?
title_fullStr What can associative learning do for planning?
title_full_unstemmed What can associative learning do for planning?
title_sort what can associative learning do for planning?
publisher The Royal Society
series Royal Society Open Science
issn 2054-5703
publishDate 2018-01-01
description There is a new associative learning paradox. The power of associative learning for producing flexible behaviour in non-human animals is downplayed or ignored by researchers in animal cognition, whereas artificial intelligence research shows that associative learning models can beat humans in chess. One phenomenon in which associative learning often is ruled out as an explanation for animal behaviour is flexible planning. However, planning studies have been criticized and questions have been raised regarding both methodological validity and interpretations of results. Due to the power of associative learning and the uncertainty of what causes planning behaviour in non-human animals, I explored what associative learning can do for planning. A previously published sequence learning model which combines Pavlovian and instrumental conditioning was used to simulate two planning studies, namely Mulcahy & Call 2006 ‘Apes save tools for future use.’ Science 312, 1038–1040 and Kabadayi & Osvath 2017 ‘Ravens parallel great apes in flexible planning for tool-use and bartering.’ Science 357, 202–204. Simulations show that behaviour matching current definitions of flexible planning can emerge through associative learning. Through conditioned reinforcement, the learning model gives rise to planning behaviour by learning that a behaviour towards a current stimulus will produce high value food at a later stage; it can make decisions about future states not within current sensory scope. The simulations tracked key patterns both between and within studies. It is concluded that one cannot rule out that these studies of flexible planning in apes and corvids can be completely accounted for by associative learning. Future empirical studies of flexible planning in non-human animals can benefit from theoretical developments within artificial intelligence and animal learning.
topic planning
associative learning
reinforcement learning
animal intelligence
flexible behaviour
url https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.180778
work_keys_str_mv AT johanlind whatcanassociativelearningdoforplanning
_version_ 1724431328654393344