Linear combination of one-step predictive information with an external reward in an episodic policy gradient setting: a critical analysis

One of the main challenges in the field of embodied artificial intelligence is the open-ended autonomous learning of complex behaviours. Our approach is to use task-independent, information-driven intrinsic motivation(s) to support task-dependent learning. The work presented here is a preliminary st...

Bibliographic Details
Main Authors: Keyan Zahedi, Georg Martius, Nihat Ay
Format: Article
Language: English
Published: Frontiers Media S.A. 2013-11-01
Series: Frontiers in Psychology
Online Access: http://journal.frontiersin.org/Journal/10.3389/fpsyg.2013.00801/full