Scalable reward learning from demonstration

Reward learning from demonstration is the task of inferring the intents or goals of an agent demonstrating a task. Inverse reinforcement learning methods utilize the Markov decision process (MDP) framework to learn rewards, but typically scale poorly since they rely on the calculation of optimal val...

Full description

Bibliographic Details
Main Authors: Michini, Bernard J. (Contributor), How, Jonathan P. (Contributor), Cutler, Mark Johnson (Contributor)
Other Authors: Massachusetts Institute of Technology. Aerospace Controls Laboratory (Contributor), Massachusetts Institute of Technology. Department of Aeronautics and Astronautics (Contributor)
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers (IEEE), 2015-05-08T18:42:15Z.
Subjects:
Online Access:Get fulltext