Scalable reward learning from demonstration
Reward learning from demonstration is the task of inferring the intents or goals of an agent demonstrating a task. Inverse reinforcement learning methods utilize the Markov decision process (MDP) framework to learn rewards, but typically scale poorly since they rely on the calculation of optimal val...
Main Authors: | Michini, Bernard J. (Contributor), How, Jonathan P. (Contributor), Cutler, Mark Johnson (Contributor) |
---|---|
Other Authors: | Massachusetts Institute of Technology. Aerospace Controls Laboratory (Contributor), Massachusetts Institute of Technology. Department of Aeronautics and Astronautics (Contributor) |
Format: | Article |
Language: | English |
Published: |
Institute of Electrical and Electronics Engineers (IEEE),
2015-05-08T18:42:15Z.
|
Subjects: | |
Online Access: | Get fulltext |
Similar Items
-
Bayesian nonparametric reward learning from demonstration
by: Michini, Bernard (Bernard J.)
Published: (2014) -
Lightweight infrared sensing for relative navigation of quadrotors
by: Cutler, Mark Johnson, et al.
Published: (2013) -
Bayesian Nonparametric Inverse Reinforcement Learning
by: How, Jonathan P., et al.
Published: (2013) -
Improving the efficiency of Bayesian inverse reinforcement learning
by: How, Jonathan P., et al.
Published: (2013) -
Comparison of Fixed and Variable Pitch Actuators for Agile Quadrotors
by: Cutler, Mark Johnson, et al.
Published: (2013)