Meta-inverse Reinforcement Learning Method Based on Relative Entropy

Aiming at the problem that traditional inverse reinforcement learning algorithms are slow,imprecise,or even unsolvable when solving the reward function owing to insufficient expert demonstration samples and unknown state transition probabilitie,a meta-reinforcement learning method based on relative...

Full description

Bibliographic Details
Main Author: WU Shao-bo, FU Qi-ming, CHEN Jian-ping, WU Hong-jie, LU You
Format: Article
Language:zho
Published: Editorial office of Computer Science 2021-09-01
Series:Jisuanji kexue
Subjects:
Online Access:http://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2021-9-257.pdf