An Active Exploration Method for Data Efficient Reinforcement Learning

Reinforcement learning (RL) constitutes an effective method of controlling dynamic systems without prior knowledge. One of the most important and difficult problems in RL is the improvement of data efficiency. Probabilistic inference for learning control (PILCO) is a state-of-the-art data-efficient...

Bibliographic Details
Main Authors:	Zhao Dongfang, Liu Jiafeng, Wu Rui, Cheng Dansong, Tang Xianglong
Format:	Article
Language:	English
Published:	Sciendo 2019-06-01
Series:	International Journal of Applied Mathematics and Computer Science
Subjects:	reinforcement learning; information entropy; PILCO; data efficiency
Online Access:	https://doi.org/10.2478/amcs-2019-0026
