An Active Exploration Method for Data Efficient Reinforcement Learning

Reinforcement learning (RL) constitutes an effective method of controlling dynamic systems without prior knowledge. One of the most important and difficult problems in RL is the improvement of data efficiency. Probabilistic inference for learning control (PILCO) is a state-of-the-art data-efficient...

Full description

Bibliographic Details
Main Authors: Zhao Dongfang, Liu Jiafeng, Wu Rui, Cheng Dansong, Tang Xianglong
Format: Article
Language:English
Published: Sciendo 2019-06-01
Series:International Journal of Applied Mathematics and Computer Science
Subjects:
Online Access:https://doi.org/10.2478/amcs-2019-0026