Visual-based Parameterized Proximal Policy Optimization

碩士 === 國立交通大學 === 資訊科學與工程研究所 === 107 === We proposes a visual-based proximal policy optimization in parameterized (structured) action spaces based on the actor critic network. The optimization, named parameterized proximal policy optimization (P3O), is applied to RoboCup soccer simulation, robotic a...

Full description

Bibliographic Details
Main Authors: Huang, Ming-Xu, 黃明旭
Other Authors: Wu, I-Chen
Format: Others
Language:en_US
Published: 2018
Online Access:http://ndltd.ncl.edu.tw/handle/c2d58n