Visual-based Parameterized Proximal Policy Optimization
Master's === National Chiao Tung University === Institute of Computer Science and Engineering === 107 === We propose a visual-based proximal policy optimization method for parameterized (structured) action spaces, built on an actor-critic network. The method, named parameterized proximal policy optimization (P3O), is applied to RoboCup soccer simulation, robotic a...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: | 2018 |
Online Access: | http://ndltd.ncl.edu.tw/handle/c2d58n |