Design and Implementation of Reinforce Learning Based Fuzzy Gait Controller for Humanoid Robot

碩士 === 國立成功大學 === 電機工程學系碩博士班 === 97 === This thesis mainly proposes the implementation of gait learning control and the fuzzy based gait synthesis system for a small-sized humanoid robot. We accomplish the whole system on a biped robot named aiRobot-3. The machine learning approach we applied is pol...

Full description

Bibliographic Details
Main Authors:	Shao-wei Lai, 賴劭韋
Other Authors:	Tzuu-hseng Li
Format:	Others
Language:	en_US
Published:	2009
Online Access:	http://ndltd.ncl.edu.tw/handle/73722576689767430478

id	ndltd-TW-097NCKU5442125
record_format	oai_dc
spelling	ndltd-TW-097NCKU54421252016-05-04T04:25:26Z http://ndltd.ncl.edu.tw/handle/73722576689767430478 Design and Implementation of Reinforce Learning Based Fuzzy Gait Controller for Humanoid Robot 人形機器人之加強式模糊步態控制法之設計與實現 Shao-wei Lai 賴劭韋碩士國立成功大學電機工程學系碩博士班 97 This thesis mainly proposes the implementation of gait learning control and the fuzzy based gait synthesis system for a small-sized humanoid robot. We accomplish the whole system on a biped robot named aiRobot-3. The machine learning approach we applied is policy gradient reinforcement learning (PGRL) which can execute the real-time performance and directly adjust the policy without calculating action value function. Given a parameterized walking motion designed for our robot, PGRL algorithm automatically searches the set of possible parameters and finds the faster possible walking motion. The reward function we mainly considered is the velocity of our robot which can be estimated from the vision system on itself. However, our experiment illustrates that there are some stability problems in the learning process. In order to solve these problems, we also attempt to employ the desired Zero Moment Position (ZMP) trajectory as another reward for the reward function. The results show that the robot learned its gait from 30.6 mm/s to 130.6 mm/s in about 1.3 hours. It is faster than manual tuning parameters that we used before. Besides, for some advanced performance of our robot, we also apply fuzzy logic controller (FLC) in our strategy system. We use the information of its vision system as the input of the FLC and integrate the robot’s gait to perform such the tracking tasks. To acquire the motion that mapping to the output value, we employ Lagrange polynomial interpolation to transform the existing motions to the motion we want. Finally, we implement these fuzzy based gait synthesis strategies to the tasks such as chasing a ball and tracing a line for FIRA and Robocup competitions. Tzuu-hseng Li 李祖聖 2009 學位論文 ; thesis 75 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立成功大學 === 電機工程學系碩博士班 === 97 === This thesis mainly proposes the implementation of gait learning control and the fuzzy based gait synthesis system for a small-sized humanoid robot. We accomplish the whole system on a biped robot named aiRobot-3. The machine learning approach we applied is policy gradient reinforcement learning (PGRL) which can execute the real-time performance and directly adjust the policy without calculating action value function. Given a parameterized walking motion designed for our robot, PGRL algorithm automatically searches the set of possible parameters and finds the faster possible walking motion. The reward function we mainly considered is the velocity of our robot which can be estimated from the vision system on itself. However, our experiment illustrates that there are some stability problems in the learning process. In order to solve these problems, we also attempt to employ the desired Zero Moment Position (ZMP) trajectory as another reward for the reward function. The results show that the robot learned its gait from 30.6 mm/s to 130.6 mm/s in about 1.3 hours. It is faster than manual tuning parameters that we used before. Besides, for some advanced performance of our robot, we also apply fuzzy logic controller (FLC) in our strategy system. We use the information of its vision system as the input of the FLC and integrate the robot’s gait to perform such the tracking tasks. To acquire the motion that mapping to the output value, we employ Lagrange polynomial interpolation to transform the existing motions to the motion we want. Finally, we implement these fuzzy based gait synthesis strategies to the tasks such as chasing a ball and tracing a line for FIRA and Robocup competitions.
author2	Tzuu-hseng Li
author_facet	Tzuu-hseng Li Shao-wei Lai 賴劭韋
author	Shao-wei Lai 賴劭韋
spellingShingle	Shao-wei Lai 賴劭韋 Design and Implementation of Reinforce Learning Based Fuzzy Gait Controller for Humanoid Robot
author_sort	Shao-wei Lai
title	Design and Implementation of Reinforce Learning Based Fuzzy Gait Controller for Humanoid Robot
title_short	Design and Implementation of Reinforce Learning Based Fuzzy Gait Controller for Humanoid Robot
title_full	Design and Implementation of Reinforce Learning Based Fuzzy Gait Controller for Humanoid Robot
title_fullStr	Design and Implementation of Reinforce Learning Based Fuzzy Gait Controller for Humanoid Robot
title_full_unstemmed	Design and Implementation of Reinforce Learning Based Fuzzy Gait Controller for Humanoid Robot
title_sort	design and implementation of reinforce learning based fuzzy gait controller for humanoid robot
publishDate	2009
url	http://ndltd.ncl.edu.tw/handle/73722576689767430478
work_keys_str_mv	AT shaoweilai designandimplementationofreinforcelearningbasedfuzzygaitcontrollerforhumanoidrobot AT làishàowéi designandimplementationofreinforcelearningbasedfuzzygaitcontrollerforhumanoidrobot AT shaoweilai rénxíngjīqìrénzhījiāqiángshìmóhúbùtàikòngzhìfǎzhīshèjìyǔshíxiàn AT làishàowéi rénxíngjīqìrénzhījiāqiángshìmóhúbùtàikòngzhìfǎzhīshèjìyǔshíxiàn
_version_	1718257347383525376

Design and Implementation of Reinforce Learning Based Fuzzy Gait Controller for Humanoid Robot

Similar Items