Stuck state avoidance through PID estimation training of Q-learning agent

Reinforcement learning is conceptually based on an agent learning through interaction with its environment. This trial-and-error learning method makes the process prone to situations in which the agent is stuck in a dead-end, from which it cannot keep learning. This thesis studies a method to dimini...

Full description

Bibliographic Details
Main Authors:	Moritz, Johan, Winkelmann, Albin
Format:	Others
Language:	English
Published:	KTH, Skolan för elektroteknik och datavetenskap (EECS) 2019
Subjects:	Q-learning QL PID wheeled inverted pendulum WIP reinforcement learning estimation training självbalanserande robot estimeringsträning Computer and Information Sciences Data- och informationsvetenskap
Online Access:	http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-264562

Internet

http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-264562

Stuck state avoidance through PID estimation training of Q-learning agent

Internet

Similar Items