Stuck state avoidance through PID estimation training of Q-learning agent

Reinforcement learning is conceptually based on an agent learning through interaction with its environment. This trial-and-error learning method makes the process prone to situations in which the agent is stuck in a dead-end, from which it cannot keep learning. This thesis studies a method to dimini...

Full description

Bibliographic Details
Main Authors: Moritz, Johan, Winkelmann, Albin
Format: Others
Language:English
Published: KTH, Skolan för elektroteknik och datavetenskap (EECS) 2019
Subjects:
QL
PID
WIP
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-264562