Measuring and Influencing Sequential Joint Agent Behaviours
Algorithmically designed reward functions can influence groups of learning agents toward measurable desired sequential joint behaviours. Influencing learning agents toward desirable behaviours is non-trivial due to the difficulties of assigning credit for global success to the deserving agents and o...
Main Author: | |
---|---|
Language: | en |
Published: |
University of Canterbury. Electrical and Computer Engineering
2013
|
Subjects: | |
Online Access: | http://hdl.handle.net/10092/7472 |