Measuring and Influencing Sequential Joint Agent Behaviours

Algorithmically designed reward functions can influence groups of learning agents toward measurable desired sequential joint behaviours. Influencing learning agents toward desirable behaviours is non-trivial due to the difficulties of assigning credit for global success to the deserving agents and o...

Full description

Bibliographic Details
Main Author: Raffensperger, Peter Abraham
Language:en
Published: University of Canterbury. Electrical and Computer Engineering 2013
Subjects:
Online Access:http://hdl.handle.net/10092/7472