Dyna learning with deep belief networks

The objective of reinforcement learning is to find "good" actions in an environment where feedback is provided through a numerical reward, and the current state (i.e. sensory input) is assumed to be available at each time step. The notion of "good" is defined as maximizing the e...

Full description

Bibliographic Details
Main Author: Faulkner, Ryan
Other Authors: Doina Precup (Internal/Supervisor)
Format: Others
Language:en
Published: McGill University 2011
Subjects:
Online Access:http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=97177