Dyna learning with deep belief networks

The objective of reinforcement learning is to find "good" actions in an environment where feedback is provided through a numerical reward, and the current state (i.e. sensory input) is assumed to be available at each time step. The notion of "good" is defined as maximizing the e...


Bibliographic Details
Main Author:	Faulkner, Ryan
Other Authors:	Doina Precup (Internal/Supervisor)
Format:	Others
Language:	en
Published:	McGill University 2011
Subjects:	Applied Sciences - Computer Science
Online Access:	http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=97177
