Reinforcement learning using a continuous time actor-critic framework with spiking neurons.

Animals repeat rewarded behaviors, but the physiological basis of reward-based learning has only been partially elucidated. On one hand, experimental evidence shows that the neuromodulator dopamine carries information about rewards and affects synaptic plasticity. On the other hand, the theory of re...

Full description

Bibliographic Details
Main Authors: Nicolas Frémaux, Henning Sprekeler, Wulfram Gerstner
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-04-01
Series:PLoS Computational Biology
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23592970/?tool=EBI