Transition Based Discount Factor for Model Free Algorithms in Reinforcement Learning

Reinforcement Learning (RL) enables an agent to learn control policies for achieving its long-term goals. One key parameter of RL algorithms is a discount factor that scales down future cost in the state’s current value estimate. This study introduces and analyses a transition-based discount factor...

Full description

Bibliographic Details
Main Authors: Abhinav Sharma, Ruchir Gupta, K. Lakshmanan, Atul Gupta
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/13/7/1197