Improved Q-Learning Method for Linear Discrete-Time Systems

In this paper, the Q-learning method for quadratic optimal control problem of discrete-time linear systems is reconsidered. The theoretical results prove that the quadratic optimal controller cannot be solved directly due to the linear correlation of the data sets. The following corollaries have bee...

Full description

Bibliographic Details
Main Authors: Jian Chen, Jinhua Wang, Jie Huang
Format: Article
Language:English
Published: MDPI AG 2020-03-01
Series:Processes
Subjects:
Online Access:https://www.mdpi.com/2227-9717/8/3/368