Improved Q-Learning Method for Linear Discrete-Time Systems
In this paper, the Q-learning method for quadratic optimal control problem of discrete-time linear systems is reconsidered. The theoretical results prove that the quadratic optimal controller cannot be solved directly due to the linear correlation of the data sets. The following corollaries have bee...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-03-01
|
Series: | Processes |
Subjects: | |
Online Access: | https://www.mdpi.com/2227-9717/8/3/368 |