Least Squares Temporal Difference Methods: An Analysis under General Conditions

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) with the least squares temporal difference (LSTD) algorithm, LSTD($\lambda$), in an exploration-enhanced learning context, where policy costs are computed from observations of a Markov chain differe...

Bibliographic Details
Main Author:	Yu, Huizhen (Contributor)
Other Authors:	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems (Contributor)
Format:	Article
Language:	English
Published:	Society for Industrial and Applied Mathematics, 2013-03-12T18:09:37Z.
Subjects:	Article

Similar Items

Convergence Results for Some Temporal Difference Methods Based on Least Squares
by: Yu, Huizhen, et al.
Published: (2012)

Gauss–Newton–Secant Method for Solving Nonlinear Least Squares Problems under Generalized Lipschitz Conditions
by: Ioannis K. Argyros, et al.
Published: (2021-07-01)

Kernel Recursive Least-Squares Temporal Difference Algorithms with Sparsification and Regularization
by: Chunyuan Zhang, et al.
Published: (2016-01-01)

Inequalities and equalities associated with ordinary least squares and generalized least squares in partitioned linear models
by: Chu, Ka Lok, 1975-
Published: (2004)

Analysis of MIMO Receiver Using Generalized Least Squares Method in Colored Environments
by: Mohamed Lassaad Ammari, et al.
Published: (2014-01-01)

Total Least Squares Methods
by: Markovsky, Ivan, et al.
Published: (2010)

Methods for nonlinear least squares
by: Al-Baali, Mehiddin
Published: (1984)

The Moving Least Square Method
by: Jian-Shuo Huang, et al.
Published: (2010)

The generalized least square estimation of polychoric correlation.
Published: (1985)

Condition numbers of the minimum norm least squares solution for the least squares problem involving Kronecker products
by: Lingsheng Meng, et al.
Published: (2021-06-01)

Solving Ill-Conditioned Least Squares Problems
by: CHUNG WEN-CHIEH, et al.
Published: (2004)

Chebyshev Approximations by Least Squares Method
by: V.I. Zorkaltsev, et al.
Published: (2020-09-01)

Time Scale in Least Square Method
by: Özgür Yeniay, et al.
Published: (2014-01-01)

Overview of total least squares methods
by: Markovsky, Ivan, et al.
Published: (2007)

The General Least Square Deviation OWA Operator Problem
by: Dug Hun Hong, et al.
Published: (2019-04-01)

Least-squares methods for computational electromagnetics
by: Kolev, Tzanio Valentinov
Published: (2004)

Least Square Method for Concave Regression
by: Kuo-Lung Wang, et al.
Published: (2010)

Phylogenetic inference by generalized least squares : computational aspects
by: Abu Safia, Ahmed.
Published: (2005)

Theory of the generalized least squares estimator in parametric estimation
by: Hsing-Ti Wu, et al.
Published: (1993)

An Application of Least Squares Method for Image Process
by: Shih-Wei Yu, et al.
Published: (2019)

Study on Partial Regularized Least Squares Method
by: Yu-Ren Chiou, et al.
Published: (2008)

Constructive Analysis for Least Squares Regression with Generalized K-Norm Regularization
by: Cheng Wang, et al.
Published: (2014-01-01)

Buckling Analysis of Plates by the Moving Least Square Method
by: Hao-Chun Chuang, et al.
Published: (2012)

An Adaptive Policy Evaluation Network Based on Recursive Least Squares Temporal Difference With Gradient Correction
by: Dazi Li, et al.
Published: (2018-01-01)

Meshfree Least Square-based Finite Difference method in CFD applications
by: Sandnes, Pål Grøthe
Published: (2011)

Solving ill-conditioned Least Square Problems by the Truncated SVD Related Methods
by: Kai-Chun Wang, et al.
Published: (2013)

The Simulation of Model Selection Method for General Adaptive Penalized Least Squares and Comparison with Other Methods
by: 陳柏錞

Least Squares for Practitioners
by: J. A. Rod Blais
Published: (2010-01-01)

General Total Least Squares Theory for Geodetic Coordinate Transformations
by: Yuxin Qin, et al.
Published: (2020-04-01)

Constrained generalized least squares estimation of multivariate polychoric correlation.
Published: (1987)

Generalized Least Squares Based Channel Estimation for FBMC-OQAM
by: Vibhutesh Kumar Singh, et al.
Published: (2019-01-01)

On the nonnegative least squares
by: Santiago, Claudio Prata
Published: (2010)

Problems in least squares
by: Armbrust, Edward Leon.
Published: (2015)

Least squares approximations
by: Wiener, Marvin
Published: (2017)

Buckling Analysis of Close Cylindrical Shells by the Moving Least Square Method
by: Yu-Fen Hsiao, et al.
Published: (2012)

Deformation analysis with Total Least Squares
by: M. Acar, et al.
Published: (2006-01-01)

Least Squares Differential Quadrature Method for the Generalized Bagley–Torvik Fractional Differential Equation
by: Constantin Bota, et al.
Published: (2020-01-01)

A moving least squares meshless method for solving the generalized Kuramoto-Sivashinsky equation
by: E. Dabboura, et al.
Published: (2016-09-01)

Semismooth least squares methods for complementarity problems
by: Petra, Stefania
Published: (2006)

Least-Squares Method for Estimating Diffusion Coefficient
by: Abdolsadeh Neisy
Published: (2007-01-01)