Markov Decision Process Based Akashic Record Learning Method and Its Application

Master's === National Formosa University === Master's Program, Department of Computer Science and Information Engineering === 107 === This thesis starts from the Markov decision process. First, we briefly introduce the current mainstream approach in reinforcement learning, the Q-function learning method, and improved methods built on it. The system that reduce...


Bibliographic Details
Main Authors: CHENG, CHENG-SHAO, 鄭丞劭
Other Authors: JENG, JIN-TSONG
Format: Others
Language: en_US
Published: 2019
Online Access: http://ndltd.ncl.edu.tw/handle/qgrh8s