Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language Processing

The first part of a two-part series of papers provides a survey on recent advances in Deep Reinforcement Learning (DRL) applications for solving partially observable Markov decision processes (POMDP) problems. Reinforcement Learning (RL) is an approach to simulate the human’s natural learning proces...

Full description

Bibliographic Details
Main Authors: Xuanchen Xiang, Simon Foo
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:Machine Learning and Knowledge Extraction
Subjects:
Online Access:https://www.mdpi.com/2504-4990/3/3/29
id doaj-4aaaaa9105f34a119951e453feeae065
record_format Article
spelling doaj-4aaaaa9105f34a119951e453feeae0652021-09-26T00:35:18ZengMDPI AGMachine Learning and Knowledge Extraction2504-49902021-07-0132955458110.3390/make3030029Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language ProcessingXuanchen Xiang0Simon Foo1Department of Electrical and Computer Engineering, FAMU-FSU College of Engineering, Tallahassee, FL 32310, USADepartment of Electrical and Computer Engineering, FAMU-FSU College of Engineering, Tallahassee, FL 32310, USAThe first part of a two-part series of papers provides a survey on recent advances in Deep Reinforcement Learning (DRL) applications for solving partially observable Markov decision processes (POMDP) problems. Reinforcement Learning (RL) is an approach to simulate the human’s natural learning process, whose key is to let the agent learn by interacting with the stochastic environment. The fact that the agent has limited access to the information of the environment enables AI to be applied efficiently in most fields that require self-learning. Although efficient algorithms are being widely used, it seems essential to have an organized investigation—we can make good comparisons and choose the best structures or algorithms when applying DRL in various applications. In this overview, we introduce Markov Decision Processes (MDP) problems and Reinforcement Learning and applications of DRL for solving POMDP problems in games, robotics, and natural language processing. A follow-up paper will cover applications in transportation, communications and networking, and industries.https://www.mdpi.com/2504-4990/3/3/29reinforcement learningdeep reinforcement learningMarkov decision processpartially observable Markov decision process
collection DOAJ
language English
format Article
sources DOAJ
author Xuanchen Xiang
Simon Foo
spellingShingle Xuanchen Xiang
Simon Foo
Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language Processing
Machine Learning and Knowledge Extraction
reinforcement learning
deep reinforcement learning
Markov decision process
partially observable Markov decision process
author_facet Xuanchen Xiang
Simon Foo
author_sort Xuanchen Xiang
title Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language Processing
title_short Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language Processing
title_full Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language Processing
title_fullStr Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language Processing
title_full_unstemmed Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language Processing
title_sort recent advances in deep reinforcement learning applications for solving partially observable markov decision processes (pomdp) problems: part 1—fundamentals and applications in games, robotics and natural language processing
publisher MDPI AG
series Machine Learning and Knowledge Extraction
issn 2504-4990
publishDate 2021-07-01
description The first part of a two-part series of papers provides a survey on recent advances in Deep Reinforcement Learning (DRL) applications for solving partially observable Markov decision processes (POMDP) problems. Reinforcement Learning (RL) is an approach to simulate the human’s natural learning process, whose key is to let the agent learn by interacting with the stochastic environment. The fact that the agent has limited access to the information of the environment enables AI to be applied efficiently in most fields that require self-learning. Although efficient algorithms are being widely used, it seems essential to have an organized investigation—we can make good comparisons and choose the best structures or algorithms when applying DRL in various applications. In this overview, we introduce Markov Decision Processes (MDP) problems and Reinforcement Learning and applications of DRL for solving POMDP problems in games, robotics, and natural language processing. A follow-up paper will cover applications in transportation, communications and networking, and industries.
topic reinforcement learning
deep reinforcement learning
Markov decision process
partially observable Markov decision process
url https://www.mdpi.com/2504-4990/3/3/29
work_keys_str_mv AT xuanchenxiang recentadvancesindeepreinforcementlearningapplicationsforsolvingpartiallyobservablemarkovdecisionprocessespomdpproblemspart1fundamentalsandapplicationsingamesroboticsandnaturallanguageprocessing
AT simonfoo recentadvancesindeepreinforcementlearningapplicationsforsolvingpartiallyobservablemarkovdecisionprocessespomdpproblemspart1fundamentalsandapplicationsingamesroboticsandnaturallanguageprocessing
_version_ 1716870335687557120