Quantum partially observable Markov decision processes

We present quantum observable Markov decision processes (QOMDPs), the quantum analogs of partially observable Markov decision processes (POMDPs). In a QOMDP, an agent is acting in a world where the state is represented as a quantum state and the agent can choose a superoperator to apply. This is sim...

Full description

Bibliographic Details
Main Authors: Barry, Jennifer (Author), Barry, Daniel T. (Author), Aaronson, Scott (Contributor)
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory (Contributor), Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format: Article
Language:English
Published: American Physical Society, 2014-09-12T17:39:21Z.
Subjects:
Online Access:Get fulltext
LEADER 01703 am a22002413u 4500
001 89468
042 |a dc 
100 1 0 |a Barry, Jennifer  |e author 
100 1 0 |a Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory  |e contributor 
100 1 0 |a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science  |e contributor 
100 1 0 |a Aaronson, Scott  |e contributor 
700 1 0 |a Barry, Daniel T.  |e author 
700 1 0 |a Aaronson, Scott  |e author 
245 0 0 |a Quantum partially observable Markov decision processes 
260 |b American Physical Society,   |c 2014-09-12T17:39:21Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/89468 
520 |a We present quantum observable Markov decision processes (QOMDPs), the quantum analogs of partially observable Markov decision processes (POMDPs). In a QOMDP, an agent is acting in a world where the state is represented as a quantum state and the agent can choose a superoperator to apply. This is similar to the POMDP belief state, which is a probability distribution over world states and evolves via a stochastic matrix. We show that the existence of a policy of at least a certain value has the same complexity for QOMDPs and POMDPs in the polynomial and infinite horizon cases. However, we also prove that the existence of a policy that can reach a goal state is decidable for goal POMDPs and undecidable for goal QOMDPs. 
520 |a National Science Foundation (U.S.) (Grant 0844626) 
520 |a National Science Foundation (U.S.) (Grant 1122374) 
520 |a National Science Foundation (U.S.) (Waterman Award) 
546 |a en 
655 7 |a Article 
773 |t Physical Review A