|
|
|
|
LEADER |
01703 am a22002413u 4500 |
001 |
89468 |
042 |
|
|
|a dc
|
100 |
1 |
0 |
|a Barry, Jennifer
|e author
|
100 |
1 |
0 |
|a Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
|e contributor
|
100 |
1 |
0 |
|a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
|e contributor
|
100 |
1 |
0 |
|a Aaronson, Scott
|e contributor
|
700 |
1 |
0 |
|a Barry, Daniel T.
|e author
|
700 |
1 |
0 |
|a Aaronson, Scott
|e author
|
245 |
0 |
0 |
|a Quantum partially observable Markov decision processes
|
260 |
|
|
|b American Physical Society,
|c 2014-09-12T17:39:21Z.
|
856 |
|
|
|z Get fulltext
|u http://hdl.handle.net/1721.1/89468
|
520 |
|
|
|a We present quantum observable Markov decision processes (QOMDPs), the quantum analogs of partially observable Markov decision processes (POMDPs). In a QOMDP, an agent is acting in a world where the state is represented as a quantum state and the agent can choose a superoperator to apply. This is similar to the POMDP belief state, which is a probability distribution over world states and evolves via a stochastic matrix. We show that the existence of a policy of at least a certain value has the same complexity for QOMDPs and POMDPs in the polynomial and infinite horizon cases. However, we also prove that the existence of a policy that can reach a goal state is decidable for goal POMDPs and undecidable for goal QOMDPs.
|
520 |
|
|
|a National Science Foundation (U.S.) (Grant 0844626)
|
520 |
|
|
|a National Science Foundation (U.S.) (Grant 1122374)
|
520 |
|
|
|a National Science Foundation (U.S.) (Waterman Award)
|
546 |
|
|
|a en
|
655 |
7 |
|
|a Article
|
773 |
|
|
|t Physical Review A
|