An online algorithm for constrained POMDPs

This work seeks to address the problem of planning in the presence of uncertainty and constraints. Such problems arise in many situations, including the basis of this work, which involves planning for a team of first responders (both humans and robots) operating in an urban environment. The problem...

Full description

Bibliographic Details
Main Authors: Undurti, Aditya (Contributor), How, Jonathan P. (Contributor)
Other Authors: Massachusetts Institute of Technology. Department of Aeronautics and Astronautics (Contributor)
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers, 2011-08-31T15:29:46Z.
Subjects:
Online Access:Get fulltext
LEADER 01687 am a22002173u 4500
001 65564
042 |a dc 
100 1 0 |a Undurti, Aditya  |e author 
100 1 0 |a Massachusetts Institute of Technology. Department of Aeronautics and Astronautics  |e contributor 
100 1 0 |a How, Jonathan P.  |e contributor 
100 1 0 |a Undurti, Aditya  |e contributor 
100 1 0 |a How, Jonathan P.  |e contributor 
700 1 0 |a How, Jonathan P.  |e author 
245 0 0 |a An online algorithm for constrained POMDPs 
260 |b Institute of Electrical and Electronics Engineers,   |c 2011-08-31T15:29:46Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/65564 
520 |a This work seeks to address the problem of planning in the presence of uncertainty and constraints. Such problems arise in many situations, including the basis of this work, which involves planning for a team of first responders (both humans and robots) operating in an urban environment. The problem is framed as a Partially-Observable Markov Decision Process (POMDP) with constraints, and it is shown that even in a relatively simple planning problem, modeling constraints as large penalties does not lead to good solutions. The main contribution of the work is a new online algorithm that explicitly ensures constraint feasibility while remaining computationally tractable. Its performance is demonstrated on an example problem and it is demonstrated that our online algorithm generates policies comparable to an offline constrained POMDP algorithm. 
520 |a United States. Office of Naval Research (grant N00014-07-1-0749) 
546 |a en_US 
655 7 |a Article 
773 |t 2010 IEEE International Conference on Robotics and Automation (ICRA)