Average-Case Performance of Rollout Algorithms for Knapsack Problems

Rollout algorithms have demonstrated excellent performance on a variety of dynamic and discrete optimization problems. Interpreted as an approximate dynamic programming algorithm, a rollout algorithm estimates the value-to-go at each decision stage by simulating future events while following a heuri...

Full description

Bibliographic Details
Main Authors:	Mastin, Andrew (Contributor), Jaillet, Patrick (Contributor)
Other Authors:	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor), Massachusetts Institute of Technology. Laboratory for Information and Decision Systems (Contributor)
Format:	Article
Language:	English
Published:	Springer-Verlag, 2015-12-18T15:07:17Z.
Subjects:	Article
Online Access:	Get fulltext


LEADER	02140 am a22002413u 4500
001	100430
042			\|a dc
100	1	0	\|a Mastin, Andrew \|e author
100	1	0	\|a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science \|e contributor
100	1	0	\|a Massachusetts Institute of Technology. Laboratory for Information and Decision Systems \|e contributor
100	1	0	\|a Mastin, Andrew \|e contributor
100	1	0	\|a Jaillet, Patrick \|e contributor
700	1	0	\|a Jaillet, Patrick \|e author
245	0	0	\|a Average-Case Performance of Rollout Algorithms for Knapsack Problems
260			\|b Springer-Verlag, \|c 2015-12-18T15:07:17Z.
856			\|z Get fulltext \|u http://hdl.handle.net/1721.1/100430
520			\|a Rollout algorithms have demonstrated excellent performance on a variety of dynamic and discrete optimization problems. Interpreted as an approximate dynamic programming algorithm, a rollout algorithm estimates the value-to-go at each decision stage by simulating future events while following a heuristic policy, referred to as the base policy. While in many cases rollout algorithms are guaranteed to perform as well as their base policies, there have been few theoretical results showing additional improvement in performance. In this paper, we perform a probabilistic analysis of the subset sum problem and 0-1 knapsack problem, giving theoretical evidence that rollout algorithms perform strictly better than their base policies. Using a stochastic model from the existing literature, we analyze two rollout methods that we refer to as the exhaustive rollout and consecutive rollout, both of which employ a simple greedy base policy. We prove that both methods yield a significant improvement in expected performance after a single iteration of the rollout algorithm, relative to the base policy.
520			\|a National Science Foundation (U.S.) (Grant 1029603)
520			\|a United States. Office of Naval Research (Grant N00014-12-1-0033)
520			\|a National Science Foundation (U.S.). Graduate Research Fellowship
546			\|a en_US
655	7		\|a Article
773			\|t Journal of Optimization Theory and Applications

Average-Case Performance of Rollout Algorithms for Knapsack Problems

Similar Items