Learning to Plan via Deep Optimistic Value Exploration

Deep exploration requires coordinated long-term planning. We present a model-based reinforcement learning algorithm that guides policy learning through a value function that exhibits optimism in the face of uncertainty. We capture uncertainty over values by combining predictions from an ensemble of...

Bibliographic Details
Main Authors:	Seyde, Tim (Author), Schwarting, Wilko (Author), Karaman, Sertac (Author), Rus, Daniela L (Author)
Other Authors:	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory (Contributor), Massachusetts Institute of Technology. Laboratory for Information and Decision Systems (Contributor)
Format:	Article
Language:	English
Published:	2020-05-11T19:59:29Z.
Subjects:	Article

Similar Items

Stochastic Dynamic Games in Belief Space
by: Schwarting, Wilko, et al.
Published: (2022)

Semi-Cooperative Control for Autonomous Emergency Vehicles
by: Buckman, Noam, et al.
Published: (2022)

Sharing is Caring: Socially-Compliant Autonomous Intersection Negotiation
by: Buckman, Noam, et al.
Published: (2020)

Parallel Autonomy in Automated Vehicles: Safe Motion Generation with Minimal Intervention
by: Schwarting, Wilko, et al.
Published: (2017)

Variational Autoencoder for End-to-End Control of Autonomous Driving with Novelty Detection and Training De-biasing
by: Amini, Alexander, et al.
Published: (2018)

Dynamic Risk Density for Autonomous Navigation in Cluttered Environments without Object Detection
by: Pierson, Alyssa, et al.
Published: (2020)

Joint Multi-Policy Behavior Estimation and Receding-Horizon Trajectory Planning for Automated Urban Driving
by: Zhou, Bingyu, et al.
Published: (2020)

PDDLStream: Integrating symbolic planners and blackbox samplers via optimistic adaptive planning
by: Garrett, Caelan Reed, et al.
Published: (2021)

Compositional and Contract-based Verification for Autonomous Driving on Road Networks
by: DeCastro, Jonathan, et al.
Published: (2018)

Learning and flow control in optimistic simulation
by: Solomon, Luiza
Published: (2002)

Shared Linear Quadratic Regulation Control: A Reinforcement Learning Approach
by: Abu-Khalaf, Murad, et al.
Published: (2021)

Optimistic and Pessimistic Result of Planning and Scheduling Dynamic Processes
by: Wieslaw Wajs
Published: (1999-01-01)

The Optimists' Birthday

Optimistic computation
by: Bubenik, Richard G.
Published: (2009)

Recording the optimistic
by: Leonor Matos Silva
Published: (2017-03-01)

How overconfident and optimistic manager will affect firm value
by: Shih, Wei Chu, et al.
Published: (2010)

Optimistically Engaging in the Present
by: Eric A. Fenkl, et al.
Published: (2014-07-01)

Uncovering and Mitigating Algorithmic Bias through Learned Latent Structure
by: Soleimany, Ava, et al.
Published: (2019)

Optimal planning with temporal logic specifications
by: Karaman, Sertac
Published: (2010)

Optimistic gittins indices
by: Gutin, Eli, et al.
Published: (2020)

Are E-values too optimistic or too pessimistic? Both and neither!
by: Greenland, S., et al.
Published: (2022)

Laughing rats are optimistic.
by: Rafal Rygula, et al.
Published: (2012-01-01)

Optimistic semantic synchronization
by: Sreeram, Jaswanth
Published: (2012)

Microclustered optimistic simulation
by: Bradley, Colin Bueth
Published: (2008)

Optimistic Sampling Strategy for Data-Efficient Reinforcement Learning
by: Dongfang Zhao, et al.
Published: (2019-01-01)

Sampling-based algorithms for optimal path planning problems
by: Karaman, Sertac
Published: (2013)

Minimum-violation scLTL motion planning for mobility-on-demand
by: Tumova, Jana, et al.
Published: (2018)

Multi-Vehicle Motion Planning for Social Optimal Mobility-on-Demand
by: Karlsson, Jesper, et al.
Published: (2018)

Minimum-violation LTL planning with conflicting specifications
by: Tumova, Jana, et al.
Published: (2013)

Scaling Techno-Optimistic Visions
by: Seyram Avle, et al.
Published: (2020-05-01)

Persistent Monitoring of Events With Stochastic Arrivals at Multiple Stations
by: Yu, Jingjin, et al.
Published: (2016)

Deep brain stimulation in the media: over-optimistic portrayals call for a new strategy involving journalists and scientists in ethical debates.
by: Frédéric Gilbert, et al.
Published: (2011-05-01)

Learning from academically optimistic teachers: supporting teacher academic optimism.

The optimistic pursuit of plastic surgery training
by: Gary Masterton, et al.
Published: (2021-03-01)

Professor BS Chavan: The ever optimist
by: Nitin Gupta
Published: (2020-01-01)

Fishes and cowboy boots: An optimistic view
by: Pedro de Araujo Lima Constantino, et al.
Published: (2020-09-01)

U.S. Productivity Growth: An Optimistic Perspective
by: Martin Neil Baily, et al.
Published: (2013-04-01)

The Rational Optimist: How Prosperity Evolves
by: Matt Ridley
Published: (2012-05-01)

The emergence and evolution of optimistic expectations in schoolchildren
by: Carolina Falcón, et al.

Optimistic Fair Exchange of Digital Signature
by: Ying-Hao Lee, et al.
