Comparative Analysis of A3C and PPO Algorithms in Reinforcement Learning: A Survey on General Environments

This research article presents a comparison between two mainstream Deep Reinforcement Learning (DRL) algorithms, Asynchronous Advantage Actor-Critic (A3C) and Proximal Policy Optimization (PPO), in the context of two diverse environments: CartPole and Lunar Lander. DRL algorithms are widely known fo...

Bibliographic Details
Published in: IEEE Access
Main Authors: Alberto del Rio, David Jimenez, Javier Serrano
Format: Article
Language: English
Published: IEEE 2024-01-01
Online Access: https://ieeexplore.ieee.org/document/10703056/