Comparative Analysis of A3C and PPO Algorithms in Reinforcement Learning: A Survey on General Environments
This research article presents a comparison between two mainstream Deep Reinforcement Learning (DRL) algorithms, Asynchronous Advantage Actor-Critic (A3C) and Proximal Policy Optimization (PPO), in the context of two diverse environments: CartPole and Lunar Lander. DRL algorithms are widely known fo...
| Published in: | IEEE Access |
|---|---|
| Main Authors: | , , |
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2024-01-01
|
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10703056/ |
