Skip to content

Q-Learning, Actor-Critics, Proximal-Policy Optimization reinforcement learning algorithms in bipedal walker-v2 environment

License

Notifications You must be signed in to change notification settings

mnguyen0226/bipedal_robotics_rl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

68 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Performance Comparison and Analysis Between Q-Learning, A2C with Generalized Advantage Estimation, and PPO with Generalized Advantage Estimation in BipedalWalker-v2

About

In recent years, reinforcement learning (RL) algorithms have been implemented in several robotics and control systems applications. Several RL techniques are used to achieve basic autonomous controls, path-findings, vision tracker, and intelligent decision. Stabilizing bipedal walking robot is one of the challenging problems. In this paper, I will experiment and evaluate the three reinforcement learning algorithms to solve the simulated bipedal walking problem. Without any prior knowledge of its surrounding environment, the agent is able to demonstrate successful walking ability through trial and error via Advantage Actor-Critic (A2C) and Proximal Policy Optimization (PPO). The results show that A2C and PPO with different bias estimation rates are capable of solving the bipedal walking problem.

alt text

Reproducibility

  • Fork the project and enter the directory: $cd soo_non_convex_ml
  • $cd rl_value_based_vs_value_policy_based
  • To run experiments on all three algorithms Q-Learning, A2C, PPO: python src/main.py
  • To specify algorithms in src/main.py:
# train q leanring
q_learning_main()

# train a2c gae
# a2c_main()

# train ppo gae
# ppo_main()

Recorded Agents

Q-Learning Performance

alt text

A2C with GAE Performance

alt text

PPO with GAE Performance

alt text

Proposal & Report

References

About

Q-Learning, Actor-Critics, Proximal-Policy Optimization reinforcement learning algorithms in bipedal walker-v2 environment

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published