Applied various Reinforcement Learning (RL) algorithms to determine the optimal policy for diverse Markov Decision Processes (MDPs) specified within the OpenAI Gym library
-
Updated
Dec 18, 2023 - Python
Applied various Reinforcement Learning (RL) algorithms to determine the optimal policy for diverse Markov Decision Processes (MDPs) specified within the OpenAI Gym library
Some RL algorithm implementation (PyTorch) (Colab executable)
Solving OpenAI's "Lunar Lander" with Reinforcement Learning
Implementation of two Deep Reinforcement Learning agents for solving the lunar-lander environment.
Add a description, image, and links to the reinforce-with-baseline topic page so that developers can more easily learn about it.
To associate your repository with the reinforce-with-baseline topic, visit your repo's landing page and select "manage topics."