This repository contains the reinforcement learning agents I trained for the Deep Reinforcement Learning Course from Hugging Face.
Agent trained using PPO as implemented in Stable-Baselines3.
replay.mp4
Agent trained using Q-learning implemented from scratch.
replay.mp4
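For readers unfamiliar with the method, here is a minimal sketch of tabular Q-learning. It is not the code used for this agent: the corridor environment, the function name `train_q_learning`, and all hyperparameter values are hypothetical, chosen only to keep the example self-contained.

```python
import random


def train_q_learning(n_states=6, episodes=500, alpha=0.1, gamma=0.95,
                     epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical corridor environment.

    The agent starts in state 0 and receives a reward of 1 on reaching
    the last state. Actions: 0 = move left, 1 = move right.
    """
    rng = random.Random(seed)
    n_actions = 2
    goal = n_states - 1
    # Q-table: one row per state, one column per action, initialised to 0.
    q = [[0.0] * n_actions for _ in range(n_states)]

    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap on episode length
            # Epsilon-greedy action selection, breaking ties at random.
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)
            else:
                best = max(q[state])
                action = rng.choice(
                    [a for a in range(n_actions) if q[state][a] == best]
                )

            # Environment step (assumed dynamics, for illustration only).
            next_state = max(0, state - 1) if action == 0 else state + 1
            done = next_state == goal
            reward = 1.0 if done else 0.0

            # Q-learning update: move Q(s, a) towards the bootstrapped target.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])

            state = next_state
            if done:
                break
    return q
```

After training, the greedy action in each state is the index of the largest value in that state's row of the table.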
Agent trained using a custom policy gradient implementation in PyTorch.
replay.mp4
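The description above does not say which policy gradient variant was used. Assuming a REINFORCE-style method, one core step is computing the discounted return for every timestep of an episode. The sketch below shows that step in plain Python; the helper name `discounted_returns` and the default `gamma` are illustrative, not taken from this repository.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * r_{t+1} + gamma^2 * r_{t+2} + ... for each t."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the one after it.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns
```

In a REINFORCE-style update these returns weight the log-probabilities of the actions taken, so actions followed by higher returns become more likely.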
Agent trained the same way as in CartPole, the difference being that the input was an image of the game.
replay.mp4
This agent was trained using the Stable-Baselines3 implementation of A2C (Advantage Actor-Critic).