Developed various model-based and model-free Intelligent and Naive algorithms for the beam balance environment in OpenAI Gym.
deep-reinforcement-learning epsilon-greedy-exploration boltzman-policy-reward variational-pid-controller
-
Updated
Mar 29, 2021 - Jupyter Notebook