Pytorch Reinforcement Learning

This repository contains the code for policy gradient algorithm incorporating with credit assignment mechanism.

Install Dependencies

pip install torch torchvision

pip install tensorflow=2.2

or

pip install tensorflow-gpu=2.2

git clone https://github.com/openai/baselines.git -b tf2 && \
cd baselines && \
pip install -e .

Note: I haven't tested the code on Tensorflow 1 yet but it should work as well.

pip install 'gym[atari]'

Install Park Platform. I modified the platform slightly to make it compatible with OpenAI's baseline.

git clone https://github.com/lehduong/park -b openai_baseline &&\
cd park && \
pip install -e .

python main.py --algo a2c --env-name PongNoFrameskip-v4

The started code is based on ikostrikov's repository

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
assets		assets
core		core
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
conda_env.yml		conda_env.yml
enjoy.py		enjoy.py
evaluation.py		evaluation.py
main.py		main.py