The goal of the agent is to control a double-jointed arm and have its hand follow a goal target. The agent receives information about the position, rotation, velocity, and angular velocity of the arm, for a total of 33 variables, i.e., the agent is dealing with a 33-dimensional state space. It controls the torques of the two joints via a 4-dimensional action where each value lies in the interval [-1, 1].

The agent receives a reward of +0.1 for each time step it manages to keep the arm's hand within the goal target. The task is considered solved when the agent achieves an average score of at least +30 over 100 consecutive episodes. (This implementation runs 20 agents simultaneously; an episode's score is the mean over the 20 agents, and that score has to average at least +30 over 100 consecutive episodes.)
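As a quick sanity check, these dimensions can be read off the environment through the unityagents package from the course setup. A minimal sketch, assuming the macOS multi-agent build Reacher.app sits in the project folder:

```python
from unityagents import UnityEnvironment

# Adjust file_name to the build you downloaded for your OS.
env = UnityEnvironment(file_name="Reacher.app")
brain_name = env.brain_names[0]
brain = env.brains[brain_name]

env_info = env.reset(train_mode=True)[brain_name]
print(env_info.vector_observations.shape)  # (20, 33): 20 agents, 33-dim states
print(brain.vector_action_space_size)      # 4: torque values, each in [-1, 1]
env.close()
```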
Clone this repository and install the required packages as per the instructions below.
Follow the instructions in the Udacity Deep Reinforcement Learning repository on how to set up the drlnd environment, and then also install the Click package (used for handling command-line arguments):

```
pip install click
```
Alternatively, on some systems it might be enough to install the required packages from the provided requirements.txt file:

```
pip install -r requirements.txt
```
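For readers unfamiliar with Click: the command-line interfaces of train.py and run.py shown below are declared with it. A minimal sketch of the pattern (the option names mirror the help output later in this README; the function body is hypothetical):

```python
import click

@click.command()
@click.option("--environment", type=click.Path(exists=True), required=True,
              help="Path to Unity environment")
@click.option("--seed", type=int, default=None, help="Random seed")
def main(environment, seed):
    # Hypothetical body: the real train.py wires these options into training.
    click.echo(f"environment={environment}, seed={seed}")

if __name__ == "__main__":
    main()
```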
For training:
Download the multi-agent Unity environment appropriate for your operating system using the links below and unzip it into the project folder.
- Linux: click here
- Mac OSX: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
For visualizing the agent:
Download the single-agent Unity environment appropriate for your operating system using the links below and unzip it into the project folder.
- Linux: click here
- Mac OSX: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
To train the agent, use the train.py program, which takes the path to the Unity environment and, optionally, locations of output files and/or a random seed.
```
(drlnd) $ python train.py --help
Usage: train.py [OPTIONS]

Options:
  --environment PATH     Path to Unity environment  [required]
  --plot-output PATH     Output file for score plot
  --scores-output PATH   Output file for scores
  --weights-output PATH  File to save weights to after success
  --seed INTEGER         Random seed
  --help                 Show this message and exit.
```
For example:
```
(drlnd) $ python train.py --environment=Reacher.app --seed=20190415
```
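To make the solved criterion concrete, here is a self-contained sketch (not the repository's code) of the rolling-average check: each episode's score is the mean over the 20 agents, and training counts as successful once the mean of the last 100 episode scores reaches +30.

```python
import numpy as np
from collections import deque

def first_solved_episode(per_episode_agent_scores, window=100, target=30.0):
    """Return the first episode whose trailing 100-episode mean reaches the
    target, or None if it is never reached."""
    recent = deque(maxlen=window)
    for episode, agent_scores in enumerate(per_episode_agent_scores, start=1):
        recent.append(np.mean(agent_scores))  # average over the 20 agents
        if len(recent) == window and np.mean(recent) >= target:
            return episode
    return None

# Synthetic example: 150 episodes, each with 20 per-agent scores.
rng = np.random.default_rng(0)
fake_scores = [rng.uniform(25.0, 40.0, size=20) for _ in range(150)]
print(first_solved_episode(fake_scores))  # prints 100 for this synthetic data
```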
After successfully training the agent, use the run.py program to load the saved weights and run the simulation; it takes parameters similar to those of the training program:
```
(drlnd) $ python run.py --help
Usage: run.py [OPTIONS]

Options:
  --environment PATH  Path to Unity environment  [required]
  ...
  --help              Show this message and exit.
```
Note that running the agent requires the single-agent Unity environment.
For example:
```
(drlnd) $ python run.py --environment=Reacher.app --weights-input weights.pth
```
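For orientation, the core of such a visualization loop typically looks like the sketch below, assuming the unityagents API from the course setup; the random act function is a hypothetical stand-in for the trained policy that run.py restores from weights.pth:

```python
import numpy as np
from unityagents import UnityEnvironment

env = UnityEnvironment(file_name="Reacher.app")  # single-agent build
brain_name = env.brain_names[0]
brain = env.brains[brain_name]

def act(state):
    # Stand-in policy for illustration: random torques in [-1, 1].
    # run.py would instead act with the network loaded from weights.pth.
    return np.clip(np.random.randn(brain.vector_action_space_size), -1.0, 1.0)

env_info = env.reset(train_mode=False)[brain_name]  # train_mode=False renders
state = env_info.vector_observations[0]
score = 0.0
while True:
    env_info = env.step(act(state))[brain_name]
    state = env_info.vector_observations[0]
    score += env_info.rewards[0]
    if env_info.local_done[0]:
        break
print(f"Score: {score:.2f}")
env.close()
```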