Solving peg solitaire by reinforcement learning

In this repo we have built a reinforcement learning system according to the Actor-Critic model, and applied it to Peg Solitaire. The overall architecture is shown in the figure below. The agent (the reinforcement learner) consists of an actor and a critic, where the actor holds the action policy and adjusts this policy by receiving messages (the Temporal Difference error) from the critic. The agent is built as a stand-alone general purpose model which can be applied to any kind of game environment. Here it is used to solve Peg Solitaire. As shown in the figure, the game is represented by a Simulated World that holds the rules of the game and knows the transitions between states by actions.


The basic components of an Actor-Critic reincforcement learning system for playing Peg Solitaire. The red dotted arrows represents communication related to learning.

Results

The goal of the game is to empty the entire board except for one peg, by making certain valid moves. In this case, the remaining peg does not need to be in the middle. The figures below shows the results of one complete round of training (3200 games played). The two leftmost figures shows the progression of learning by plotting the number of remaining pegs after each game played. The rightmost animation shows how the last game in the training was played out, to showcase the skills of the trained agent.

Progression of learning	Averaged progression	Last game

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
Agent		Agent
Helpers		Helpers
LearningAlgorithm		LearningAlgorithm
SimWorld		SimWorld
configs		configs
images		images
.gitignore		.gitignore
README.md		README.md
game.py		game.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Solving peg solitaire by reinforcement learning

Results

Eliot & Markus

About

Releases

Packages

Languages

markusmkim/RL-PegSolitaire

Folders and files

Latest commit

History

Repository files navigation

Solving peg solitaire by reinforcement learning

Results

Eliot & Markus

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages