Original PyTorch implementation of the ACE Planner
ACE Planner is a model-based deep reinforcement learning algorithm that seamlessly integrates
online planning and off-policy agent learning for sample-efficient exploration.
Drawing on theoretical insights gained from the performance analysis of an
Ensure MuJoCo is installed If you want to run the Adroit, Meta-World, and Maze2D experiments, please be sure that the environmental dependencies are satisfied.
After installing dependencies, you can train an agent by calling
python src/train_icem_drnn.py task=acrobot-swingup
The training script supports both local logging as well as cloud-based logging with Weights & Biases.
To use W&B, provide a key by setting the environment variable WANDB_API_KEY=<YOUR_KEY>
and add your W&B project and entity details to cfgs/default.yaml
.
ACE planer is licensed under the MIT license. MuJoCo and DeepMind Control Suite are licensed under the Apache 2.0 license. We thank the TD-MPC authors for their implementation of the training and logging frameworks.