Skip to content

Latest commit

 

History

History
37 lines (32 loc) · 2.12 KB

README.md

File metadata and controls

37 lines (32 loc) · 2.12 KB

ZXreinforce

This project contains the code used to produce the results in the two publications: "Optimizing ZX-Diagrams with Deep Reinforcement Learning" (https://doi.org/10.1088/2632-2153/ad76f7) and "Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning" (https://arxiv.org/abs/2405.13609).

  • Main code of the algorithm is in zxreinforce

For results relating to "Optimizing ZX-Diagrams with Deep Reinforcement Learning" see:

  • A script showing how to train an agent is at experiments/train_rl_agent/runner_final.py.
  • The agent's training progress can be monitored with experiments/evaluation_rl_agent/evaluation_training_logger.ipynb
  • An example notebook showing how to simplify a diagram with the trained agent is at experiments/evaluation_rl_agent/simplify_example_traj.ipynb.
  • Scripts to compare the performance of the RL agent to a greedy strategy, simulated annealing, and PyZX are in experiments/evaluation_performance.
  • The evaluation of the Copy action is done in experiments/eval_copy_action.
  • The evaluation of the action dependence on the local environment is done in experiments/prob_vs_layer.
  • The network weights of the agents trained for the ablation studies can be found in saved_agents.

For results relating to "Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning" see:

  • A script showing how to train an agent is at ncmdp_experiments/runner_max_20s_bench.py.
  • Trained agent weights and training progress logs are in ncmdp_experiments/runs.
  • The results are analyzed in ncmdp_experiments/evalt_result.ipynb.

To run this code install the requirements.txt or use the docker image as described in https://github.com/alexandrupaler/zxreinforce_small.

Citation

@article{Nagele_2024_Opt,
doi = {10.1088/2632-2153/ad76f7},
url = {https://dx.doi.org/10.1088/2632-2153/ad76f7},
year = {2024},
month = {sep},
publisher = {IOP Publishing},
volume = {5},
number = {3},
pages = {035077},
author = {Maximilian N\"agele and Florian Marquardt},
title = {Optimizing {ZX}-diagrams with deep reinforcement learning},
journal = {Machine Learning: Science and Technology},
}