Contributions are welcome
- Deep Q Network
- Dueling Q Network
- Policy Gradient: REINFORCE
- Advantage Actor-Critic
- Deep Deterministic Policy Gradient
- Asynchronous Advantage Actor-Critic (A3C)
- Estimate the concrete performance of each algorithm
MIT Licence