# Lux AI Season 2 (luxui-2022)
This project showcases the code that achieved 49th place (out of 646 participants) in the Lux AI Kaggle competition.
You can see the engine in action in this replay against a pure ML engine.
It uses a mix of a rule-based engine and a machine-learning one, developed in parallel.
The neural network body consisted of a convolutional ResNet architecture with squeeze-excitation layers. The network blocks used 128-channel 5x5 convolutions and included two types of normalization. The network had four outputs, three of which were actor outputs: 32x32xN-actions tensors for workers and city tiles. The final network consisted of 16 residual blocks, plus the input encoder and output layers, for a grand total of 3 million parameters.
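For illustration, here is a minimal sketch of one such residual block with squeeze-excitation, assuming PyTorch. The channel count and kernel size follow the description above; the two normalization types used in the real model are not specified here, so `BatchNorm2d` is used as a placeholder, and the class names are mine.

```python
# Minimal sketch of one 128-channel residual block with squeeze-excitation (assumed PyTorch).
# BatchNorm2d stands in for the (unspecified) normalization layers of the actual model.
import torch
import torch.nn as nn


class SqueezeExcitation(nn.Module):
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.fc1 = nn.Linear(channels, channels // reduction)
        self.fc2 = nn.Linear(channels // reduction, channels)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Global average pool -> bottleneck MLP -> per-channel gating.
        w = x.mean(dim=(2, 3))
        w = torch.relu(self.fc1(w))
        w = torch.sigmoid(self.fc2(w))
        return x * w[:, :, None, None]


class ResidualBlock(nn.Module):
    def __init__(self, channels: int = 128):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=5, padding=2)
        self.norm1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=5, padding=2)
        self.norm2 = nn.BatchNorm2d(channels)
        self.se = SqueezeExcitation(channels)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        y = torch.relu(self.norm1(self.conv1(x)))
        y = self.norm2(self.conv2(y))
        y = self.se(y)
        return torch.relu(x + y)
```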
For reinforcement learning, I used a frozen teacher model to perform inference on all states, and added a KL loss term penalizing the divergence of the current model's policy from that of the teacher.
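A rough sketch of how such a teacher KL term can be computed, assuming PyTorch; the function name and the `kl_coef` weighting are illustrative, not the actual training code.

```python
# Sketch of a teacher-KL regularization term (assumed PyTorch).
# `teacher_logits` comes from the frozen teacher's forward pass on the same states;
# `kl_coef` is a hypothetical weighting coefficient added to the RL loss.
import torch
import torch.nn.functional as F


def kl_to_teacher(student_logits: torch.Tensor,
                  teacher_logits: torch.Tensor,
                  kl_coef: float = 0.5) -> torch.Tensor:
    # KL(teacher || student), averaged over the batch.
    teacher_log_probs = F.log_softmax(teacher_logits.detach(), dim=-1)
    student_log_probs = F.log_softmax(student_logits, dim=-1)
    kl = F.kl_div(student_log_probs, teacher_log_probs,
                  log_target=True, reduction="batchmean")
    return kl_coef * kl
```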
The approach taken was largely stateless. I iterate across all the units in multiple passes, scoring the actions each unit may take with the context of what other units/cities are doing. If there is a clash, the affected units are rerun with knowledge of what caused the clash. Pathfinding is standard A* with weights to bias toward or away from certain things. I also built tables of path-length approximations avoiding cities, computed with Dijkstra, which serve as fast lookups so the full pathfinding does not have to run too frequently. There are a few places with state, mostly to hack around unstable scoring. For example, there is specific code designed to reach new clusters. Scoring exactly where to move came out fairly unstable, so there is a bias toward doing whatever was planned last tick. I still get a bit of "dancing", though, where performing an action changes the weights and the unit wants to go the opposite direction next tick. Maybe more statefulness would stabilize this.
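As a concrete illustration of the precomputed lookup tables, here is a minimal Dijkstra sketch over a grid; the grid size, the `blocked` set (e.g. city tiles to route around), and the function name are assumptions for illustration, not the repository's actual code.

```python
# Sketch: precompute path-length approximations from one cell with Dijkstra,
# so per-tick decisions can use cheap lookups instead of full A* runs.
import heapq
from typing import Dict, Set, Tuple

Cell = Tuple[int, int]


def dijkstra_distances(start: Cell, width: int, height: int,
                       blocked: Set[Cell], step_cost: float = 1.0) -> Dict[Cell, float]:
    """Return approximate path lengths from `start` to every reachable cell."""
    dist: Dict[Cell, float] = {start: 0.0}
    heap = [(0.0, start)]
    while heap:
        d, (x, y) = heapq.heappop(heap)
        if d > dist.get((x, y), float("inf")):
            continue  # stale heap entry
        for nx, ny in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
            if not (0 <= nx < width and 0 <= ny < height):
                continue
            if (nx, ny) in blocked:
                continue
            nd = d + step_cost
            if nd < dist.get((nx, ny), float("inf")):
                dist[(nx, ny)] = nd
                heapq.heappush(heap, (nd, (nx, ny)))
    return dist


# Precompute once, then look up distances without re-running pathfinding:
# distances = dijkstra_distances((3, 4), 32, 32, blocked=city_tiles)
```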
The key to strong performance was an engine that switched between the two types of approach (ML and rule-based) using a very specific multi-dimensional KPI (see the sketch after this list):
- Turn number
- Distance from enemy
- Density of resources around the unit.
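As a rough illustration of that switching logic, the sketch below combines the three KPI dimensions into a single decision; the threshold values and the function name are hypothetical, not the tuned values used in the competition.

```python
# Hypothetical sketch of switching between the rule-based and ML engines
# based on the KPI dimensions listed above.
def choose_engine(turn: int, enemy_distance: int, resource_density: float) -> str:
    """Return which engine should drive this unit on this tick."""
    if turn < 40:
        return "rule_based"   # early game: deterministic opening logic
    if enemy_distance <= 3 or resource_density < 0.2:
        return "ml"           # contested or sparse areas: learned policy
    return "rule_based"
```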
The fact that there are two engines (rule-based and ML) plus a super-engine on top creates a three-dimensional testing space. In other words, you have a combination of versions across the different engines that work together, and finding the maximum of that surface has a far more complex structure.