What effect are the Lambda and Mu? #140
Thank you @CGLemon for your interest and your help. The sigmoid is useful in itself, as it allows SAI to play with variable komi. In addition, the parameters λ (lambda) and μ (mu) modify SAI's behaviour by activating, when nonzero, what we call the SAI agent. The SAI agent uses the sigmoid to look also at komi values different from the real one. So the purpose of the SAI agent is to let SAI virtually consider different komi values, in particular when it is ahead or behind. When LZ or AGZ is very far ahead, most move choices have a flat 0.99 winrate, so it becomes difficult to distinguish between moves that lose points and moves that do not. The SAI agent helps with that, because these moves have very different winrates in the range of komi the agent considers. When LZ or AGZ is very far behind (consider handicap games), most move choices have a flat 0.01 winrate, and in the same way the agent may help to recover points, at least against a human or a weak engine.
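To make the "flat winrate" point concrete, here is a minimal Python sketch. It is not SAI's code: the function names, the sigmoid parametrization (an expected score margin `alpha` and a steepness `beta`), the komi range, and all numbers are assumptions chosen for illustration. How λ and μ actually select the range of komi is not described in this thread and is not modeled here.

```python
import math

def winrate(alpha, beta, bonus=0.0):
    """Hypothetical sigmoid winrate: alpha is the expected score margin for
    the side to move, beta the steepness, bonus a virtual komi shift."""
    return 1.0 / (1.0 + math.exp(-beta * (alpha + bonus)))

def averaged_winrate(alpha, beta, lo, hi, steps=200):
    """Average the sigmoid winrate over virtual komi shifts in [lo, hi]."""
    total = 0.0
    for i in range(steps + 1):
        bonus = lo + (hi - lo) * i / steps
        total += winrate(alpha, beta, bonus)
    return total / (steps + 1)

# Two candidate moves when far ahead: move B loses 5 points compared to move A.
beta = 0.5               # assumed steepness
alpha_a, alpha_b = 30.0, 25.0

# At the real komi both winrates are saturated and nearly indistinguishable.
flat_gap = winrate(alpha_a, beta) - winrate(alpha_b, beta)

# Averaged over less favourable virtual komi values, the moves separate clearly.
avg_gap = (averaged_winrate(alpha_a, beta, -40.0, 0.0)
           - averaged_winrate(alpha_b, beta, -40.0, 0.0))

print(f"gap at real komi:       {flat_gap:.6f}")
print(f"gap averaged over komi: {avg_gap:.6f}")
```

With these assumed numbers the gap at the real komi is far below what a search could reliably resolve, while the averaged gap is large. The same reasoning applies in mirror image when the engine is far behind and every move sits near a 0.01 winrate.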
Thank you very much @CGLemon for the message.
Yes, we plan to publish a paper with the results on the 19x19 board, including details on the improvements since the 9x9 paper. Moreover, to show that the SAI framework is general enough to be applied to games other than Go, we started an Othello-SAI project. Results are very promising: AlphaZero-like software appears to make suboptimal moves in the endgame of Othello, just like in Go, and these suboptimal moves are not played by SAI!
SAI Team:
First, thank you for your research, for providing a free, strong Go engine, and for giving us a different idea of how to implement one.
Since the 2019 SAI paper, "SAI: a Sensible Artificial Intelligence that plays with handicap and targets high scores in 9x9 Go", SAI has changed considerably. One main difference is the addition of Lambda and Mu to the sigmoid bonus. I do not understand why you did that. What is the core idea behind MCTS with Lambda and Mu? Is it a significant advancement?
In addition, do you plan to publish another paper? I am interested in the details of every improvement, such as Average FPU, the KLE Network, or adapting SAI to large handicaps and high komi. I can understand the basic methods by following the code, but I cannot really understand the core ideas and their other effects. It would be helpful to me.
Thank you very much!
-- Hung Zhe, Lin