Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ジニ係数とジニ不純度の違いについて #73

Open
yushiro-yamashita opened this issue Jun 24, 2021 · 1 comment
Open

ジニ係数とジニ不純度の違いについて #73

yushiro-yamashita opened this issue Jun 24, 2021 · 1 comment

Comments

@yushiro-yamashita
Copy link
Collaborator

yushiro-yamashita commented Jun 24, 2021

決定木のページで「Gini index」に対する訳語として「ジニ係数」を当てていますが、一般的には「ジニ係数」と言うと経済学用語を指すことが多いようで、それは決定木に使われる「ジニ不純度」とは似て非なる概念らしいです。
参考:https://qiita.com/murs313/items/0a2077f97664b58c2ea9
https://satopirka.com/2018/09/decision-tree-implementation-with-numpy/
https://stats.stackexchange.com/questions/445978/what-is-the-difference-between-gini-index-and-gini-coefficient

原文が「Gini index」のためジニ係数と訳すのが素直な気はしますが、日本語読者の混乱を避けるために「ジニ不純度」に書き換えた方がいいかもしれません。

@yushiro-yamashita
Copy link
Collaborator Author

yushiro-yamashita commented Jun 24, 2021

軽く検索した感じだと、「Gini coefficient」→ジニ係数、「Gini impurity」→ジニ不純度、「Gini index」→どちらを指す場合もある という感じですね

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant