-
-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ENH] Add option to permute per forest fraction #145
Conversation
Signed-off-by: Adam Li <adam2392@gmail.com>
Codecov ReportAttention:
Additional details and impacted files@@ Coverage Diff @@
## main #145 +/- ##
==========================================
+ Coverage 88.86% 89.06% +0.20%
==========================================
Files 41 41
Lines 3439 3531 +92
==========================================
+ Hits 3056 3145 +89
- Misses 383 386 +3
☔ View full report in Codecov by Sentry. |
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
… into might-params
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
I'll let #143 get merged first, so I can test out changes with this new functionality |
… into might-params
Signed-off-by: Adam Li <adam2392@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The tests didn't pass because some indices were not stratified during testing. I also feel the samples
variable is used too many times, which could cause confusion. In classifier _statistic
, samples
could mean test indices or non-nan indices.
Should we close this? |
It's a simple addition that's backwards compatible so I think I can just finish adding it. |
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
Is this mergable? @PSSF23 to preserve the old behavior, just set |
@adam2392 We should keep the same behavior for the |
Currently, permute_per_tree default for Sam is With this PR, permute_per_forest_fraction = None would have the same default. Is this what you mean? |
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@adam2392 Actually it might not be relevant to Sam at the current stage of implementation as he's only running statistics.
My original thought is to preserve any current use of permute_per_tree
and treat permute_per_forest_fraction
as add-on to it. But as the new parameter fully covers the original use, I think it's fine to upgrade. Can you fix the error in the regressor test?
Signed-off-by: Adam Li <adam2392@gmail.com>
Signed-off-by: Adam Li <adam2392@gmail.com>
Discussed in Forest meeting today:
Changes proposed in this pull request:
permute_per_forest_fraction
parameter that permutes the covariate_index a controlled number of times over the entire forest (rather than per tree)X
, or large forests, or many repeated jobs ofFeatureImportance*Classifier
.Before submitting
section of the
CONTRIBUTING
docs.Writing docstrings section of the
CONTRIBUTING
docs.After submitting