-
-
Notifications
You must be signed in to change notification settings - Fork 8.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[python-package][PySpark] Expose Training and Validation Metrics #11133
Conversation
3d7161a
to
99fc349
Compare
99fc349
to
25a06b8
Compare
|
||
|
||
@dataclass | ||
class _XGBoostTrainingSummary: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could be XGBoostTrainingSummary
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
DONE
@@ -0,0 +1,43 @@ | |||
"""Xgboost training summary integration submodule.""" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
how about renaming xgboost_training_summary.py to summary.py?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
DONE
a1f9df2
to
469c6bb
Compare
Hi @ayoub317, Could you be able to add some unit tests for this feature? |
Hi @wbo4958, Yes, with pleasure ! Could you please provide some pointers on where I should define these tests ? I had a quick look at the test codebase, and I would assume the following: The tests for the regressor and classifier could be placed under: The test for the ranker could be placed under: Does that sound like a good choice ? Thanks ! |
The path you pasted should be ok for adding new testing. |
469c6bb
to
ef56736
Compare
…gSummary to XGBoostTrainingSummary
5828a0c
to
da80def
Compare
Thanks @wbo4958 ! I pushed another commit adding the tests. Any feedback is welcome. |
da80def
to
3e60eec
Compare
@@ -1148,7 +1151,7 @@ def _train_booster( | |||
if dvalid is not None: | |||
dval = [(dtrain, "training"), (dvalid, "validation")] | |||
else: | |||
dval = None | |||
dval = [(dtrain, "training")] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@trivialfis, Could you check this is ok by enabling it by default?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @trivialfis, I wonder if this default eval dataset is necessary?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Will look into this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me, it's unlikely someone will train a model without any evaluation.
|
||
from .test_spark_local import spark as spark_local | ||
|
||
logging.getLogger("py4j").setLevel(logging.INFO) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is this for debug?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, and since it was also set in test_spark_local.py, I kept it. Do you prefer that we remove it ?
@@ -0,0 +1,233 @@ | |||
import logging |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm wondering, if we could put the tests in this file into the existing test_spark_local.py and reuse the existing test data?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, I can move them there without much effort. Let me know if you'd like me to proceed with that.
However, in my humble opinion, it’s better to keep them in this separate file, and here’s the rationale :
The test_spark_local.py file already exceeds 1800 lines of code, which makes it increasingly difficult to read, maintain and navigate. As new features are added to PySpark XGBoost, this file will only continue to grow, compounding the problem.
I think refactoring the tests to organize them by key features, rather than bundling everything under the TestPySparkLocal class would be a better long-term approach.
If we decide to keep the tests in this file, I can either leave the examples here as they are, or, as you suggested, for better modularity and data reuse, we could import them from test_spark_local.py. Another option is to store all shared data in a separate file, allowing both test_spark_local.py and test_xgboost_summary.py to import what they need from it.
Let me know what you think, I have no strong opinion on this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, That's good point. Originally, I would like to separate the tests per the estimators. like XGBoostClassifier/Regressor/Ranker, instead of per features. So you can share the same dataset for different features.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Perfect ! I like that approach more than what I proposed ! I might work on it sometime soon if no one else does, especially since I have some other features I would like us to introduce.
assert not xgb_model.training_summary.validation_objective_history | ||
|
||
@staticmethod | ||
def assert_non_empty_training_objective_history( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm wondering if we could get the evaluate_results from xgboost itself and the training summary from xgboost-pyspark on the same dataset, and then check if they are equal? You can see some tests in test_spark_local.py are doing same comparison.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, absolutely. Thank you for pointing this out ! I tested this on a simple DataFrame locally, and the results matched perfectly. We should definitely add such tests, I’ll take care of that !
1e98e82
to
f36603a
Compare
Thanks for the review @wbo4958. I added a new commit to address what we discussed. |
… them in test_spark_local.py
f36603a
to
92d7cec
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @ayoub317, Thx for bringing summary feature into xgboost-pyspark.
LGTM.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me.
Let me fix the CI first #11162 . |
Closes #11132