-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is it "train" or "update"? #1
Comments
I didn't write this GensimSimilarity class (Misha did) so he could better answer this. My general thought is that update (in general) for online learning models is going to take a batch and not a single item. Since this is a more specific class meant to update a documents corpus, perhaps it is okay for it to receive an iterable. |
GensimSimilarity has Intended usage:
So, this |
So it seems like |
I don't see what does not match the current interface. Nothing is wrong, everything is right. In general, Having model training procedures in something called
Not every model is suitable for online learning. In the current interface a person who implements the model need to make sure it would make sense to do updates, and implement how to do that inside For example the same The other problem is that we don't actually have an online learning example anywhere and the online learning module was not used anywhere after I wrote trending recommender using that long time ago, which was then again promptly forgotten. Also I would likely refactor the online learning a little differently now if I was working on it. |
We don't define any Model Class that has an update method. Maybe we should define an ABC like:
and make this the required type for LocalTrainer at least? |
GensimSimilarity implements both. StatefulModel requires
train()
. LocalTrainer tries to callmodel.update()
.Background: (Back before
transform
anditransform
we used train() for batches of records and update() for single records. Now we expecttrain
to receive a batch and letOnlineLearningWrapper.transform()
handle single records.)So,
train
instead?update()
back into stateful model or do we need to clean-up GensimSimilarity?The text was updated successfully, but these errors were encountered: