Hi, for the `imdb_text_continuation` task, I looked into the config file `gpt2_ppo_on_supervised.yml`, where `policy` contains `model_name: rajkumarrrk/gpt2-fine-tuned-on-imdb-positive-reviews`. This is the model that will be improved, but which model is used as the reference model?
I want to provide checkpoints for both the reference model and the model that I want to improve, i.e. the policy model. In fact, both are the same model.