We do evaluate on the hard subset of QuALITY (as shown on the leaderboard), but, as you noticed, we don't let the user or model know whether an example comes from the hard subset (our jsonl files don't hold this information).
However, I am pretty sure that you can get this information from the original QuALITY dataset if you want.
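For anyone who wants to try this, here is a rough sketch of filtering the original QuALITY release for hard questions. It assumes each line of the original jsonl file is one article with a `questions` list, and that each question carries a `difficult` flag equal to 1 for the hard subset; those field names are my assumption about the original release, so please check them against the file you download. The helper `iter_hard_questions` and the sample records are made up for illustration.

```python
import json
from typing import Iterable, Iterator


def iter_hard_questions(lines: Iterable[str], flag: str = "difficult") -> Iterator[dict]:
    """Yield questions whose `flag` field equals 1, tagged with their article id.

    Field names ("questions", "article_id", "difficult") are assumptions
    about the original QuALITY jsonl layout, not something this repo defines.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        article = json.loads(line)
        for question in article.get("questions", []):
            if question.get(flag) == 1:
                yield {"article_id": article.get("article_id"), **question}


# Tiny made-up sample standing in for lines read from the original file.
sample = [
    json.dumps({"article_id": "a1", "questions": [
        {"question": "Q1", "difficult": 1},
        {"question": "Q2", "difficult": 0},
    ]}),
    json.dumps({"article_id": "a2", "questions": [
        {"question": "Q3", "difficult": 1},
    ]}),
]

hard = list(iter_hard_questions(sample))
print([q["question"] for q in hard])
```

With a real file you would pass an open file handle instead of `sample`. Mapping the hard questions back to the examples in our jsonl files would still need a key shared by both files, which I have not verified exists.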
Is there a way to load only the hard examples for QuALITY?