Finetuning Datasets

For task A for each class, we choose 8 samples per class from the train set. As well for task B and C, we have only considered those train that has been split with integration with task A. The obtained stat for GeoNames (tasks A, and B), WN18RR (task A), UMLS (NCI only for Task A, B, and C), and Schema.OrG (task B) is presented as followings:

Dataset	Supported Task(s)	# of samples
WN18RR	A	32
GeoNames	A,B	5102
NCI	A, B, C	911
Schema.Org	B	1068

Each dataset format has been changed to support our requirements for the training. Also, it is worth mentioning that we applied negative sampling for datasets but we ended-up up NOT using them during training process.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Finetuning Datasets

Files

README.md

Latest commit

History

README.md

File metadata and controls

Finetuning Datasets