| LLMs4OL Paradigm | Task A: Term Typing | Task B: Type Taxonomy Discovery | Task C: Type Non-Taxonomic Relation Extraction | Finetuning | Task A Detailed Results | Task B Detailed Results | Task C Detailed Results | Task A Datasets | Task B Datasets | Task C Datasets | Finetuning Datasets |
For task A for each class, we choose 8 samples per class from the train set. As well for task B and C, we have only considered those train that has been split with integration with task A. The obtained stat for GeoNames (tasks A, and B), WN18RR (task A), UMLS (NCI only for Task A, B, and C), and Schema.OrG (task B) is presented as followings:
Dataset | Supported Task(s) | # of samples |
---|---|---|
WN18RR | A | 32 |
GeoNames | A,B | 5102 |
NCI | A, B, C | 911 |
Schema.Org | B | 1068 |
Each dataset format has been changed to support our requirements for the training. Also, it is worth mentioning that we applied negative sampling for datasets but we ended-up up NOT using them during training process.