Skip to content

Latest commit

 

History

History
89 lines (66 loc) · 5.51 KB

README.md

File metadata and controls

89 lines (66 loc) · 5.51 KB

amrlib models

This repository contains releases of models for the amrlib library. For information on how to download and install see ReadTheDocs Installation Instructions or follow the instructions below.

Note that because these models are large binary files, they are not directly tracked with git. Instead the are provided for download as .tar.gz files.

Installation

Download the pre-trained models from the releases directory and extract in them in amrlib/data. Set a link to the directory (or rename it) as either model_stog (sentence to graph) for the parse style models or model_gtos (graph to sentence) for the generate style models.

If you're unsure where amrlib is installed you can do...

pip3 show amrlib

or

python3
>>> import amrlib
>>> amrlib.__file__

Sentence to Graph Models

Name Version Date Size Score Speed
parse_xfm_bart_large 0.1.0 2022-02-16 1.4GB 83.7 SMATCH 17/sec
parse_xfm_bart_base 0.1.0 2022-02-16 492 MB 82.3 SMATCH 31/sec
parse_spring 0.1.0 2021-11-25 1.5GB 83.5 SMATCH 14/sec
parse_t5 0.2.0 2020-11-27 785MB 81.9 SMATCH 11/sec
parse_gsii 0.1.0 2020-08-30 787MB 76.8 SMATCH 28/sec

All models are trained and scored on AMR-3 (LDC2020T02) using num_beams=4 for parse_xfm_x and num_beams=5 for parse_spring. Note that AMR-3 is a more difficult test set than the older AMR-2 set and generally scores a bit lower for similar models. All scores are without adding the :wiki tags. However, when using BLINK, scores typically stay approximately the same since the wikification process itself scores in the low to mid 80s on smatch.

Speed is the inference speed on the AMR-3 test set (1898 graphs) using an RTX3090 with num_beams=1 and batch_size=32. The units are sentences/second.

Attribution:

  • The parse_spring code is from the SPRING model. Note that the author's license for their code is "Attribution-NonCommercial-ShareAlike 4.0 International". Details on the model can be found in this paper.

  • The parse_gsii model comes from jcyk/AMR-gs, the details of which can be found in this paper.

  • All other models were developed as part of amrlib.

Graph to Sentence Models

Name Version Date Size Score
generate_t5wtense 0.1.0 2020-12-30 787MB 54/44 BLEU

The generate_t5wtense gives a 54 BLEU with tense tags or 44 BLEU with un-tagged LDC2020T02. Note that the model is only scored with graphs that fit in the T5 model's 512 token limit. If including clipped graphs, scores will be more like 52/43 BLEU. Details on using this type of model for generation can be found in this paper.

Additionally, there is a training config file for a T5-large based model in the amrlib/config directory here. This model scores about 2 BLEU points higher than generate_t5wtense-v0_1_0 if you take the time to train it yourself. The training configuration has not been full optimized so you may be able to increase this if you try different hyperparameters or possibly a different version of the base pretrained T5 model.

Issues and bug reports

To report an issue with a model please use the amrlib issues list.