ROUGE-N added #69
base: master
Conversation
@@ -118,4 +118,4 @@ def test_compute_metrics(self):
        self.assertAlmostEqual(0.88469, scores['EmbeddingAverageCosineSimilairty'], places=5)
        self.assertAlmostEqual(0.568696, scores['VectorExtremaCosineSimilarity'], places=5)
        self.assertAlmostEqual(0.784205, scores['GreedyMatchingScore'], places=5)
Thanks for the contribution!
Would you add some tests for the values of the ROUGE metrics?
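For example, something along these lines, mirroring the existing assertions (the `ROUGE_1`/`ROUGE_2` key names and the expected values are placeholders, not taken from this PR):

```python
# Placeholder sketch; replace the values with the actual expected scores.
self.assertAlmostEqual(0.5, scores['ROUGE_1'], places=5)
self.assertAlmostEqual(0.4, scores['ROUGE_2'], places=5)
```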
Sorry, but I will not:
- The added code is from another repository, not mine.
- The scores are slightly different from pyrouge's.
- The goal of this PR is to give a quick, approximate way to get ROUGE-N scores. It should not be merged into the main branch, but kept open here.
- For a real ROUGE-N score, someone needs to wrap the official Perl script, ROUGE-1.5.5... I don't have time for this now :/
The other repo's code seems to be Apache-licensed; I'm not sure we can merge it, particularly without including their license. I'm not too worried about slightly different values as long as we're clear in the docs about the method used. Do you know where the differences might come from?
We could at least test that the values are within some reasonable bounds.
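Something like this, inside the existing test (a sketch that reuses the same `scores` dict as the assertions above; the `ROUGE_1`/`ROUGE_2` key names are assumptions about what this PR exposes):

```python
# Bounds check rather than exact values: every ROUGE score must lie in [0, 1].
# Key names are assumed; adjust to whatever the PR actually adds.
for key in ('ROUGE_1', 'ROUGE_2', 'ROUGE_L'):
    self.assertGreaterEqual(scores[key], 0.0)
    self.assertLessEqual(scores[key], 1.0)
```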
I tried comparing the results against the `rouge` package, but the scores are different (and not just in the last few digits...). Not only are the ROUGE-N values different, but so is the existing ROUGE-L.
That package is also Apache-licensed, so I'm not sure we can just use it without including their license (even without modifying their code).
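For reference, this is roughly how I compared them (a sketch using the `rouge` package's public API; the example sentences are arbitrary):

```python
# Sketch: scoring a single hypothesis/reference pair with the pip `rouge` package.
from rouge import Rouge

hypothesis = "the cat sat on the mat"        # arbitrary example
reference = "the cat was sitting on the mat"  # arbitrary example

scores = Rouge().get_scores(hypothesis, reference)[0]
print(scores['rouge-1']['f'], scores['rouge-2']['f'], scores['rouge-l']['f'])
```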
To fix #68, this PR adds the ROUGE-N metric to Rouge.
Taken from: https://github.com/pltrdy/seq2seq/blob/master/seq2seq/metrics/rouge.py
Results might differ slightly from the official ROUGE-1.5.5 script, but at least the code is very simple.
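The approach boils down to n-gram overlap. A minimal sketch of the idea (simplified from the linked file: whitespace tokenization, no smoothing; not the exact code in this PR):

```python
from collections import Counter

def rouge_n(hypothesis, reference, n=2):
    """Approximate ROUGE-N F1 via n-gram overlap (simplified sketch)."""
    def ngrams(text):
        tokens = text.split()
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

    hyp, ref = ngrams(hypothesis), ngrams(reference)
    # Clipped overlap: each n-gram counts at most as often as it appears in both.
    overlap = sum((hyp & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(hyp.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)
```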