
Discuss ROUGE section in eval metrics #87

@cleong110

In the "evaluation metrics" section, discuss how ROUGE was designed for summarization rather than machine translation, and the pitfalls of using it to evaluate MT.

https://stats.stackexchange.com/questions/301626/interpreting-rouge-scores

https://towardsdatascience.com/to-rouge-or-not-to-rouge-6a5f3552ea45

https://hyperskill.org/learn/step/29669

https://en.wikipedia.org/wiki/ROUGE_(metric)

https://aclanthology.org/P04-1077/ ("Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics") is the paper that covers the ROUGE-L variant; unlike the rest of ROUGE, this one is actually aimed at MT.
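For the write-up, a rough sketch of the LCS-based score might help show what ROUGE-L measures and where it can mislead. This is not taken from any of the links above or from a particular library: the function names `lcs_length` and `rouge_l` are made up for illustration, tokenization is plain lowercased whitespace splitting, and the F-measure uses the usual precision/recall weighting with an assumed default of `beta=1.0`.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    # Dynamic programming, keeping only the previous row of the table.
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(candidate, reference, beta=1.0):
    """Sentence-level LCS precision, recall and F (illustrative sketch only)."""
    # Assumption: naive whitespace tokenization, no stemming or stopword removal.
    cand = candidate.lower().split()
    ref = reference.lower().split()
    zero = {"precision": 0.0, "recall": 0.0, "f": 0.0}
    if not cand or not ref:
        return zero
    lcs = lcs_length(cand, ref)
    if lcs == 0:
        return zero
    precision = lcs / len(cand)  # share of the candidate covered by the LCS
    recall = lcs / len(ref)      # share of the reference covered by the LCS
    f = (1 + beta**2) * precision * recall / (recall + beta**2 * precision)
    return {"precision": precision, "recall": recall, "f": f}


reference = "the cat sat on the mat"
padded = "the cat the cat sat on the mat on the mat"
print(rouge_l(padded, reference))
```

In this sketch the padded candidate still reaches full recall because the reference survives as a subsequence; only the precision term penalizes the extra words. That is one of the pitfalls worth spelling out when a recall-only ROUGE number is reported for translation output.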
