Question 1

What is the XED dataset?

Accepted Answer

Dataset consists of emotion annotated movie subtitles from OPUS. Plutchik's 8 core emotions to annotate were used. The data is multilabel. The original annotations have been sourced for mainly English and Finnish, with the rest created using annotation projection to aligned subtitles in 41 additional languages, with 31…

Question 2

Is XED a benchmark?

Accepted Answer

XED is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download XED?

Accepted Answer

XED is available at its source: https://github.com/Helsinki-NLP/XED.

XED

About XED

Details

Related Sentiment Analysis datasets

FAQ