Question 1

What is the Train-O-Matic Large dataset?

Accepted Answer

Automatically-generated corpora in multiple languages with sense annotations for nouns using WordNet for English and BabelNet for all other languages as inventories of senses.

Question 2

Is Train-O-Matic Large a benchmark?

Accepted Answer

Train-O-Matic Large is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download Train-O-Matic Large?

Accepted Answer

Train-O-Matic Large is available at its source: http://trainomatic.org/data/train-o-matic_lrec2018.tar.gz.

Train-O-Matic Large

About Train-O-Matic Large

Details

Related Word Sense Disambiguation datasets

FAQ