Question 1

What is the 1 Billion Word Language Model Benchmark (lm1b) dataset?

Accepted Answer

The 1 Billion Word Language Model Benchmark (lm1b) dataset is a English language modeling resource from Chelba et al. at 2013 comprising 1.1 examples.

Question 2

Is 1 Billion Word Language Model Benchmark (lm1b) a benchmark?

Accepted Answer

1 Billion Word Language Model Benchmark (lm1b) is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download 1 Billion Word Language Model Benchmark (lm1b)?

Accepted Answer

1 Billion Word Language Model Benchmark (lm1b) is available at its source: https://github.com/ciprian-chelba/1-billion-word-language-modeling-benchmark.

1 Billion Word Language Model Benchmark (lm1b)

Details

Related Language Modeling datasets

FAQ