1 Billion Word Language Model Benchmark (lm1b)

Language Modeling, English

The 1 Billion Word Language Model Benchmark (lm1b) is an English language modeling dataset released by Chelba et al. in 2013, comprising 1.1B examples.

Details

Task: Language Modeling
Language: English
Format: n/a
Rows / instances: 1.1B
Creator: Chelba et al.
Year: 2013
