cais/mmlu
Question Answering, EN, Benchmark, mit
cais/mmlu is an English-language question answering benchmark dataset published by cais, with 231,400 records in Parquet format. It is distributed under the MIT license, falls in the 100K<n<1M size category, and has been downloaded 429.7K times.
This dataset is used as an LLM benchmark.
About cais/mmlu
Dataset Card for MMLU
Dataset Summary
Measuring Massive Multitask Language Understanding by Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, and Jacob Steinhardt (ICLR 2021).
This is a massive ...
Details
- Task: Question Answering
- Language: EN
- Format: Parquet
- Rows / instances: 231,400
- Size: 100K<n<1M
- Creator: cais
- Year: 2026
- License: mit
- Downloads: 429,656
- Likes: 778
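
Because the dataset is listed as a question answering benchmark, a common first step is to turn each record into a lettered multiple-choice prompt. The sketch below is illustrative only and rests on assumptions not stated on this page: that each record carries `question`, `choices` (four strings), and `answer` (a zero-based index) fields, and that the data is fetched with the Hugging Face `datasets` library using a configuration named `all`. The helper names `format_prompt`, `answer_letter`, and `load_test_split`, as well as the sample record, are hypothetical.

```python
from typing import Any, Mapping

# Letters used to label the answer options (assumes four choices per record).
LETTERS = "ABCD"


def format_prompt(record: Mapping[str, Any]) -> str:
    """Render one multiple-choice record as a lettered prompt string."""
    lines = [record["question"].strip()]
    for letter, choice in zip(LETTERS, record["choices"]):
        lines.append(f"{letter}. {choice}")
    lines.append("Answer:")
    return "\n".join(lines)


def answer_letter(record: Mapping[str, Any]) -> str:
    """Map the assumed zero-based `answer` index to its letter."""
    return LETTERS[record["answer"]]


def load_test_split():
    """Fetch the test split (needs network access and the `datasets` package).

    The configuration name "all" and the split name "test" are assumptions.
    """
    from datasets import load_dataset  # imported lazily so the helpers above work offline

    return load_dataset("cais/mmlu", "all", split="test")


# Hypothetical record in the assumed schema; not taken from the dataset.
sample = {
    "question": "What is 2 + 2?",
    "subject": "elementary_mathematics",
    "choices": ["3", "4", "5", "6"],
    "answer": 1,
}
```

Calling `format_prompt(sample)` yields the question followed by options A through D and a trailing `Answer:` line, and `answer_letter(sample)` gives the letter to compare against a model's output. Keeping the download in its own function lets the formatting logic be checked without network access.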