TIGER-Lab/MMLU-Pro
Question Answering, EN, Benchmark
The TIGER-Lab/MMLU-Pro dataset is an English-language question answering resource released by TIGER-Lab in 2024.
📊 This dataset is used as an LLM benchmark.
About TIGER-Lab/MMLU-Pro
MMLU-Pro Dataset
The MMLU-Pro dataset is a more robust and challenging massive multi-task understanding dataset designed to benchmark large language models' capabilities more rigorously. It contains 12K complex questions across various di...
Details
- Task: Question Answering
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: TIGER-Lab
- Year: 2024