TIGER-Lab/MMLU-Pro

Question Answering, EN, Benchmark

The TIGER-Lab/MMLU-Pro dataset is an English question answering resource released by TIGER-Lab in 2024.

This dataset is used as an LLM benchmark.

About TIGER-Lab/MMLU-Pro

The MMLU-Pro dataset is a more robust and challenging massive multi-task understanding dataset, designed to benchmark large language models' capabilities more rigorously. It contains 12K complex questions across various di...

Details

Task: Question Answering
Language: EN
Format: Parquet
Rows / instances: N/A
Creator: TIGER-Lab
Year: 2024
