m-a-p/SuperGPQA
General NLPENBenchmark
The m-a-p/SuperGPQA dataset is a EN General NLP resource from m-a-p at 2025.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
About m-a-p/SuperGPQA
This repository contains the data presented in SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines.
Tutorials for submitting to the official leadboard
coming soon
📜 License
SuperGPQA is a composite dataset that ...
Details
- Task
- General NLP
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- m-a-p
- Year
- 2025