gaia-benchmark/GAIA
General NLP, EN, Benchmark
The gaia-benchmark/GAIA dataset is an English-language (EN) General NLP resource published by gaia-benchmark in 2023. With 30.8K downloads and 704 likes, it is actively used by the community. It is a small dataset, in the n<1K size category.
📊 This dataset is used as an LLM benchmark.
About gaia-benchmark/GAIA
GAIA dataset
GAIA is a benchmark that aims to evaluate next-generation LLMs (LLMs with augmented capabilities due to added tooling, efficient prompting, access to search, etc.).
We added gating to prevent bots from scraping the dataset. Pleas...
Details
- Task: General NLP
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Size: n<1K
- Creator: gaia-benchmark
- Year: 2023
- Downloads: 30847
- Likes: 704