openai/gsm8k

Text Generation · EN · Benchmark

The openai/gsm8k dataset is an English (EN) text generation resource from openai, listed under the year 2026 and comprising 17,584 examples.

This dataset is used as an LLM benchmark.

About openai/gsm8k

Dataset Card for GSM8K

Dataset Summary

GSM8K (Grade School Math 8K) is a dataset of 8.5K high-quality, linguistically diverse grade school math word problems. The dataset was created to support the task of question answering o...

Details

Task: Text Generation
Language: EN
Format: Parquet
Rows / instances: 17,584
Creator: openai
Year: 2026