Skip to content

HumanEval

CodeEnglishBenchmark

The HumanEval dataset is a English code resource from OpenAI (Chen et al.) at 2021 comprising 164 examples.

📊 This dataset is used as an LLM benchmark. See model leaderboards →

Details

Task
Code
Language
English
Format
JSONL
Rows / instances
164
Creator
OpenAI (Chen et al.)
Year
2021
Download Paper

Related Code datasets

FAQ