evalplus/humanevalplus
General NLPENBenchmarkapache-2.0
Evalplus/humanevalplus is a General NLP-focused benchmark dataset in EN that provides 164 labeled examples distributed in Parquet format. It is distributed under the apache-2.0 license and falls in the n<1K size category, and has been downloaded 18.7K times.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
Details
- Task
- General NLP
- Language
- EN
- Format
- Parquet
- Rows / instances
- 164
- Size
- n<1K
- Creator
- evalplus
- Year
- 2024
- License
- apache-2.0
- Downloads
- 18677
- Likes
- 21