google-research-datasets/natural_questions
Question AnsweringENBenchmarkcc-by-sa-3.0
The google-research-datasets/natural_questions dataset is a EN question answering resource from google-research-datasets at 2022 comprising 323,033 examples. With 21K downloads and 124 likes, it is actively used by the community. It is released under the cc-by-sa-3.0 license and is a 10K<n<100K-scale dataset.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
About google-research-datasets/natural_questions
Dataset Card for Natural Questions
Dataset Summary
The NQ corpus contains questions from real users, and it requires QA systems to
read and comprehend an entire Wikipedia article that may or may not contain the
answer to the question...
Details
- Task
- Question Answering
- Language
- EN
- Format
- Parquet
- Rows / instances
- 323033
- Size
- 10K<n<100K
- Creator
- google-research-datasets
- Year
- 2022
- License
- cc-by-sa-3.0
- Downloads
- 21026
- Likes
- 124