Skip to content

google-research-datasets/natural_questions

Question AnsweringENBenchmarkcc-by-sa-3.0

The google-research-datasets/natural_questions dataset is a EN question answering resource from google-research-datasets at 2022 comprising 323,033 examples. With 21K downloads and 124 likes, it is actively used by the community. It is released under the cc-by-sa-3.0 license and is a 10K<n<100K-scale dataset.

📊 This dataset is used as an LLM benchmark. See model leaderboards →

About google-research-datasets/natural_questions

Dataset Card for Natural Questions Dataset Summary The NQ corpus contains questions from real users, and it requires QA systems to read and comprehend an entire Wikipedia article that may or may not contain the answer to the question...

Details

Task
Question Answering
Language
EN
Format
Parquet
Rows / instances
323033
Size
10K<n<100K
Creator
google-research-datasets
Year
2022
License
cc-by-sa-3.0
Downloads
21026
Likes
124
Download Homepage

Related Question Answering datasets

FAQ