Skip to content

microsoft/ms_marco

General NLPENBenchmark

Microsoft/ms_marco is a General NLP-focused benchmark dataset in EN distributed in Parquet format.

📊 This dataset is used as an LLM benchmark. See model leaderboards →

About microsoft/ms_marco

Dataset Card for "ms_marco" Dataset Summary Starting with a paper released at NIPS 2016, MS MARCO is a collection of datasets focused on deep learning in search. The first dataset was a question answering dataset featuring 100,000 re...

Details

Task
General NLP
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
microsoft
Year
2022
Download

Related General NLP datasets

FAQ