IIRC Dataset
Created by Ferguson et al. in 2020, the IIRC Dataset contains more than 13K questions over paragraphs from English Wikipedia that provide only partial information to answer them, with the missing information occurring in one or more linked documents. The data is in English and comes in JSON file format, with a listed size of 5,698.
Dataset Sources
Here you can download the IIRC dataset in JSON format.
Download IIRC dataset JSON files
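Once a JSON file is downloaded, a quick first step is to check how it is laid out before using it. The snippet below is a minimal sketch that relies only on Python's standard library. The function names `load_json` and `summarize` are hypothetical helpers, and nothing is assumed about the field names inside the IIRC files; the code only reports the type and size of the top-level JSON value.

```python
import json
from pathlib import Path


def load_json(path):
    """Read a JSON file from disk and return the parsed top-level value."""
    with Path(path).open(encoding="utf-8") as f:
        return json.load(f)


def summarize(data):
    """Return (type name, item count) for the top-level JSON value.

    Lists and dicts report their length; any other value counts as 1.
    """
    if isinstance(data, (list, dict)):
        return type(data).__name__, len(data)
    return type(data).__name__, 1
```

For example, calling `summarize(load_json(path))` with `path` set to wherever you saved the file shows whether the top level is a list or a dict and how many items it holds, which helps when deciding how to iterate over the records.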
Fine-tune with IIRC dataset
Metatext is a powerful no-code tool for training, tuning, and integrating custom NLP models.
Paper
Read the full original IIRC paper.