Skip to content

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning (CLEVR & CoGenT)

Question AnsweringVisualEnglish

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning (CLEVR & CoGenT) is a question answering dataset in English from Johnson et al. with 999,968 questions; 100,000 images records in JSON format.

About A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning (CLEVR & CoGenT)

Visual question answering dataset contains 100,000 images and 999,968 questions.

Details

Task
Question Answering, Visual
Language
English
Format
JSON
Rows / instances
999,968 questions; 100,000 images
Creator
Johnson et al.
Year
2016
Download Paper

Related Question Answering, Visual datasets

FAQ