Visual Datasets
Our directory catalogs 10 datasets for the Visual machine-learning task. Each entry links to its source, paper, and download. Browse the full list below or filter by language.
Updated June 2026
- Visual Commonsense Reasoning (VCR) | Tasks: Question Answering, Visual, Commonsense | Language: English
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning (CLEVR & CoGenT) | Tasks: Question Answering, Visual | Language: English
- Fact-based Visual Question Answering (FVQA) | Tasks: Question Answering, Visual | Language: English
- CMU Multimodal Opinion Sentiment and Emotion Intensity (CMU-MOSEI) | Tasks: Sentiment Analysis, Emotion Recognition, Visual | Language: English
- Microsoft Information-Seeking Conversation (MISC) dataset | Tasks: Speech Recognition, Dialogue, Visual | Language: English
- GQA | Tasks: Question Answering, Visual, Commonsense | Language: English
- Social-IQ Dataset | Tasks: Question Answering, Visual, Commonsense | Language: English
- Textbook Question Answering | Tasks: Question Answering, Reading Comprehension, Visual | Language: English
- TextVQA | Tasks: Question Answering, Visual, Commonsense | Language: English
- VoxCeleb | Tasks: Speech Recognition, Visual | Language: Multi-Lingual