TextVQA Dataset
Created by Singh et al. at 2019, the TextVQA TextVQA requires models to read and reason about text in images to answer questions about them. Specifically, models need to incorporate a new modality of text present in the images and reason over it to answer TextVQA questions., in English language. Containing 36,602 in JSON, PNG file format.
Dataset Sources
Here you can download the TextVQA dataset in JSON, PNG format.
Download TextVQA dataset JSON, PNG files
Fine-tune with TextVQA dataset
Metatext is a powerful no-code tool for train, tune and integrate custom NLP models
Paper
Read full original TextVQA paper.
Classify and extract text 10x better and faster 🦾
Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.