Classify and extract text 10x better and faster 🦾


➡️  Learn more

Flickr30K Entities Dataset

Created by Plummer et al. at 2017, the Flickr30K Entities Dataset contains 244k coreference chains and 276k manually annotated bounding boxes for each of the 31,783 images and 158,915 English captions (five per image) in the original dataset., in English language. Containing 31,783 in Text, XML file format.

Dataset Sources

Here you can download the Flickr30K Entities dataset in Text, XML format.

Download Flickr30K Entities dataset Text, XML files

Fine-tune with Flickr30K Entities dataset

Metatext is a powerful no-code tool for train, tune and integrate custom NLP models

➡️  Learn more

Paper

Read full original Flickr30K Entities paper.

Download PDF paper


Classify and extract text 10x better and faster 🦾

Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.