Classify and extract text 10x better and faster 🦾


➡️  Learn more

How2 Dataset

Created by Sanabria et al. in 2018, the How2 Dataset is a collection of instructional videos covering a wide variety of topics (about 2,000 hours of video clips), with word-level time alignments to the ground-truth English subtitles. About 300 hours have also been translated into Portuguese subtitles. The dataset is in English and Portuguese and contains ~2,000 hours; no file format is listed.

Dataset Sources

Here you can download the How2 dataset (file format not specified).

Download How2 dataset files

Fine-tune with How2 dataset

Metatext is a powerful no-code tool for training, tuning, and integrating custom NLP models.


Paper

Read the full original How2 paper.

Download PDF paper



Metatext helps you classify and extract information from text and documents using language models customized with your data and expertise.