Classify and extract text 10x better and faster 🦾


➡️  Learn more

ParaBank Dataset

Created by Hu et al. at 2019, the ParaBank Dataset contains paraphrases with 79.5 million references and on average 4 paraphrases per reference., in English language. Containing 79.5M references in TSV file format.

Dataset Sources

Here you can download the ParaBank dataset in TSV format.

Download ParaBank dataset TSV files

Fine-tune with ParaBank dataset

Metatext is a powerful no-code tool for train, tune and integrate custom NLP models

➡️  Learn more

Paper

Read full original ParaBank paper.

Download PDF paper


Classify and extract text 10x better and faster 🦾

Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.