Classify and extract text 10x better and faster 🦾


➡️  Learn more

CoNLL 2003 ++ Dataset

Created by Wang et al. at 2020, the CoNLL 2003 ++ Similar to the original CoNLL except test set has been corrected for label mistakes. The dataset is split into training, development, and test sets, with 14,041, 3,250, and 3,453 instances respectively., in English language. Containing 20,744 in Text file format.

Dataset Sources

Here you can download the CoNLL 2003 ++ dataset in Text format.

Download CoNLL 2003 ++ dataset Text files

Fine-tune with CoNLL 2003 ++ dataset

Metatext is a powerful no-code tool for train, tune and integrate custom NLP models

➡️  Learn more

Paper

Read full original CoNLL 2003 ++ paper.

Download PDF paper


Classify and extract text 10x better and faster 🦾

Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.