Classify and extract text 10x better and faster 🦾



BSNLP-2019 Dataset

Created by Piskorski et al. in 2019, the BSNLP-2019 Dataset is used to classify named entities in web documents in Slavic languages, lemmatize them, and match them across languages. The dataset is multilingual and covers 4 languages: Bulgarian, Czech, Polish, and Russian. Its size is listed as n/a, and it is provided in Text and OUT file formats.

Dataset Sources

Here you can download the BSNLP-2019 dataset in Text and OUT formats.

Download BSNLP-2019 dataset Text, OUT files
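
Once downloaded, the OUT files can be read with a few lines of Python. The sketch below is a minimal illustration and not an official loader. It assumes, based on the shared task's annotation layout rather than anything stated on this page, that each OUT file starts with a document ID line followed by tab-separated rows of mention, lemma, category, and cross-lingual ID. The names `Annotation` and `parse_out`, and the sample text, are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Annotation:
    mention: str           # entity as it appears in the text
    lemma: str             # base (lemmatized) form of the entity
    category: str          # entity type label
    cross_lingual_id: str  # identifier shared across languages


def parse_out(text):
    """Parse the contents of one OUT file.

    Assumed layout (not confirmed by this page): the first line is the
    document ID, and every following non-empty line has four
    tab-separated fields.
    """
    lines = text.splitlines()
    if not lines:
        return None, []
    doc_id = lines[0].strip()
    annotations = []
    for line in lines[1:]:
        if not line.strip():
            continue  # ignore blank lines
        fields = [field.strip() for field in line.split("\t")]
        if len(fields) != 4:
            continue  # skip rows that do not match the assumed layout
        annotations.append(Annotation(*fields))
    return doc_id, annotations


# Made-up sample, for illustration only (not taken from the dataset).
sample = "doc-1\nExample Mention\texample lemma\tORG\tORG-Example\n"
doc_id, annotations = parse_out(sample)
print(doc_id, annotations)
```

Rows that do not have exactly four fields are skipped rather than raising an error, which keeps the sketch tolerant of files that deviate from the assumed layout. Check the actual files before relying on it.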

Fine-tune with BSNLP-2019 dataset

Metatext is a no-code tool for training, tuning, and integrating custom NLP models.


Paper

Read the full original BSNLP-2019 paper.

Download PDF paper



Metatext helps you classify and extract information from text and documents using language models customized with your data and expertise.