BSNLP-2019 Dataset
Created by Piskorski et al. at 2019, the BSNLP-2019 Dataset used to classify named entities in web documents in Slavic languages, their lemmatization, and cross-language matching. Dataset covers 4 languages: Bulgarian, Czech, Polish, and Russian., in Multi-Lingual language. Containing n/a in Text, OUT file format.
Dataset Sources
Here you can download the BSNLP-2019 dataset in Text, OUT format.
Download BSNLP-2019 dataset Text, OUT files
Fine-tune with BSNLP-2019 dataset
Metatext is a powerful no-code tool for train, tune and integrate custom NLP models
Paper
Read full original BSNLP-2019 paper.
Classify and extract text 10x better and faster 🦾
Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.