Classify and extract text 10x better and faster 🦾


➡️  Learn more

UIT-SPC Dataset

Created by Thin et al. at 2017, the UIT-SPC Dataset contains 1,565 papers of top NLP/CL conferences such as ACL, CoNLL , EACL NAACL and EMNLP. They are pre-processed by removing unnecessary information (e.g formula, table, etc). Then, they were formatted to .xml that includes the title paper, sections, and sub-sections according to the paper's structure. [requires contacting author for corpus], in Vietnamese language. Containing 1,565 in n/a file format.

Dataset Sources

Here you can download the UIT-SPC dataset in n/a format.

Download UIT-SPC dataset n/a files

Fine-tune with UIT-SPC dataset

Metatext is a powerful no-code tool for train, tune and integrate custom NLP models

➡️  Learn more

Paper

Read full original UIT-SPC paper.

Download PDF paper


Classify and extract text 10x better and faster 🦾

Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.