UIT-SPC Dataset
Created by Thin et al. at 2017, the UIT-SPC Dataset contains 1,565 papers of top NLP/CL conferences such as ACL, CoNLL , EACL NAACL and EMNLP. They are pre-processed by removing unnecessary information (e.g formula, table, etc). Then, they were formatted to .xml that includes the title paper, sections, and sub-sections according to the paper's structure. [requires contacting author for corpus], in Vietnamese language. Containing 1,565 in n/a file format.
Dataset Sources
Here you can download the UIT-SPC dataset in n/a format.
Download UIT-SPC dataset n/a files
Fine-tune with UIT-SPC dataset
Metatext is a powerful no-code tool for train, tune and integrate custom NLP models
Paper
Read full original UIT-SPC paper.
Classify and extract text 10x better and faster 🦾
Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.