ACL Anthology Reference Corpus (ACL ARC) Dataset
Created by Lahiri et al. in 2014, the ACL Anthology Reference Corpus (ACL ARC) dataset contains 10,921 articles from the February 2007 snapshot of the Anthology. Text and metadata were extracted for each article; the metadata consists of BibTeX records derived either from the headers of each paper or from the Anthology website. The 10,921 articles are in English and are provided in text file format.
Dataset Sources
Here you can download the ACL Anthology Reference Corpus (ACL ARC) dataset in Text format.
Download ACL Anthology Reference Corpus (ACL ARC) dataset Text files
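Once the text files are downloaded, they can be read into memory with a few lines of Python. The following is a minimal sketch, not an official loader: it assumes the archive has been unpacked into a local directory of `.txt` files, and both the function name `load_text_corpus` and the directory layout are hypothetical, since the actual structure of the download is not described here.

```python
from pathlib import Path


def load_text_corpus(root):
    """Read every .txt file under `root` into a dict.

    Keys are file paths relative to `root` (with forward slashes);
    values are the file contents as strings.
    Assumption: the corpus is a directory tree of plain-text files.
    """
    root = Path(root)
    corpus = {}
    for path in sorted(root.rglob("*.txt")):
        # errors="replace" keeps loading even if a file has stray bytes,
        # which can happen with text extracted from older PDFs.
        text = path.read_text(encoding="utf-8", errors="replace")
        corpus[path.relative_to(root).as_posix()] = text
    return corpus
```

For example, `docs = load_text_corpus("acl_arc")` would return one entry per text file found under a local `acl_arc` directory (a placeholder name), and `len(docs)` gives a quick check on how many files were read.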
Fine-tune with ACL Anthology Reference Corpus (ACL ARC) dataset
Metatext is a powerful no-code tool for training, tuning, and integrating custom NLP models.
Paper
Read the full original ACL Anthology Reference Corpus (ACL ARC) paper.