Classify and extract text 10x better and faster 🦾


➡️  Learn more

ACL Anthology Reference Corpus (ACL ARC) Dataset

Created by Lahiri et al. at 2014, the ACL Anthology Reference Corpus (ACL ARC) Dataset contains 10,921 articles from the February 2007 snapshot of the Anthology; text and metadata for the articles were extracted, consisting of BibTeX records derived either from the headers of each paper or from metadata taken from the Anthology website., in English language. Containing 10,921 in Text file format.

Dataset Sources

Here you can download the ACL Anthology Reference Corpus (ACL ARC) dataset in Text format.

Download ACL Anthology Reference Corpus (ACL ARC) dataset Text files

Fine-tune with ACL Anthology Reference Corpus (ACL ARC) dataset

Metatext is a powerful no-code tool for train, tune and integrate custom NLP models

➡️  Learn more

Paper

Read full original ACL Anthology Reference Corpus (ACL ARC) paper.

Download PDF paper


Classify and extract text 10x better and faster 🦾

Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.