Offensive Language Identification Dataset (OLID) Dataset
Created by Zampieri et al. at 2019, the Offensive Language Identification Dataset (OLID) Dataset contains a collection of 14,200 annotated English tweets using an annotation model that encompasses three levels: offensive language detection, categorization of offensive language, and offensive language target identification., in English language. Containing 14,2 in TSV file format.
Dataset Sources
Here you can download the Offensive Language Identification Dataset (OLID) dataset in TSV format.
Download Offensive Language Identification Dataset (OLID) dataset TSV files
Fine-tune with Offensive Language Identification Dataset (OLID) dataset
Metatext is a powerful no-code tool for train, tune and integrate custom NLP models
Paper
Read full original Offensive Language Identification Dataset (OLID) paper.
Classify and extract text 10x better and faster 🦾
Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.