Skip to content

Arabic Datasets

We catalog 22 Arabic datasets for NLP and machine learning. Browse the list below or narrow down by task.

This page covers Arabic, a morphologically rich language spoken across the Middle East and North Africa. Our directory includes 22 datasets in Arabic.

Updated June 2026

What tasks do Arabic datasets cover?

Datasets in other languages

Frequently asked questions