Classify and extract text 10x better and faster 🦾


➡️  Learn more

Content-Based Categorized Dataset

Created by Suwaileh et al. in 2016, the Content-Based Categorized Dataset contains 996 Arabic-language Web pages that were extracted from the ArabicWeb16 dataset and labeled. The dataset is distributed in text file format.

Dataset Sources

Here you can download the Content-Based Categorized Dataset in text format.

Download the Content-Based Categorized Dataset text files

Fine-tune with the Content-Based Categorized Dataset

Metatext is a no-code tool for training, tuning, and integrating custom NLP models.


Paper

Read the full original Content-Based Categorized Dataset paper.

Download PDF paper



Metatext helps you classify and extract information from text and documents using language models customized with your data and expertise.