Classify and extract text 10x better and faster 🦾



Kensho Derived Wikimedia Dataset (KDWD)

Created by Kensho R&D in 2020, the Kensho Derived Wikimedia Dataset (KDWD) contains two main components: a link-annotated corpus of English Wikipedia pages and a compact sample of the Wikidata knowledge base. The dataset is in English and is available in CSV and JSON file formats.

Dataset Sources

Here you can download the Kensho Derived Wikimedia Dataset (KDWD) in CSV and JSON formats.

Download Kensho Derived Wikimedia Dataset (KDWD) CSV and JSON files
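
Once the files are downloaded, they can be read with Python's standard library. The sketch below is a minimal illustration, not the official loading procedure. It assumes the JSON file stores one object per line. The helper names, column names and keys are hypothetical and are not taken from the KDWD documentation.

```python
import csv
import io
import json
from typing import Iterator, TextIO


def iter_csv_rows(handle: TextIO) -> Iterator[dict]:
    """Yield each CSV row as a dict keyed by the header line."""
    yield from csv.DictReader(handle)


def iter_json_lines(handle: TextIO) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line (assumed layout)."""
    for line in handle:
        line = line.strip()
        if line:
            yield json.loads(line)


# Tiny in-memory stand-ins for downloaded files. The columns and keys
# are made up for illustration and are not the real KDWD schema.
sample_csv = io.StringIO("id,title\n1,Example page\n")
sample_json = io.StringIO('{"id": 1, "text": "Example text"}\n')

csv_rows = list(iter_csv_rows(sample_csv))
json_rows = list(iter_json_lines(sample_json))
print(csv_rows[0]["title"])
print(json_rows[0]["text"])
```

For real files, pass an opened file handle (for example `open(path, newline="", encoding="utf-8")` for CSV) in place of the in-memory samples. Both helpers stream one record at a time, which helps when the files are too large to hold in memory.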

Fine-tune with the Kensho Derived Wikimedia Dataset (KDWD)

Metatext is a no-code tool for training, tuning, and integrating custom NLP models.


Paper

Read the full original Kensho Derived Wikimedia Dataset (KDWD) paper.

Download PDF paper



Metatext helps you classify and extract information from text and documents using language models customized with your own data and expertise.