Classify and extract text 10x better and faster 🦾


➡️  Learn more

TriageSQL Dataset

Created by Zhang et al. at 2020, the TriageSQL Dataset is a cross-domain text-to-SQL question intention classification benchmark. It contains 34K databases and 390K questions from 20 existing datasets., in English language. Containing 390 in JSON file format.

Dataset Sources

Here you can download the TriageSQL dataset in JSON format.

Download TriageSQL dataset JSON files

Fine-tune with TriageSQL dataset

Metatext is a powerful no-code tool for train, tune and integrate custom NLP models

➡️  Learn more

Paper

Read full original TriageSQL paper.

Download PDF paper


Classify and extract text 10x better and faster 🦾

Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.