Code-Mixed-Dialog Dataset
Created by Banerjee et al. in 2018, Code-Mixed-Dialog is a goal-oriented dialog dataset containing code-mixed conversations. Specifically, the authors take text from the DSTC2 restaurant reservation dataset and create code-mixed versions of it in Hindi-English, Bengali-English, Gujarati-English and Tamil-English. The dataset is multilingual, has a listed size of 49,167, and is distributed in text file format.
Dataset Sources
Here you can download the Code-Mixed-Dialog dataset in Text format.
Download Code-Mixed-Dialog dataset Text files
Fine-tune with Code-Mixed-Dialog dataset
Metatext is a no-code tool for training, tuning, and integrating custom NLP models.
Paper
Read the full original Code-Mixed-Dialog paper.