Classify and extract text 10x better and faster 🦾


➡️  Learn more

Microsoft Speech Language Translation Corpus (MSLT) Dataset

Created by Federmann et al. at 2017, the Microsoft Speech Language Translation Corpus (MSLT) Dataset contains conversational, bilingual speech test and tuning data for English, Chinese, and Japanese. It includes audio data, transcripts, and translations; and allows end-to-end testing of spoken language translation systems on real-world data., in Multi-Lingual language. Containing n/a in Wav file format.

Dataset Sources

Here you can download the Microsoft Speech Language Translation Corpus (MSLT) dataset in Wav format.

Download Microsoft Speech Language Translation Corpus (MSLT) dataset Wav files

Fine-tune with Microsoft Speech Language Translation Corpus (MSLT) dataset

Metatext is a powerful no-code tool for train, tune and integrate custom NLP models

➡️  Learn more

Paper

Read full original Microsoft Speech Language Translation Corpus (MSLT) paper.

Download PDF paper


Classify and extract text 10x better and faster 🦾

Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.