Skip to content

Semantic Textual Similarity Datasets

There are 5 semantic textual similarity datasets in our directory. Each links to its source, paper, and download — browse the full list below or filter by language.

Semantic Textual Similarity is a machine-learning task covered in our directory. We catalog 5 datasets for it.

Updated June 2026

What languages do semantic textual similarity datasets cover?

Explore other dataset tasks

Frequently asked questions