List of Embeddings Datasets for Machine Learning Projects
High-quality datasets are key to good performance in natural language processing (NLP) projects. We have collected a list of NLP datasets for the Embeddings task to help you get started on your machine learning projects. Below you will find a curated collection of training data for Embeddings.
What is the Embeddings task?
Word embeddings are a set of techniques in natural language processing typically used to build context-sensitive vector representations of words.
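To make the idea of word vectors concrete, here is a minimal Python sketch that compares words by the cosine similarity of their vectors. The three-dimensional vectors and the `toy_embeddings` name are invented for illustration; real embeddings are learned from data and usually have hundreds of dimensions.

```python
import math

# Toy 3-dimensional vectors, made up for this example (an assumption,
# not values taken from any real embedding dataset).
toy_embeddings = {
    "king": [0.9, 0.8, 0.1],
    "queen": [0.85, 0.9, 0.15],
    "apple": [0.1, 0.2, 0.95],
}


def cosine_similarity(a, b):
    """Return the cosine of the angle between vectors a and b."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


sim_royal = cosine_similarity(toy_embeddings["king"], toy_embeddings["queen"])
sim_fruit = cosine_similarity(toy_embeddings["king"], toy_embeddings["apple"])
print(f"king vs queen: {sim_royal:.3f}")
print(f"king vs apple: {sim_fruit:.3f}")
```

With these toy values, the related pair scores higher than the unrelated pair, which is the property that embedding methods aim for.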
Custom fine-tuning with Embeddings datasets
Metatext is a no-code tool for training, tuning, and integrating custom NLP models.
Found 6 Embeddings Datasets
Let’s get started!
Datasets Knowledge Embedding
Several datasets containing edges and nodes for knowledge base building.
Tencent AI Lab Embedding Corpus
The dataset provides 200-dimensional vector representations (embeddings) for over 8 million Chinese words and phrases.
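Pretrained vectors like these are often shipped as a plain-text file. The sketch below assumes the common word2vec text layout: a header line with the vocabulary size and dimension, then one token and its vector per line. This page does not confirm that layout, and the `read_text_embeddings` helper and the sample data are hypothetical.

```python
import io


def read_text_embeddings(handle, limit=None):
    """Parse word2vec-style text embeddings from an open text file.

    Assumed layout (not confirmed by this page): a header line
    "<vocab_size> <dimension>", then "<token> <v1> ... <vN>" per entry.
    """
    header = handle.readline().split()
    vocab_size, dim = int(header[0]), int(header[1])
    vectors = {}
    for line in handle:
        if limit is not None and len(vectors) >= limit:
            break
        parts = line.split()
        if len(parts) <= dim:
            continue  # skip blank or malformed lines
        # Take the last `dim` fields as the vector, so a token that
        # itself contains spaces is still kept whole.
        token = " ".join(parts[:-dim])
        vectors[token] = [float(x) for x in parts[-dim:]]
    return vocab_size, dim, vectors


# Tiny made-up sample with 3-dimensional vectors, standing in for a real file.
sample = io.StringIO("2 3\n你好 0.1 0.2 0.3\n机器 学习 0.4 0.5 0.6\n")
vocab_size, dim, vectors = read_text_embeddings(sample)
print(vocab_size, dim, sorted(vectors))
```

For a real download, you would pass a file opened with `encoding="utf-8"` and set `limit` to a small number at first, since loading millions of 200-dimensional vectors into Python lists takes a lot of memory.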
Classify and extract text 10x better and faster 🦾
Metatext helps you classify and extract information from text and documents using language models customized with your data and expertise.