Classify and extract text 10x better and faster 🦾


➡️  Learn more

The BrWaC (Brazilian Portuguese Web as Corpus) Dataset

Created by Pedro et al. at 2020, the The BrWaC (Brazilian Portuguese Web as Corpus) This dataset is a large corpus constructed in our lab following the Wacky framework, which was made public for research purposes., in Portuguese language. Containing n/a in Text file format.

Dataset Sources

Here you can download the The BrWaC (Brazilian Portuguese Web as Corpus) dataset in Text format.

Download The BrWaC (Brazilian Portuguese Web as Corpus) dataset Text files

Fine-tune with The BrWaC (Brazilian Portuguese Web as Corpus) dataset

Metatext is a powerful no-code tool for train, tune and integrate custom NLP models

➡️  Learn more

Paper

Read full original The BrWaC (Brazilian Portuguese Web as Corpus) paper.

Download PDF paper


Classify and extract text 10x better and faster 🦾

Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.