The BrWaC (Brazilian Portuguese Web as Corpus) Dataset
Created by Pedro et al. at 2020, the The BrWaC (Brazilian Portuguese Web as Corpus) This dataset is a large corpus constructed in our lab following the Wacky framework, which was made public for research purposes., in Portuguese language. Containing n/a in Text file format.
Dataset Sources
Here you can download the The BrWaC (Brazilian Portuguese Web as Corpus) dataset in Text format.
Download The BrWaC (Brazilian Portuguese Web as Corpus) dataset Text files
Fine-tune with The BrWaC (Brazilian Portuguese Web as Corpus) dataset
Metatext is a powerful no-code tool for train, tune and integrate custom NLP models
Paper
Read full original The BrWaC (Brazilian Portuguese Web as Corpus) paper.
Classify and extract text 10x better and faster 🦾
Metatext helps you to classify and extract information from text and documents with customized language models with your data and expertise.