Skip to content

BERT-Large-CAS (PTB+WT2+WT103)

AmazonNeural Architecture Search - NASLanguage modeling/generation

BERT-Large-CAS (PTB+WT2+WT103) is a Neural Architecture Search - NAS model from Amazon released in 2019 with 395000000.0 parameters.

About BERT-Large-CAS (PTB+WT2+WT103)

The Transformer architecture is superior to RNN-based models in computational efficiency. Recently, GPT and BERT demonstrate the efficacy of Transformer models on various NLP tasks using pre-trained language models on large-scale corpora. Surprisingl

Details

Provider
Amazon
Task
Neural Architecture Search - NAS,Language modeling/generation
Parameters
395000000.0
Released
2019-04-20
Open weights
No
View model source

Explore

FAQ