Skip to content

GL-LWGC-AWD-MoS-LSTM + dynamic evaluation (WT2)

Ben-Gurion University of the NegevLanguage modeling

GL-LWGC-AWD-MoS-LSTM + dynamic evaluation (WT2) is a language modeling model from Ben-Gurion University of the Negev released in 2017 with 38000000.0 parameters.

About GL-LWGC-AWD-MoS-LSTM + dynamic evaluation (WT2)

Recurrent Neural Networks (RNNs) achieve state-of-the-art results in many sequence-to-sequence modeling tasks. However, RNNs are difficult to train and tend to suffer from overfitting. Motivated by the Data Processing Inequality (DPI), we formulate t

Details

Provider
Ben-Gurion University of the Negev
Task
Language modeling
Parameters
38000000.0
Released
2017-08-29
Open weights
No
View model source

Explore

FAQ