GL-LWGC-AWD-MoS-LSTM + dynamic evaluation (WT2)
Ben-Gurion University of the NegevLanguage modeling
GL-LWGC-AWD-MoS-LSTM + dynamic evaluation (WT2) is a language modeling model from Ben-Gurion University of the Negev released in 2017 with 38000000.0 parameters.
About GL-LWGC-AWD-MoS-LSTM + dynamic evaluation (WT2)
Recurrent Neural Networks (RNNs) achieve state-of-the-art results in many sequence-to-sequence modeling tasks. However, RNNs are difficult to train and tend to suffer from overfitting. Motivated by the Data Processing Inequality (DPI), we formulate t
Details
- Provider
- Ben-Gurion University of the Negev
- Task
- Language modeling
- Parameters
- 38000000.0
- Released
- 2017-08-29
- Open weights
- No