TrellisNet
Carnegie Mellon University (CMU)Bosch Center for Artificial IntelligenceIntel LabsLanguage modeling
TrellisNet is a language modeling model from Carnegie Mellon University (CMU),Bosch Center for Artificial Intelligence,Intel Labs released in 2018 with 180000000.0 parameters.
About TrellisNet
We present trellis networks, a new architecture for sequence modeling. On the one hand, a trellis network is a temporal convolutional network with special structure, characterized by weight tying across depth and direct injection of the input into de
Details
- Provider
- Carnegie Mellon University (CMU),Bosch Center for Artificial Intelligence,Intel Labs
- Task
- Language modeling
- Parameters
- 180000000.0
- Released
- 2018-10-15
- Open weights
- No