Skip to content

TrellisNet

Carnegie Mellon University (CMU)Bosch Center for Artificial IntelligenceIntel LabsLanguage modeling

TrellisNet is a language modeling model from Carnegie Mellon University (CMU),Bosch Center for Artificial Intelligence,Intel Labs released in 2018 with 180000000.0 parameters.

About TrellisNet

We present trellis networks, a new architecture for sequence modeling. On the one hand, a trellis network is a temporal convolutional network with special structure, characterized by weight tying across depth and direct injection of the input into de

Details

Provider
Carnegie Mellon University (CMU),Bosch Center for Artificial Intelligence,Intel Labs
Task
Language modeling
Parameters
180000000.0
Released
2018-10-15
Open weights
No
View model source

Explore

FAQ