M6-T
Alibaba, Chat, Image captioning
M6-T is a chat and image captioning model from Alibaba, released in 2021, with about 1.0027 trillion parameters.
About M6-T
Mixture-of-Experts (MoE) models can achieve promising results with an outrageously large number of parameters at constant computation cost, and they have thus become a trend in model scaling. Still, it is a mystery how MoE layers bring quality gains by lever
Details
- Provider: Alibaba
- Task: Chat, Image captioning
- Parameters: 1,002,700,000,000 (about 1.0027 trillion)
- Released: 2021-03-05
- Open weights: No