ERNIE-ViLG
BaiduVision-language generationImage generationText-to-imageImage captioningLanguage modeling/generationVisual question answering
ERNIE-ViLG is vision-language generation model published by Baidu in 2021 featuring 10000000000.0 parameters.
About ERNIE-ViLG
Conventional methods for the image-text generation tasks mainly tackle the naturally bidirectional generation tasks separately, focusing on designing task-specific frameworks to improve the quality and fidelity of the generated samples. Recently, Vis
Details
- Provider
- Baidu
- Task
- Vision-language generation,Image generation,Text-to-image,Image captioning,Language modeling/generation,Visual question answering
- Parameters
- 10000000000.0
- Released
- 2021-12-31
- Open weights
- No