Audio generation Models
There are 18 AI and NLP models for Audio generation in our directory. Browse the full list below, or explore models by provider.
Audio generation is a machine-learning task covered in our directory. We list 18 models for it.
Updated June 2026
- MuseNetAudio generationOpenAI
- Gemini Flash 3.1 TTSAudio generationGoogle DeepMind
- Veo 3.1Image-to-video,Video generation,Text-to-video,Audio generationGoogle DeepMind
- GPT-4o (Mar 2025)Chat,Image generation,Audio generation,Vision-language generation,Table tasks,Language modeling/generation,Question answering,Speech recognition (ASR),Speech-to-textOpenAI
- GPT-4o (Jan 2025)Chat,Image generation,Audio generation,Vision-language generation,Table tasks,Language modeling/generation,Question answering,Speech recognition (ASR),Speech-to-textOpenAI
- Fugatto 1Audio generationNVIDIA
- GPT-4o (Nov 2024)Chat,Image generation,Audio generation,Vision-language generation,Table tasks,Language modeling/generation,Question answering,Speech recognition (ASR),Speech-to-textOpenAI
- Suno v4Audio generationSuno
- GPT-4o (Aug 2024)Chat,Image generation,Audio generation,Vision-language generation,Table tasks,Language modeling/generation,Question answering,Speech recognition (ASR),Speech-to-textOpenAI
- GPT-4oChat,Image generation,Audio generation,Vision-language generation,Table tasks,Language modeling/generation,Question answering,Speech recognition (ASR),Speech-to-textOpenAI
- Seedance 2.0Video generation,Audio generationByteDance
- Mamba-24M (SC09)Audio generation,Speech synthesis,Text-to-speech (TTS)Carnegie Mellon University (CMU),Princeton University
- MultiBand DiffusionAudio generationMeta AI,Hebrew University of Jerusalem,LORIA
- AudioLMAudio generationGoogle Research
- MusicGenAudio generationMeta AI
- AudioGenAudio generationMeta AI,Hebrew University of Jerusalem
- MusicLMAudio generationGoogle
- EnCodecAudio generationMeta AI