Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
It is Stability AI’s first product for music and sound effect generation. Users can create original audio by entering a text prompt and a duration, generating audio in high-quality, 44.1 kHz stereo. | Turn any idea into a complete song in seconds with TextSong.ai—the most intuitive text to song experience: type, generate, and download high-quality melodies, vocals, and full arrangements ready to share. |
Create original audio by entering a text prompt and a duration;
High-quality audio generation;
Uses a latent diffusion for audio model | AI Music Generator, Custom Song AI, Instrumental Music AI |
Statistics | |
Stacks 1 | Stacks 0 |
Followers 3 | Followers 1 |
Votes 0 | Votes 1 |

It is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.

It is a 1.2B parameter base model trained on 100K hours of speech for TTS (text-to-speech). It empowers developers and businesses to better connect with their audiences at scale.

It is a transformer-based text-to-audio model. It can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects.