Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
It is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning. | MumbleFlow is a fully local speech to text and voice to text app. Sub-second offline transcription powered by whisper.cpp. No cloud, no subscription — $5 one-time purchase. Available on macOS, Windows & Linux. |
PyTorch library for deep learning research on audio generation;
Features the state-of-the-art EnCodec audio compressor / tokenizer | MumbleFlow Team, Tauri 2.0, whisper.cpp, llama.cpp, macOS 14+ |
Statistics | |
GitHub Stars 22.6K | GitHub Stars - |
GitHub Forks 2.5K | GitHub Forks - |
Stacks 3 | Stacks 0 |
Followers 7 | Followers 1 |
Votes 0 | Votes 1 |
Integrations | |
| No integrations available | |

It can be used to complement any regular touch user interface with a real time voice user interface. It offers real time feedback for faster and more intuitive experience that enables end user to recover from possible errors quickly and with no interruptions.

Turn prompts or lyric drafts into complete songs with vocals, arrangement, and mix in minutes. AITextSong is free to try in your browser, with MP3/WAV downloads on paid plans.

Use sora2 to create realistic AI videos with synchronized audio instantly. Physics-accurate motion, cinematic quality. 10 free credits, no credit card needed. Try Sora 2 now!

All-in-one content studio — easily create any photo, video or audio clip with AI. Affordable, easy to use and featuring the latest AI models.

Powered by advanced AI models. Transform text into professional music instantly. No subscriptions required - start creating now!

The ultimate Image to Image AI tool. Instantly apply AI style transfer and powerful photo effects. Explore our suite of image and video transformation tools.

Create custom songs for videos, gifts & brands instantly. 20+ styles with lyrics & vocals. Commercial license included.

Create royalty-free music with AI. Turn text or lyrics into professional tracks. Commercial license for YouTube, Spotify, TikTok. Instant downloads.

Instantly transcribe video to text with our advanced engine. High accuracy, speaker ID, and smart subtitles. The best video to text converter for creators.

Use Lip Sync AI to create free AI-powered lip sync animations effortlessly. Generate perfectly synced videos with Lip Sync AI for any language and scenario!