Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
It is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning. | Create royalty-free music with AI. Turn text or lyrics into professional tracks. Commercial license for YouTube, Spotify, TikTok. Instant downloads. |
PyTorch library for deep learning research on audio generation;
Features the state-of-the-art EnCodec audio compressor / tokenizer | ai music generator, ai lyrics generator, ai vocal remover, ai stem splitter |
Statistics | |
GitHub Stars 22.6K | GitHub Stars - |
GitHub Forks 2.5K | GitHub Forks - |
Stacks 3 | Stacks 0 |
Followers 7 | Followers 1 |
Votes 0 | Votes 1 |
Integrations | |
| No integrations available | |

Convert text to high-quality AI voice in seconds. Perfect for content creators, businesses, educators and video makers. Fast, affordable and studio-grade output with multiple accents and languages.

Transform Text into Natural Speech Clear Speak uses advanced AI to generate human-like voices from text. Experience 27 unique voices with customizable pronunciation.

Droidal Voice AI Agent automates scheduling, insurance verification, prior authorizations, and claim follow-ups. It handles payer calls, updates EHR/RCM systems in real time, and cuts manual work by 70%. HIPAA-compliant and built for healthcare RCM teams.

Voice agent QA for teams who can't afford broken calls, compliance gaps, or production failures. Simulate thousands of conversations, validate legal

Music Make AI uses Suno AI's latest music generation technology to create professional, fully mastered tracks in seconds. Multiple genres and styles available - pop, electronic, hip-hop, classical, and more. Perfect for content creators, musicians, and anyone who loves music. Free trial!

Transform your spoken thoughts into engaging X posts with AI. Speak naturally, get authentic tweets ready to publish. Free to start, no credit card required.

AI note taking app that transforms voice recordings, text, images, audio files and videos into clear, summarized notes for meetings, lectures, journals, and more.

Turn lectures, podcasts, and voice notes into clean text with an AI-powered MP3 to text converter.

Seedance 1.5 is a cinematic AI model for native audio-visual video generation with film-grade storytelling quality.

Build AI video, image, and audio pipelines with a simple composable API