It is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning. | (4 hours/day). Accurate audio to text with Speaker ID & timestamps. Export as Word/SRT. Fast, private, and no login required. |
PyTorch library for deep learning research on audio generation; features the state-of-the-art EnCodec audio compressor / tokenizer | Audio to Text, MP3 to Text, Video to Text, Voice Memo to Text, YouTube Transcript |
Statistics | |
GitHub Stars 22.6K | GitHub Stars - |
GitHub Forks 2.5K | GitHub Forks - |
Stacks 3 | Stacks 0 |
Followers 7 | Followers 1 |
Votes 0 | Votes 1 |
Integrations | |
No integrations available | |

All-in-one content studio — easily create any photo, video or audio clip with AI. Affordable, easy to use and featuring the latest AI models.

Powered by advanced AI models. Transform text into professional music instantly. No subscriptions required - start creating now!

Instantly transcribe video to text with our advanced engine. High accuracy, speaker ID, and smart subtitles. The best video to text converter for creators.

The ultimate Image to Image AI tool. Instantly apply AI style transfer and powerful photo effects. Explore our suite of image and video transformation tools.

Use Lip Sync AI to create free AI-powered lip sync animations effortlessly. Generate perfectly synced videos with Lip Sync AI for any language and scenario!

Free to transcribe, translate, and summarize audio/video with ScreenApp AI. Get instant highlighted notes and save time with accurate AI tools.

Transform your spoken thoughts into engaging X posts with AI. Speak naturally, get authentic tweets ready to publish. Free to start, no credit card required.

AI note taking app that transforms voice recordings, text, images, audio files and videos into clear, summarized notes for meetings, lectures, journals, and more.

Music Make AI uses Suno AI's latest music generation technology to create professional, fully mastered tracks in seconds. Multiple genres and styles available - pop, electronic, hip-hop, classical, and more. Perfect for content creators, musicians, and anyone who loves music. Free trial!

Turn lectures, podcasts, and voice notes into clean text with an AI-powered MP3 to text converter.