Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
Have full ownership of the professional audio creation workflow: from content creation and versioning from text, to generation to speech, to sound design and mastering. Create and integrate audio experiences into your mobile applications, IoT projects, websites or social channels without learning specialized audio tools. | It is a library for advanced Text-to-Speech generation. It’s built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed, and quality. It comes with pre-trained models, tools for measuring dataset quality and is already used in 20+ languages for products and research projects. |
Create speech & add sounds; Add personalization to your audio; Real-time text to speech; Version your audio content; Integrate with various APIs such as slack and google docs; Add sound effects | High-performance Deep Learning models for Text2Speech tasks;
Fast and efficient model training;
Detailed training logs on the terminal and Tensorboard;
Support for Multi-speaker TTS;
Efficient, flexible, lightweight but feature complete Trainer API |
Statistics | |
GitHub Stars - | GitHub Stars 43.3K |
GitHub Forks - | GitHub Forks 5.7K |
Stacks 0 | Stacks 1 |
Followers 0 | Followers 5 |
Votes 0 | Votes 0 |
Integrations | |

Convert text to high-quality AI voice in seconds. Perfect for content creators, businesses, educators and video makers. Fast, affordable and studio-grade output with multiple accents and languages.

Plan, write, and publish books, PDF guides, workbooks, and audiobooks with AI workflows. Customize branding and export instantly.

AI note taking app that transforms voice recordings, text, images, audio files and videos into clear, summarized notes for meetings, lectures, journals, and more.

Build AI video, image, and audio pipelines with a simple composable API

VibeMusicing is an AI music tool that creates original songs, lyrics, and beats instantly—fast, customizable, and royalty-free for all types of creators.

Transform boring PDFs and text into viral TikTok-style brainrot study videos. Free online tool with AI voices, speed control, and Minecraft backgrounds. 3 free videos daily!

All-in-one content studio — easily create any photo, video or audio clip with AI. Affordable, easy to use and featuring the latest AI models.

Create viral AI-powered short videos, reels, TikToks, YouTube Shorts, and music videos with voiceovers, auto scripts, subtitles, and ai images — perfect for creators, educators, and marketers.

Amazon Polly is a service that turns text into lifelike speech. Polly lets you create applications that talk, enabling you to build entirely new categories of speech-enabled products. Polly is an Amazon AI service that uses advanced deep learning technologies to synthesize speech that sounds like a human voice.

Google Cloud Text-to-Speech enables developers to synthesize natural-sounding speech with 30 voices, available in multiple languages and variants. It applies DeepMind’s groundbreaking research in WaveNet and Google’s powerful neural networks to deliver the highest fidelity possible.