Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
It is an On-Premises, Streaming Speech Recognition System built with PyTorch and fastai. | Voibe is an offline voice dictation app for macOS that lets you write at the speed of thought. It works everywhere (Mail, Notes, Browsers, Slack, VS Code, ChatGPT, etc.), making it easy to draft messages, capture ideas, and produce long content without breaking concentration. |
RNN-T network;
Fused language models;
Dynamic Bucketing DataLoader;
Dynamic Quantization;
Tuned language model fusion | Productivity, Mac dictation, Voice to text, Speech to text, Voice typing, Offline dictation, On device AI, Private dictation, Transcription, Productivity, Accessibility, SaaS, AI, Mac App, Developer Tool, Voice dictation, Dictation |
Statistics | |
GitHub Stars 682 | GitHub Stars - |
GitHub Forks 32 | GitHub Forks - |
Stacks 1 | Stacks 0 |
Followers 3 | Followers 1 |
Votes 0 | Votes 1 |

It can be used to complement any regular touch user interface with a real time voice user interface. It offers real time feedback for faster and more intuitive experience that enables end user to recover from possible errors quickly and with no interruptions.

Convert text to high-quality AI voice in seconds. Perfect for content creators, businesses, educators and video makers. Fast, affordable and studio-grade output with multiple accents and languages.

Transform Text into Natural Speech Clear Speak uses advanced AI to generate human-like voices from text. Experience 27 unique voices with customizable pronunciation.

Voice agent QA for teams who can't afford broken calls, compliance gaps, or production failures. Simulate thousands of conversations, validate legal

Droidal Voice AI Agent automates scheduling, insurance verification, prior authorizations, and claim follow-ups. It handles payer calls, updates EHR/RCM systems in real time, and cuts manual work by 70%. HIPAA-compliant and built for healthcare RCM teams.

Seedance 1.5 is a cinematic AI model for native audio-visual video generation with film-grade storytelling quality.

AI concierge that automatically answers vacation rental guest questions 24/7 via text chat and real-time voice conversations. Supports 30+ languages with automatic detection. Powered by OpenAI and Anthropic, with 10-minute Airbnb import setup.

It is a cloud-based voice service and the brain behind tens of millions of devices including the Echo family of devices, FireTV, Fire Tablet, and third-party devices. You can build voice experiences, or skills, that make everyday tasks faster, easier, and more delightful for customers.

It is a state-of-the-art automatic speech recognition toolkit. It is intended for use by speech recognition researchers and professionals.

It is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.