It builds on the capabilities of WhisperLive and WhisperSpeech by integrating Mistral, a large language model (LLM), on top of the real-time speech-to-text pipeline. Both the LLM and Whisper are optimized to run as TensorRT engines, maximizing performance and real-time processing capabilities. | Voibe is an offline voice dictation app for macOS that lets you write at the speed of thought. It works everywhere (Mail, Notes, browsers, Slack, VS Code, ChatGPT, etc.), making it easy to draft messages, capture ideas, and produce long content without breaking concentration. |
Utilizes OpenAI WhisperLive to convert spoken language into text in real time; Large Language Model integration; TensorRT optimization | Productivity, Mac dictation, Voice to text, Speech to text, Voice typing, Offline dictation, On device AI, Private dictation, Transcription, Accessibility, SaaS, AI, Mac App, Developer Tool, Voice dictation, Dictation |
Statistics | |
GitHub Stars 1.6K | GitHub Stars - |
GitHub Forks 126 | GitHub Forks - |
Stacks 0 | Stacks 0 |
Followers 0 | Followers 1 |
Votes 0 | Votes 1 |
Integrations | |
| No integrations available | |

It can be used to complement any regular touch user interface with a real-time voice user interface. It offers real-time feedback for a faster and more intuitive experience, letting end users recover from possible errors quickly and without interruption.

Convert text to high-quality AI voice in seconds. Perfect for content creators, businesses, educators and video makers. Fast, affordable and studio-grade output with multiple accents and languages.

Voice agent QA for teams who can't afford broken calls, compliance gaps, or production failures. Simulate thousands of conversations, validate legal

Seedance 1.5 is a cinematic AI model for native audio-visual video generation with film-grade storytelling quality.

AI concierge that automatically answers vacation rental guest questions 24/7 via text chat and real-time voice conversations. Supports 30+ languages with automatic detection. Powered by OpenAI and Anthropic, with 10-minute Airbnb import setup.

Droidal Voice AI Agent automates scheduling, insurance verification, prior authorizations, and claim follow-ups. It handles payer calls, updates EHR/RCM systems in real time, and cuts manual work by 70%. HIPAA-compliant and built for healthcare RCM teams.

Transform text into natural speech. Clear Speak uses advanced AI to generate human-like voices from text. Experience 27 unique voices with customizable pronunciation.

It is a cloud-based voice service and the brain behind tens of millions of devices including the Echo family of devices, FireTV, Fire Tablet, and third-party devices. You can build voice experiences, or skills, that make everyday tasks faster, easier, and more delightful for customers.

It is a state-of-the-art automatic speech recognition toolkit. It is intended for use by speech recognition researchers and professionals.

It is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.