Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
It is a 1.2B parameter base model trained on 100K hours of speech for TTS (text-to-speech). It empowers developers and businesses to better connect with their audiences at scale. | Keet is a blazing-fast, private voice dictation tool with auto-punctuation designed for developers, writers, and anyone wanting to move at the speed of thought. |
Emotional speech rhythm and tone in English;
Zero-shot cloning for American & British voices;
Support for (cross-lingual) voice cloning with fine-tuning;
Synthesis of arbitrary-length text
| 7-day free trial, Cancel anytime, 2 machine licenses |
Statistics | |
GitHub Stars 4.2K | GitHub Stars - |
GitHub Forks 693 | GitHub Forks - |
Stacks 0 | Stacks 0 |
Followers 2 | Followers 1 |
Votes 0 | Votes 1 |
Integrations | |
| No integrations available | |

It is the base model weights and network architecture of Grok-1, the large language model. Grok-1 is a 314 billion parameter Mixture-of-Experts model trained from scratch by xAI.

TalkAny—Free AI Speaking Practice Platform. Practice English/Chinese speaking with AI 24/7; no partner needed. Get real-time grammar correction, pronunciation feedback, and natural expression tips. Perfect for IELTS, TOEFL, DET exam prep, daily conversation, and job interviews. Zero pressure, unlimited practice. Start speaking now!

It is Google’s largest and most capable AI model. It is built to be multimodal, it can generalize, understand, operate across, and combine different types of info — like text, images, audio, video, and code.

It is a state-of-the-art foundational large language model designed to help researchers advance their work in this subfield of AI.

Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API. The API recognizes over 80 languages and variants, to support your global user base.

It is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.

Converts any video or audio to accurate transcripts in minutes. Free to use, supports 55+ languages.

Try Grok 4 on GPT Proto. Access xAI’s most advanced 1.7T LLM with 130K context, multimodal support, and real-time data integration for dynamic analysis.

Get real-time AI suggestions during your meetings. No bot joins your call, no awkward notifications for participants. Just helpful prompts while you speak, in 12 languages.

Build smarter coding agents with the GLM-4.7 API, featuring multilingual coding, terminal tasks, and think-before-act reasoning.