Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API. The API recognizes over 80 languages and variants, to support your global user base. | Creating safe artificial general intelligence that benefits all of humanity. Our work to create safe and beneficial AI requires a deep understanding of the potential risks and benefits, as well as careful consideration of the impact. |
Over 80 Languages;Return Text Results In Real-Time;Accurate In Noisy Environments;Powered by Machine Learning | Pioneering research on the path to AGI;
Transforming work and creativity with AI |
Statistics | |
Stacks 39 | Stacks 980 |
Followers 74 | Followers 203 |
Votes 1 | Votes 0 |
Pros & Cons | |
Pros
| No community feedback yet |
Integrations | |
| No integrations available | |

It is the base model weights and network architecture of Grok-1, the large language model. Grok-1 is a 314 billion parameter Mixture-of-Experts model trained from scratch by xAI.

Transcribe and translate speech in over 60 languages, in real-time, with high accuracy.

TalkAny—Free AI Speaking Practice Platform. Practice English/Chinese speaking with AI 24/7; no partner needed. Get real-time grammar correction, pronunciation feedback, and natural expression tips. Perfect for IELTS, TOEFL, DET exam prep, daily conversation, and job interviews. Zero pressure, unlimited practice. Start speaking now!

It is Google’s largest and most capable AI model. It is built to be multimodal, it can generalize, understand, operate across, and combine different types of info — like text, images, audio, video, and code.

It is a state-of-the-art foundational large language model designed to help researchers advance their work in this subfield of AI.

It is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.

Converts any video or audio to accurate transcripts in minutes. Free to use, supports 55+ languages.

Try Grok 4 on GPT Proto. Access xAI’s most advanced 1.7T LLM with 130K context, multimodal support, and real-time data integration for dynamic analysis.

Get real-time AI suggestions during your meetings. No bot joins your call, no awkward notifications for participants. Just helpful prompts while you speak, in 12 languages.

Build smarter coding agents with the GLM-4.7 API, featuring multilingual coding, terminal tasks, and think-before-act reasoning.