Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
Google Cloud Speech API enables developers to convert audio to text by applying powerful neural network models in an easy to use API. The API recognizes over 80 languages and variants, to support your global user base. | Creating safe artificial general intelligence that benefits all of humanity. Our work to create safe and beneficial AI requires a deep understanding of the potential risks and benefits, as well as careful consideration of the impact. |
Over 80 Languages;Return Text Results In Real-Time;Accurate In Noisy Environments;Powered by Machine Learning | Pioneering research on the path to AGI;
Transforming work and creativity with AI |
Statistics | |
Stacks 39 | Stacks 935 |
Followers 74 | Followers 195 |
Votes 1 | Votes 0 |
Pros & Cons | |
Pros
| No community feedback yet |
Integrations | |
| No integrations available | |

It is the base model weights and network architecture of Grok-1, the large language model. Grok-1 is a 314 billion parameter Mixture-of-Experts model trained from scratch by xAI.

TalkAny—Free AI Speaking Practice Platform. Practice English/Chinese speaking with AI 24/7; no partner needed. Get real-time grammar correction, pronunciation feedback, and natural expression tips. Perfect for IELTS, TOEFL, DET exam prep, daily conversation, and job interviews. Zero pressure, unlimited practice. Start speaking now!

It is Google’s largest and most capable AI model. It is built to be multimodal, it can generalize, understand, operate across, and combine different types of info — like text, images, audio, video, and code.

It is a state-of-the-art foundational large language model designed to help researchers advance their work in this subfield of AI.

It is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.

Try Grok 4 on GPT Proto. Access xAI’s most advanced 1.7T LLM with 130K context, multimodal support, and real-time data integration for dynamic analysis.

Transcribe and translate speech in over 60 languages, in real-time, with high accuracy.

It is a next-generation AI assistant. It is accessible through chat interface and API. It is capable of a wide variety of conversational and text-processing tasks while maintaining a high degree of reliability and predictability.

It is a large multimodal model (accepting text inputs and emitting text outputs today, with image inputs coming in the future) that can solve difficult problems with greater accuracy than any of our previous models, thanks to its broader general knowledge and advanced reasoning capabilities.

It offers an API to add cutting-edge language processing to any system. Through training, users can create massive models customized to their use case and trained on their data.