Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
It represents a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding, achieving impressive chat capabilities mimicking spirits of the multimodal GPT-4 and setting a new state-of-the-art accuracy on Science QA. | Build smarter coding agents with the GLM-4.7 API, featuring multilingual coding, terminal tasks, and think-before-act reasoning. |
Open-source;
Multimodal GPT-4 level capabilities;
Impressive chat abilities | Model Library, Serverless Inference, Dedicated Endpoint, Canopy Wave Chat, NVIDIA GB200 NVL72 |
Statistics | |
GitHub Stars 23.9K | GitHub Stars - |
GitHub Forks 2.7K | GitHub Forks - |
Stacks 1 | Stacks 0 |
Followers 1 | Followers 1 |
Votes 0 | Votes 1 |
Integrations | |
| No integrations available | |

It is a cutting-edge framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, it empowers agents to work together seamlessly, tackling complex tasks.

It is the base model weights and network architecture of Grok-1, the large language model. Grok-1 is a 314 billion parameter Mixture-of-Experts model trained from scratch by xAI.

It is Google’s largest and most capable AI model. It is built to be multimodal, it can generalize, understand, operate across, and combine different types of info — like text, images, audio, video, and code.

It is a state-of-the-art foundational large language model designed to help researchers advance their work in this subfield of AI.

It is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.

Discover and install agent skills for Claude Code, Cursor, Windsurf and more. Browse 10000+ curated skills by category or author. Start building smarter today.

An Agentic Enterprise Platform where your knowledge base powers AI with full ownership, control, and business-friendly interfaces. Find out our product: https://aiquinta.ai/our-product/

Oculer is an end-to-end AI video engine that turns a simple text idea into a fully produced Instagram Reel, or YouTube Short. Unlike basic motion graphic tools, Oculer creates complete storytelling videos — including script, storyboard, motion graphics, sound syncing, subtitles, and final editing — automatically.

Is an all-in-one AI coding platform that allows you build apps and websites by chatting with AI. YouWare enables full-stack code generation and deployment with a shareable URL instantly. no code, no setup, no hassle.

Try Grok 4 on GPT Proto. Access xAI’s most advanced 1.7T LLM with 130K context, multimodal support, and real-time data integration for dynamic analysis.