Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
It is a tool for testing and evaluating LLM output quality. With this tool, you can systematically test prompts, models, and RAGs with predefined test cases. It can be utilized as a CLI, a library, or integrated into CI/CD pipelines. | WFGY is a verification first reasoning engine for LLMs. It ships reproducible entry points and audit friendly specifications, designed to make failures visible and fixable. WFGY 1.0 to 3.0 are one set. Each version is a different depth level, not a different product line. MIT licensed. Public demos and docs live in the repo. Start here: Event Horizon (WFGY 3.0 public entry): https://github.com/onestardao/WFGY/blob/main/TensionUniverse/EventHorizon/README.md Starter Village (fast onboarding): https://github.com/onestardao/WFGY/blob/main/StarterVillage/README.md |
Evaluate quality and catch regressions;
Speed up evaluations with caching and concurrency;
Score outputs automatically by defining test cases | Verification, Reproducibility, Auditability, Failure analysis, RAG debugging, Open source |
Statistics | |
GitHub Stars 9.0K | GitHub Stars - |
GitHub Forks 760 | GitHub Forks - |
Stacks 0 | Stacks 0 |
Followers 0 | Followers 1 |
Votes 0 | Votes 1 |
Integrations | |
| No integrations available | |

It is a cutting-edge framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, it empowers agents to work together seamlessly, tackling complex tasks.

Waxell is the AI governance plane for agentic systems in production. It sits above agents, models, and integrations, enforcing constraints and defining what's allowed. Auto-instrumentation for 200+ libraries without code changes. Real-time tracing, token and cost tracking, and 11 categories of agentic governance policy enforcement.

Discover and install agent skills for Claude Code, Cursor, Windsurf and more. Browse 10000+ curated skills by category or author. Start building smarter today.

An Agentic Enterprise Platform where your knowledge base powers AI with full ownership, control, and business-friendly interfaces. Find out our product: https://aiquinta.ai/our-product/

It is a platform for building production-grade LLM applications. It lets you debug, test, evaluate, and monitor chains and intelligent agents built on any LLM framework and seamlessly integrates with LangChain, the go-to open source framework for building with LLMs.

Find what caused your AI bill. Opsmeter gives endpoint, user, model, and prompt-level AI cost attribution in one view.

Transform basic prompts into expert-level AI instructions. Enhance, benchmark & optimize prompts for ChatGPT, Claude, Gemini & more.

Oculer is an end-to-end AI video engine that turns a simple text idea into a fully produced Instagram Reel, or YouTube Short. Unlike basic motion graphic tools, Oculer creates complete storytelling videos — including script, storyboard, motion graphics, sound syncing, subtitles, and final editing — automatically.

Is an all-in-one AI coding platform that allows you build apps and websites by chatting with AI. YouWare enables full-stack code generation and deployment with a shareable URL instantly. no code, no setup, no hassle.

The collaborative testing platform for LLM applications and agents. Your whole team defines quality requirements together, Rhesis generates thousands of test scenarios covering edge cases, simulates realistic multi-turn conversations, and delivers actionable reviews. Testing infrastructure built for Gen AI.