| | LLM evaluation tool | Gaffer |
| --- | --- | --- |
| Description | A tool for testing and evaluating LLM output quality. It lets you systematically test prompts, models, and RAG pipelines against predefined test cases, and it can be used as a CLI, as a library, or inside CI/CD pipelines. | Easily host and share test reports. Gaffer saves developers time and improves test visibility. |
| Features | Evaluate quality and catch regressions; speed up evaluations with caching and concurrency; score outputs automatically by defining test cases | Report Hosting, Report AI Analysis |
| GitHub Stars | 9.0K | - |
| GitHub Forks | 760 | - |
| Stacks | 0 | 0 |
| Followers | 0 | 1 |
| Votes | 0 | 1 |
| Integrations | No integrations available | |

BrowserStack is the leading test platform built for developers & QAs to expand test coverage, scale & optimize testing with cross-browser, real device cloud, accessibility, visual testing, test management, and test observability.

TestRail helps you manage and track your software testing efforts and organize your QA department. Its intuitive web-based user interface makes it easy to create test cases, manage test runs and coordinate your entire testing process.

The most advanced, consistent, and effective AI humanizer on the market. Instantly transform AI-generated text into undetectable, human-like writing in one click.

Waxell is the AI governance plane for agentic systems in production. It sits above agents, models, and integrations, enforcing constraints and defining what's allowed. Auto-instrumentation for 200+ libraries without code changes. Real-time tracing, token and cost tracking, and 11 categories of agentic governance policy enforcement.

Manage all aspects of software quality; integrate with JIRA and various test tools, foster collaboration and gain real-time visibility.

It is a platform for building production-grade LLM applications. It lets you debug, test, evaluate, and monitor chains and intelligent agents built on any LLM framework and seamlessly integrates with LangChain, the go-to open source framework for building with LLMs.

Developer CLI tool for AI content compliance. Stamps files with provenance metadata and audits them against the EU AI Act, SOX, and HIPAA. Integrates with GitHub Actions, pre-commit, and MCP.

Transform basic prompts into expert-level AI instructions. Enhance, benchmark & optimize prompts for ChatGPT, Claude, Gemini & more.

Find what caused your AI bill. Opsmeter gives endpoint, user, model, and prompt-level AI cost attribution in one view.

A high-performance AI detection infrastructure designed to identify synthetic media. AI Detect Lab leverages advanced neural network analysis to distinguish between human-generated content and AI outputs (Midjourney v7, Stable Diffusion 3.5, DALL-E 3, Flux2.0) with 99%+ accuracy. Supports multi-language text analysis and high-resolution image processing via a streamlined web interface.