Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
ReputAgent provides A2A evaluation infrastructure. When agents work together, reputation emerges from real work—not benchmarks. We help you build that trust infrastructure. | Find what caused your AI bill. Opsmeter gives endpoint, user, model, and prompt-level AI cost attribution in one view. |
Continuous AI agent evaluation, Reputation scoring from accumulated evidence, Evaluation dimensions framework (accuracy, safety, reliability), Failure modes library with mitigations, Evaluation patterns library (LLM-as-judge, human-in-the-loop, red teaming, orchestration), Agent Playground for pre-production scrimmage testing, Ecosystem tools tracker and comparisons, Research index of agent evaluation papers, Open dataset export (CC-BY-4.0), RepKit SDK (pre-release) for logging evaluations and querying reputation, Pre-production agent testing, Agent reliability QA before launch, Ongoing evaluation in production, Safety and red teaming workflows, Routing and delegation based on trust signals, Access control and governance decisions, Comparing agent frameworks and tools, Research and benchmarking of evaluation methods, Shared vocabulary and taxonomy for agent teams | ndpoint/user/model/prompt-level AI cost attribution, Provider-agnostic telemetry ingest API, Token and latency tracking, Budget alerts (Pro+ plans), Plan-based ingest rate limits with Retry-After headers, Workspace-level dashboards and filters, Retention controls (raw + summary data) |
Statistics | |
Stacks 0 | Stacks 10 |
Followers 1 | Followers 1 |
Votes 1 | Votes 1 |
Pros & Cons | |
No community feedback yet | Pros
Cons
|

Keen is a powerful set of API's that allow you to stream, store, query, and visualize event-based data. Customer-facing metrics bring SaaS products to the next level with acquiring, engaging, and retaining customers.

Snowplow is a real-time event data pipeline that lets you track, contextualize, validate and model your customers’ behaviour across your entire digital estate.

It is a service for collecting, analyzing and visualizing custom metrics. It can be used to track anything from signups to server response times. Sending events is super simple.

Ahoy provides a solid foundation to track visits and events in Ruby, JavaScript, and native apps.

It is a cutting-edge framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, it empowers agents to work together seamlessly, tackling complex tasks.

Is the game-changing European modern data quality platform that effortlessly uncovers anomalies and errors in your data with Artificial Intelligence.

Build dashboards and reports with exactly the metrics you need using plain Python scripts. There is nothing new to learn. Bitdeli keeps your results up to date, no matter how much data you have or how complex your metrics are. Get started in minutes with our growing library of open-source analytics, created by experienced data hackers.

Discover and install agent skills for Claude Code, Cursor, Windsurf and more. Browse 10000+ curated skills by category or author. Start building smarter today.

It is a platform for building production-grade LLM applications. It lets you debug, test, evaluate, and monitor chains and intelligent agents built on any LLM framework and seamlessly integrates with LangChain, the go-to open source framework for building with LLMs.

Transform basic prompts into expert-level AI instructions. Enhance, benchmark & optimize prompts for ChatGPT, Claude, Gemini & more.