Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
The collaborative testing platform for LLM applications and agents. Your whole team defines quality requirements together, Rhesis generates thousands of test scenarios covering edge cases, simulates realistic multi-turn conversations, and delivers actionable reviews. Testing infrastructure built for Gen AI. | Draft is a workflow platform for generating and evaluating structured documents using AI. It processes input data, applies multi-step pipelines, and produces context-specific outputs with built-in scoring and optimization. The system maintains state across iterations and supports versioning, analysis, and workflow management. |
LLM, Agents, Testing, QA, Evals, Multi-turn, Test Case Generation, Collaboration, Reviews | Multi-step AI pipelines for document generation, structured input parsing and transformation, context-aware output generation, scoring and evaluation mechanisms, versioning and state management, workflow orchestration across tasks, browser-based data capture, real-time editing and iteration, export in multiple formats, dashboard for tracking and organization |
Statistics | |
Stacks 2 | Stacks 0 |
Followers 2 | Followers 1 |
Votes 1 | Votes 1 |

Use Airflow to author workflows as directed acyclic graphs (DAGs) of tasks. The Airflow scheduler executes your tasks on an array of workers while following the specified dependencies. Rich command lines utilities makes performing complex surgeries on DAGs a snap. The rich user interface makes it easy to visualize pipelines running in production, monitor progress and troubleshoot issues when needed.

It is a generic test automation framework for acceptance testing and acceptance test-driven development. It has easy-to-use tabular test data syntax and it utilizes the keyword-driven testing approach. Its testing capabilities can be extended by test libraries implemented either with Python or Java, and users can create new higher-level keywords from existing ones using the same syntax that is used for creating test cases.

Combines API test-automation, mocks and performance-testing into a single, unified framework. The BDD syntax popularized by Cucumber is language-neutral, and easy for even non-programmers. Besides powerful JSON & XML assertions, you can run tests in parallel for speed - which is critical for HTTP API testing.

Cucumber is a tool that supports Behaviour-Driven Development (BDD) - a software development process that aims to enhance software quality and reduce maintenance costs.

It makes it easy to automate all your software workflows, now with world-class CI/CD. Build, test, and deploy your code right from GitHub. Make code reviews, branch management, and issue triaging work the way you want.

It is a pure node.js end-to-end solution for testing web apps. It takes care of all the stages: starting browsers, running tests, gathering test results and generating reports.

It is a testing and specification framework for Java and Groovy applications. What makes it stand out from the crowd is its beautiful and highly expressive specification language. It is compatible with most IDEs, build tools, and continuous integration servers.

It is a library for writing concise, readable, boilerplate-free tests in Java using Selenium WebDriver.

Capybara helps you test web applications by simulating how a real user would interact with your app. It is agnostic about the driver running your tests and comes with Rack::Test and Selenium support built in. WebKit is supported through an external gem.

PHPUnit is a programmer-oriented testing framework for PHP. It is an instance of the xUnit architecture for unit testing frameworks.