It is a tool that enables fast and efficient local LLM finetuning. It uses a manual autograd engine and Flash Attention v2 to achieve 2-5x speedup and 50% memory reduction compared to QLoRA, without compromising accuracy. | It is an SDK that integrates Large Language Models (LLMs) like OpenAI, Azure OpenAI, and Hugging Face with conventional programming languages like C#, Python, and Java. It achieves this by allowing you to define plugins that can be chained together in just a few lines of code. |
2x to 5x faster than QLoRA. 50% less memory usage;
All kernels written in OpenAI's Triton language;
0% loss in accuracy - no approximation methods - all exact;
No change of hardware necessary | SDK that integrates LLMs with conventional programming languages;
Has the ability to automatically orchestrate plugins with AI;
Highly extensible |
Statistics | |
GitHub Stars 47.8K | GitHub Stars 26.6K |
GitHub Forks 3.9K | GitHub Forks 4.3K |
Stacks 2 | Stacks 6 |
Followers 2 | Followers 4 |
Votes 0 | Votes 0 |
Integrations | |

That transforms AI-generated content into natural, undetectable human-like writing. Bypass AI detection systems with intelligent text humanization technology

It is a framework built around LLMs. It can be used for chatbots, generative question-answering, summarization, and much more. The core idea of the library is that we can “chain” together different components to create more advanced use cases around LLMs.

It allows you to run open-source large language models, such as Llama 2, locally.

It is a project that provides a central interface to connect your LLMs with external data. It offers you a comprehensive toolset trading off cost and performance.

It is a library for building stateful, multi-actor applications with LLMs, built on top of (and intended to be used with) LangChain. It extends the LangChain Expression Language with the ability to coordinate multiple chains (or actors) across multiple steps of computation in a cyclic manner.

It is a platform for building production-grade LLM applications. It lets you debug, test, evaluate, and monitor chains and intelligent agents built on any LLM framework and seamlessly integrates with LangChain, the go-to open source framework for building with LLMs.

It is a new scripting language to automate your interaction with a Large Language Model (LLM), namely OpenAI. The ultimate goal is to create a natural language programming experience. The syntax of GPTScript is largely natural language, making it very easy to learn and use.

It is an open-source library designed to help developers build conversational streaming user interfaces in JavaScript and TypeScript. The SDK supports React/Next.js, Svelte/SvelteKit, and Vue/Nuxt as well as Node.js, Serverless, and the Edge Runtime.

Build, train, and deploy state of the art models powered by the reference open source in machine learning.

It is an open-source embedding database. Chroma makes it easy to build LLM apps by making knowledge, facts, and skills pluggable for LLMs.