The weights and architecture of Mixture-of-Experts model, Grok-1 (By xAI)
Creating safe artificial general intelligence that benefits all of humanity. Our work to create safe and beneficial AI requires a deep understanding of the potential risks and benefits, as well as careful consideration of the impact. | It contains code and insights to fine-tune various large language models (LLMs) for different use cases. It also provides an evaluation framework to compare the performance, time, cost, and inference of different LLMs, both open-source and closed-source. It aims to make fine-tuning LLMs easier and more accessible. |
Pioneering research on the path to AGI;
Transforming work and creativity with AI | Fine-tune various large language models (LLMs) for different use cases;
Provides an evaluation framework that measures the performance, time, cost, and inference |
Statistics | |
Stacks 685 | Stacks 0 |
Followers 192 | Followers 2 |
Votes 0 | Votes 0 |
Integrations | |

It is the base model weights and network architecture of Grok-1, the large language model. Grok-1 is a 314 billion parameter Mixture-of-Experts model trained from scratch by xAI.

It is a platform for building production-grade LLM applications. It lets you debug, test, evaluate, and monitor chains and intelligent agents built on any LLM framework and seamlessly integrates with LangChain, the go-to open source framework for building with LLMs.

It is a new scripting language to automate your interaction with a Large Language Model (LLM), namely OpenAI. The ultimate goal is to create a natural language programming experience. The syntax of GPTScript is largely natural language, making it very easy to learn and use.

It is a framework built around LLMs. It can be used for chatbots, generative question-answering, summarization, and much more. The core idea of the library is that we can “chain” together different components to create more advanced use cases around LLMs.

It is an open-source library designed to help developers build conversational streaming user interfaces in JavaScript and TypeScript. The SDK supports React/Next.js, Svelte/SvelteKit, and Vue/Nuxt as well as Node.js, Serverless, and the Edge Runtime.

It is a next-generation AI assistant. It is accessible through chat interface and API. It is capable of a wide variety of conversational and text-processing tasks while maintaining a high degree of reliability and predictability.

Build, train, and deploy state of the art models powered by the reference open source in machine learning.

It is Google’s largest and most capable AI model. It is built to be multimodal, it can generalize, understand, operate across, and combine different types of info — like text, images, audio, video, and code.

It is a state-of-the-art foundational large language model designed to help researchers advance their work in this subfield of AI.

It allows you to run open-source large language models, such as Llama 2, locally.