Celery vs Kafka vs RabbitMQ

Overview

RabbitMQ

Stacks21.8K

Followers18.9K

Votes558

GitHub Stars13.2K

Forks4.0K

Kafka

Stacks24.2K

Followers22.3K

Votes607

GitHub Stars31.2K

Forks14.8K

Celery

Stacks1.7K

Followers1.6K

Votes280

GitHub Stars27.5K

Forks4.9K

Celery vs Kafka vs RabbitMQ: What are the differences?

Introduction

This markdown code provides a comparison of key differences between Celery, Kafka, and RabbitMQ, including their functionalities, features, and use cases.

Concurrency Model: Celery is a distributed task queue that follows a traditional task-oriented concurrency model, allowing asynchronous execution of tasks. Kafka, on the other hand, follows a publish-subscribe model with event sourcing capabilities, making it suitable for real-time stream processing. RabbitMQ follows the message queuing concept, enabling communication between multiple applications or services asynchronously.
Message Persistence: Kafka provides durable and fault-tolerant message persistence, making it suitable for storing large volumes of streaming data for extended periods. Celery comes with built-in support for both task result and task state persistence, ensuring reliability and resilience. RabbitMQ also offers message persistence by allowing messages to be stored on disk or in-memory based on the configuration.
Scalability: Celery is highly scalable, allowing distributed task processing across multiple workers and brokers, making it suitable for handling high-volume workloads. Kafka is designed for horizontal scalability due to its distributed nature, allowing seamless scaling by adding more brokers or partitions. RabbitMQ provides horizontal scalability by employing clustering, enabling high availability and load balancing.
Message Delivery Guarantees: Kafka ensures at-least-once delivery semantics, where messages are guaranteed to be delivered or persisted at least once, ensuring fault tolerance. Celery, by default, provides at-least-once delivery semantics by acknowledging task execution after it's completed. RabbitMQ offers configurable delivery guarantees between applications, including at-most-once or at-least-once, depending on the requirements.
Message Ordering: Kafka ensures strict message ordering within a single partition, which makes it ideal for maintaining event consistency. Celery provides ordering guarantees for tasks within a single queue or channel but doesn't enforce ordering across different queues. RabbitMQ guarantees message ordering within a single queue, ensuring sequential processing.
Supported Languages: Celery supports multiple programming languages, including Python, Ruby, Java, and C#, making it suitable for heterogeneous environments. Kafka provides client libraries for various programming languages, including Java, Python, and Scala, enabling developers to use their preferred language. RabbitMQ offers official client libraries for popular programming languages like Python, Java, and .NET, providing flexibility in language choice.

In summary, Celery is a task queue focused on distributed task processing and asynchronous execution, Kafka is a distributed streaming platform suited for real-time event processing, and RabbitMQ is a flexible message broker enabling asynchronous communication between multiple applications or services based on different messaging patterns.

Share your Stack

Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.

View Docs

CLI (Node.js)

Manual

Advice on RabbitMQ, Kafka, Celery

Pulkit

Software Engineer

Oct 30, 2020

Needs adviceon

Django

Amazon SQS

RabbitMQ

Hi! I am creating a scraping system in Django, which involves long running tasks between 1 minute & 1 Day. As I am new to Message Brokers and Task Queues, I need advice on which architecture to use for my system. ( Amazon SQS, RabbitMQ, or Celery). The system should be autoscalable using Kubernetes(K8) based on the number of pending tasks in the queue.

474k views474k

Comments

Meili

Software engineer at Digital Science

Sep 24, 2020

Needs adviceon

ZeroMQ

RabbitMQ

Amazon SQS

Hi, we are in a ZMQ set up in a push/pull pattern, and we currently start to have more traffic and cases that the service is unavailable or stuck. We want to:

Not loose messages in services outages
Safely restart service without losing messages (@{ZeroMQ}|tool:1064| seems to need to close the socket in the receiver before restart manually)

Do you have experience with this setup with ZeroMQ? Would you suggest RabbitMQ or Amazon SQS (we are in AWS setup) instead? Something else?

Thank you for your time

500k views500k

Comments

André

Technology Manager at GS1 Portugal - Codipor

Jul 30, 2020

Needs adviceon

.NET Core

Hello dear developers, our company is starting a new project for a new Web App, and we are currently designing the Architecture (we will be using .NET Core). We want to embark on something new, so we are thinking about migrating from a monolithic perspective to a microservices perspective. We wish to containerize those microservices and make them independent from each other. Is it the best way for microservices to communicate with each other via ESB, or is there a new way of doing this? Maybe complementing with an API Gateway? Can you recommend something else different than the two tools I provided?

We want something good for Cost/Benefit; performance should be high too (but not the primary constraint).

Thank you very much in advance :)

461k views461k

Comments

Detailed Comparison

RabbitMQ	Kafka	Celery
RabbitMQ gives your applications a common platform to send and receive messages, and your messages a safe place to live until received.	Kafka is a distributed, partitioned, replicated commit log service. It provides the functionality of a messaging system, but with a unique design.	Celery is an asynchronous task queue/job queue based on distributed message passing. It is focused on real-time operation, but supports scheduling as well.
Robust messaging for applications;Easy to use;Runs on all major operating systems;Supports a huge number of developer platforms;Open source and commercially supported	Written at LinkedIn in Scala;Used by LinkedIn to offload processing of all page and other views;Defaults to using persistence, uses OS disk cache for hot data (has higher throughput then any of the above having persistence enabled);Supports both on-line as off-line processing	-
Statistics
GitHub Stars 13.2K	GitHub Stars 31.2K	GitHub Stars 27.5K
GitHub Forks 4.0K	GitHub Forks 14.8K	GitHub Forks 4.9K
Stacks 21.8K	Stacks 24.2K	Stacks 1.7K
Followers 18.9K	Followers 22.3K	Followers 1.6K
Votes 558	Votes 607	Votes 280
Pros & Cons
Pros 235 It's fast and it works with good metrics/monitoring 80 Ease of configuration 60 I like the admin interface 52 Easy to set-up and start with 22 Durable Cons 9 Too complicated cluster/HA config and management 6 Needs Erlang runtime. Need ops good with Erlang runtime 5 Configuration must be done first, not by your code 4 Slow	Pros 126 High-throughput 119 Distributed 92 Scalable 86 High-Performance 66 Durable Cons 32 Non-Java clients are second-class citizens 29 Needs Zookeeper 9 Operational difficulties 5 Terrible Packaging	Pros 99 Task queue 63 Python integration 40 Django integration 30 Scheduled Task 19 Publish/subsribe Cons 4 Sometimes loses tasks 1 Depends on broker

What are some alternatives to RabbitMQ, Kafka, Celery?

Amazon SQS

Transmit any volume of data, at any level of throughput, without losing messages or requiring other services to be always available. With SQS, you can offload the administrative burden of operating and scaling a highly available messaging cluster, while paying a low price for only what you use.

NSQ

NSQ is a realtime distributed messaging platform designed to operate at scale, handling billions of messages per day. It promotes distributed and decentralized topologies without single points of failure, enabling fault tolerance and high availability coupled with a reliable message delivery guarantee. See features & guarantees.

ActiveMQ

Apache ActiveMQ is fast, supports many Cross Language Clients and Protocols, comes with easy to use Enterprise Integration Patterns and many advanced features while fully supporting JMS 1.1 and J2EE 1.4. Apache ActiveMQ is released under the Apache 2.0 License.

ZeroMQ

The 0MQ lightweight messaging kernel is a library which extends the standard socket interfaces with features traditionally provided by specialised messaging middleware products. 0MQ sockets provide an abstraction of asynchronous message queues, multiple messaging patterns, message filtering (subscriptions), seamless access to multiple transport protocols and more.

Apache NiFi

An easy to use, powerful, and reliable system to process and distribute data. It supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic.

Gearman

Gearman allows you to do work in parallel, to load balance processing, and to call functions between languages. It can be used in a variety of applications, from high-availability web sites to the transport of database replication events.

Memphis

Highly scalable and effortless data streaming platform. Made to enable developers and data teams to collaborate and build real-time and streaming apps fast.

IronMQ

An easy-to-use highly available message queuing service. Built for distributed cloud applications with critical messaging needs. Provides on-demand message queuing with advanced features and cloud-optimized performance.

Apache Pulsar

Apache Pulsar is a distributed messaging solution developed and released to open source at Yahoo. Pulsar supports both pub-sub messaging and queuing in a platform designed for performance, scalability, and ease of development and operation.

Confluent

It is a data streaming platform based on Apache Kafka: a full-scale streaming platform, capable of not only publish-and-subscribe, but also the storage and processing of data within the stream

Related Comparisons

Celery vs Kafka vs RabbitMQ: What are the differences?

Introduction

This markdown code provides a comparison of key differences between Celery, Kafka, and RabbitMQ, including their functionalities, features, and use cases.

Concurrency Model: Celery is a distributed task queue that follows a traditional task-oriented concurrency model, allowing asynchronous execution of tasks. Kafka, on the other hand, follows a publish-subscribe model with event sourcing capabilities, making it suitable for real-time stream processing. RabbitMQ follows the message queuing concept, enabling communication between multiple applications or services asynchronously.
Message Persistence: Kafka provides durable and fault-tolerant message persistence, making it suitable for storing large volumes of streaming data for extended periods. Celery comes with built-in support for both task result and task state persistence, ensuring reliability and resilience. RabbitMQ also offers message persistence by allowing messages to be stored on disk or in-memory based on the configuration.
Scalability: Celery is highly scalable, allowing distributed task processing across multiple workers and brokers, making it suitable for handling high-volume workloads. Kafka is designed for horizontal scalability due to its distributed nature, allowing seamless scaling by adding more brokers or partitions. RabbitMQ provides horizontal scalability by employing clustering, enabling high availability and load balancing.
Message Delivery Guarantees: Kafka ensures at-least-once delivery semantics, where messages are guaranteed to be delivered or persisted at least once, ensuring fault tolerance. Celery, by default, provides at-least-once delivery semantics by acknowledging task execution after it's completed. RabbitMQ offers configurable delivery guarantees between applications, including at-most-once or at-least-once, depending on the requirements.
Message Ordering: Kafka ensures strict message ordering within a single partition, which makes it ideal for maintaining event consistency. Celery provides ordering guarantees for tasks within a single queue or channel but doesn't enforce ordering across different queues. RabbitMQ guarantees message ordering within a single queue, ensuring sequential processing.
Supported Languages: Celery supports multiple programming languages, including Python, Ruby, Java, and C#, making it suitable for heterogeneous environments. Kafka provides client libraries for various programming languages, including Java, Python, and Scala, enabling developers to use their preferred language. RabbitMQ offers official client libraries for popular programming languages like Python, Java, and .NET, providing flexibility in language choice.

Celery vs Kafka vs RabbitMQ

Overview

Celery vs Kafka vs RabbitMQ: What are the differences?