Apr 12, 2020
Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
Kafka is a distributed, partitioned, replicated commit log service. It provides the functionality of a messaging system, but with a unique design. | Snowplow is a real-time event data pipeline that lets you track, contextualize, validate and model your customers’ behaviour across your entire digital estate. |
Written at LinkedIn in Scala;Used by LinkedIn to offload processing of all page and other views;Defaults to using persistence, uses OS disk cache for hot data (has higher throughput then any of the above having persistence enabled);Supports both on-line as off-line processing | Track rich events from your websites, mobile apps, server-side systems, third party systems and any type of connected device, so that you have a record of what happened, when, and to whom;Load your data into your data warehouse of choice to power sophisticated analytics;Process your data including validating, enriching and modeling it;Your data is available in real-time via Amazon Kinesis, Google Pub/Sub and BigQuery to power real-time applications and reports;Your data pipeline is running in your cloud environment giving you full ownership and control of your data |
Statistics | |
GitHub Stars 31.2K | GitHub Stars 7.0K |
GitHub Forks 14.8K | GitHub Forks 1.2K |
Stacks 24.2K | Stacks 132 |
Followers 22.3K | Followers 174 |
Votes 607 | Votes 35 |
Pros & Cons | |
Pros
Cons
| Pros
|
Integrations | |
| No integrations available | |

RabbitMQ gives your applications a common platform to send and receive messages, and your messages a safe place to live until received.

Celery is an asynchronous task queue/job queue based on distributed message passing. It is focused on real-time operation, but supports scheduling as well.

Keen is a powerful set of API's that allow you to stream, store, query, and visualize event-based data. Customer-facing metrics bring SaaS products to the next level with acquiring, engaging, and retaining customers.

Transmit any volume of data, at any level of throughput, without losing messages or requiring other services to be always available. With SQS, you can offload the administrative burden of operating and scaling a highly available messaging cluster, while paying a low price for only what you use.

NSQ is a realtime distributed messaging platform designed to operate at scale, handling billions of messages per day. It promotes distributed and decentralized topologies without single points of failure, enabling fault tolerance and high availability coupled with a reliable message delivery guarantee. See features & guarantees.

Apache ActiveMQ is fast, supports many Cross Language Clients and Protocols, comes with easy to use Enterprise Integration Patterns and many advanced features while fully supporting JMS 1.1 and J2EE 1.4. Apache ActiveMQ is released under the Apache 2.0 License.

The 0MQ lightweight messaging kernel is a library which extends the standard socket interfaces with features traditionally provided by specialised messaging middleware products. 0MQ sockets provide an abstraction of asynchronous message queues, multiple messaging patterns, message filtering (subscriptions), seamless access to multiple transport protocols and more.

An easy to use, powerful, and reliable system to process and distribute data. It supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic.

Gearman allows you to do work in parallel, to load balance processing, and to call functions between languages. It can be used in a variety of applications, from high-availability web sites to the transport of database replication events.

Highly scalable and effortless data streaming platform. Made to enable developers and data teams to collaborate and build real-time and streaming apps fast.