Druid

239
475
+ 1
22
Apache Spark

1.9K
2K
+ 1
121
Add tool
Pros of Druid
Pros of Apache Spark
Cons of Druid
Cons of Apache Spark

What is Druid?

Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power exploratory dashboards in multi-tenant environments. Druid excels as a data warehousing solution for fast aggregate queries on petabyte sized data sets. Druid supports a variety of flexible filters, exact calculations, approximate algorithms, and other useful calculations.

What is Apache Spark?

Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters through YARN or Spark's standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both batch processing (similar to MapReduce) and new workloads like streaming, interactive queries, and machine learning.
What companies use Druid?
What companies use Apache Spark?
What tools integrate with Druid?
What tools integrate with Apache Spark?
Interest over time