Need advice about which tool to choose?Ask the StackShare community!

Amazon Redshift

Stacks1.5K

Followers1.4K

+ 1

Votes108

Druid

Stacks382

Followers867

+ 1

Votes32

Add tool

Amazon Redshift vs Druid: What are the differences?

Introduction:

When comparing Amazon Redshift and Druid, it's essential to understand the key differences between the two data storage and processing solutions. Both are popular choices for handling large volumes of data and providing analytical capabilities, but they have distinct features that make them suitable for different use cases.

Architecture: Amazon Redshift is a fully managed data warehouse service that uses a columnar storage architecture optimized for complex queries and high-performance analytics. In contrast, Druid is a distributed, column-oriented, real-time data store designed to handle high data ingestion rates and provide sub-second queries for time-series data.
Query Processing: Amazon Redshift uses traditional SQL queries and can handle complex joins, aggregations, and window functions efficiently. On the other hand, Druid supports SQL-like queries along with Apache Druid Query Language (DQL) for real-time and interactive analytics. It provides faster query response times for time-series data by utilizing a specialized query engine.
Data Ingestion: Amazon Redshift allows data to be loaded from various sources using tools like AWS Glue, Amazon Kinesis, and Amazon S3. Druid is designed for real-time data ingestion and can directly ingest streaming data from sources like Kafka and Apache Storm. It supports continuous data ingestion and enables interactive analytics on fresh data.
Scalability: Amazon Redshift offers on-demand scalability by automatically managing storage expansion, compute resources, and query optimization. Druid is horizontally scalable and can be easily scaled out by adding more nodes to the cluster, providing the ability to handle massive data sets and an increasing number of queries.
Use Cases: Amazon Redshift is well-suited for traditional data warehousing and business intelligence workloads where ad-hoc queries, reporting, and dashboarding are required. Druid is ideal for use cases that demand real-time analytics, event-driven architectures, time-series data analysis, and interactive dashboarding with near real-time insights.

In Summary, Amazon Redshift and Druid cater to different data processing and analytics requirements, with Redshift excelling in traditional data warehousing tasks and Druid providing real-time analytics capabilities for time-series data and event-driven applications.

Advice on Amazon Redshift and Druid

datocrats-org

Jul 29, 2020 | 5 upvotes · 309.9K views

Needs advice

Amazon Redshift

AWS Glue

and

Dremio

We need to perform ETL from several databases into a data warehouse or data lake. We want to

keep raw and transformed data available to users to draft their own queries efficiently
give users the ability to give custom permissions and SSO
move between open-source on-premises development and cloud-based production environments

We want to use inexpensive Amazon EC2 instances only on medium-sized data set 16GB to 32GB feeding into Tableau Server or PowerBI for reporting and data analysis purposes.

Replies (3)

John Nguyen

at kreuzwerker · Aug 13, 2020 | 4 upvotes · 241.8K views

Recommends

Airflow

AWS Lambda

You could also use AWS Lambda and use Cloudwatch event schedule if you know when the function should be triggered. The benefit is that you could use any language and use the respective database client.

But if you orchestrate ETLs then it makes sense to use Apache Airflow. This requires Python knowledge.

Raj Chandrasekaran

Aug 7, 2020 | 3 upvotes · 249.2K views

Recommends

Airflow

Though we have always built something custom, Apache airflow (https://airflow.apache.org/) stood out as a key contender/alternative when it comes to open sources. On the commercial offering, Amazon Redshift combined with Amazon Kinesis (for complex manipulations) is great for BI, though Redshift as such is expensive.

bobby huang

Aug 14, 2020 | 0 upvotes · 235.5K views

Recommends

You may want to look into a Data Virtualization product called Conduit. It connects to disparate data sources in AWS, on prem, Azure, GCP, and exposes them as a single unified Spark SQL view to PowerBI (direct query) or Tableau. Allows auto query and caching policies to enhance query speeds and experience. Has a GPU query engine and optimized Spark for fallback. Can be deployed on your AWS VM or on prem, scales up and out. Sounds like the ideal solution to your needs.

Manage your open source components, licenses, and vulnerabilities

Learn More

Pros of Amazon Redshift

Pros of Druid

41
Data Warehousing
27
Scalable
17
SQL
14
Backed by Amazon
5
Encryption
1
Cheap and reliable
1
Isolation
1
Best Cloud DW Performance
1
Fast columnar storage

15
Real Time Aggregations
6
Batch and Real-Time Ingestion
5
OLAP
3
OLAP + OLTP
2
Combining stream and historical analytics
1
OLTP

Sign up to add or upvote prosMake informed product decisions

Cons of Amazon Redshift

Cons of Druid

Be the first to leave a con

3
Limited sql support
2
Joins are not supported well
1
Complexity

Sign up to add or upvote consMake informed product decisions

What is Amazon Redshift?

It is optimized for data sets ranging from a few hundred gigabytes to a petabyte or more and costs less than $1,000 per terabyte per year, a tenth the cost of most traditional data warehousing solutions.

What is Druid?

Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power exploratory dashboards in multi-tenant environments. Druid excels as a data warehousing solution for fast aggregate queries on petabyte sized data sets. Druid supports a variety of flexible filters, exact calculations, approximate algorithms, and other useful calculations.

Need advice about which tool to choose?Ask the StackShare community!

What companies use Amazon Redshift?

What companies use Druid?

Manage your open source components, licenses, and vulnerabilities

Learn More

Sign up to get full access to all the companiesMake informed product decisions

What tools integrate with Amazon Redshift?

What tools integrate with Druid?

Sign up to get full access to all the tool integrationsMake informed product decisions

What are some alternatives to Amazon Redshift and Druid?

Google BigQuery

Run super-fast, SQL-like queries against terabytes of data in seconds, using the processing power of Google's infrastructure. Load data with ease. Bulk load your data using Google Cloud Storage or stream it in. Easy access. Access BigQuery by using a browser tool, a command-line tool, or by making calls to the BigQuery REST API with client libraries such as Java, PHP or Python.

Amazon Athena

Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.

Amazon DynamoDB

With it , you can offload the administrative burden of operating and scaling a highly available distributed database cluster, while paying a low price for only what you use.

Amazon Redshift Spectrum

With Redshift Spectrum, you can extend the analytic power of Amazon Redshift beyond data stored on local disks in your data warehouse to query vast amounts of unstructured data in your Amazon S3 “data lake” -- without having to load or transform any data.

Hadoop

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

See all alternatives

Amazon Redshift vs Druid

Need advice about which tool to choose?Ask the StackShare community!

Amazon Redshift vs Druid: What are the differences?

Pros of Amazon Redshift

Pros of Druid

Sign up to add or upvote prosMake informed product decisions

Cons of Amazon Redshift

Cons of Druid

Sign up to add or upvote consMake informed product decisions

What is Amazon Redshift?

What is Druid?

Need advice about which tool to choose?Ask the StackShare community!

What companies use Amazon Redshift?

What companies use Druid?

Sign up to get full access to all the companiesMake informed product decisions

What tools integrate with Amazon Redshift?

What tools integrate with Druid?

Sign up to get full access to all the tool integrationsMake informed product decisions

Related Comparisons

Trending Comparisons

Top Comparisons