Why developers like Amazon Redshift

What is Amazon Redshift?

It is optimized for data sets ranging from a few hundred gigabytes to a petabyte or more and costs less than $1,000 per terabyte per year, a tenth the cost of most traditional data warehousing solutions.

Amazon Redshift is a tool in the Big Data as a Service category of a tech stack.

Explore Amazon Redshift's Story

Who uses Amazon Redshift?

Companies

478 companies reportedly use Amazon Redshift in their tech stacks, including Amazon, CRED, and Nubank.

Amazon

CRED

Nubank

Delivery Hero

Tech Stack

Lyft

Figma

Coursera

Intuit

Developers

1010 developers on StackShare have stated that they use Amazon Redshift.

Flagship Promotions

Bluesoft Stack

Personal Favourites

np

Test

Kobblr

My Stack

Data Engineering

Amazon Redshift Integrations

MySQL, SQLite, Metabase, Amplitude, and Oracle PL/SQL are some of the popular tools that integrate with Amazon Redshift. Here's a list of all 130 tools that integrate with Amazon Redshift.

MySQL

SQLite

Metabase

Amplitude

Oracle PL/SQL

AWS Glue

DataGrip

Dataform

dbt

Pros of Amazon Redshift

Data Warehousing

Scalable

SQL

Backed by Amazon

Encryption

Cheap and reliable

Isolation

Best Cloud DW Performance

Fast columnar storage

Decisions about Amazon Redshift

Here are some stack decisions, common use cases and reviews by companies and developers who chose Amazon Redshift in their tech stack.

Liza Shtonda

Mar 1, 2023 | 2 upvotes · 6.4K views

Needs advice

Amazon Redshift

and

MySQL

please help me understand the difference in syntax between MySQL and Amazon Redshift . They have the difference or they are completely the same?

Aditya Tyagi

Mar 14, 2021 | 8 upvotes · 17.5K views

Needs advice

Airflow

and

AWS Lambda

I have data stored in Amazon S3 bucket in parquet file format.

I want this data to be copied from S3 to Amazon Redshift, so I use copy commands to achieve this. But, I need to do this manually. I want to achieve this with some sort of automation such that if any new file comes into S3, it should be copied to the required table in redshift. Can you suggest what different approaches I can use?

Pavithra Nagaraj

Mar 12, 2020 | 3 upvotes · 514.9K views

Needs advice

Amazon Athena

Amazon Redshift

and

AWS Glue

Hi all,

Currently, we need to ingest the data from Amazon S3 to DB either Amazon Athena or Amazon Redshift. But the problem with the data is, it is in .PSV (pipe separated values) format and the size is also above 200 GB. The query performance of the timeout in Athena/Redshift is not up to the mark, too slow while compared to Google BigQuery. How would I optimize the performance and query result time? Can anyone please help me out?

See all decisions

Blog Posts

Shifting From Monitoring to Observability

Jul 9 2019 at 7:22PM

Blue Medora

2407

How LendingHome Scaled Their Marketplace To $750M In Real Esta...

Sep 2 2016 at 3:23AM

LendingHome

+43

5352

Scaling Zapier to Automate Billions of Tasks

Feb 4 2016 at 6:44PM

Zapier

+42

22322

The Stack That Helped Medium Scale To 2.6 Millennia Of Reading...

Oct 22 2015 at 8:05AM

Medium

+37

122

38278

How Cotap Is Building A HIPAA-compliant Messaging Service On A...

Jun 26 2015 at 10:14AM

Zinc

+39

1509

How 500px serves up over 500TB of high res photos

May 15 2015 at 3:43PM

500px

+44

109

50833

How The World's Largest Design Marketplace Builds and Ships Co...

May 1 2015 at 12:32PM

99designs

+35

1828

Amazon Redshift's Features

Optimized for Data Warehousing- It uses columnar storage, data compression, and zone maps to reduce the amount of IO needed to perform queries. Redshift has a massively parallel processing (MPP) architecture, parallelizing and distributing SQL operations to take advantage of all available resources.
Scalable- With a few clicks of the AWS Management Console or a simple API call, you can easily scale the number of nodes in your data warehouse up or down as your performance or capacity needs change.
No Up-Front Costs- You pay only for the resources you provision. You can choose On-Demand pricing with no up-front costs or long-term commitments, or obtain significantly discounted rates with Reserved Instance pricing.
Fault Tolerant- Amazon Redshift has multiple features that enhance the reliability of your data warehouse cluster. All data written to a node in your cluster is automatically replicated to other nodes within the cluster and all data is continuously backed up to Amazon S3.
SQL - Amazon Redshift is a SQL data warehouse and uses industry standard ODBC and JDBC connections and Postgres drivers.
Isolation - Amazon Redshift enables you to configure firewall rules to control network access to your data warehouse cluster.
Encryption – With just a couple of parameter settings, you can set up Amazon Redshift to use SSL to secure data in transit and hardware-acccelerated AES-256 encryption for data at rest.<br>

Amazon Redshift Alternatives & Comparisons

What are some alternatives to Amazon Redshift?

Google BigQuery

Run super-fast, SQL-like queries against terabytes of data in seconds, using the processing power of Google's infrastructure. Load data with ease. Bulk load your data using Google Cloud Storage or stream it in. Easy access. Access BigQuery by using a browser tool, a command-line tool, or by making calls to the BigQuery REST API with client libraries such as Java, PHP or Python.

Amazon Athena

Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.

Amazon DynamoDB

With it , you can offload the administrative burden of operating and scaling a highly available distributed database cluster, while paying a low price for only what you use.

Amazon Redshift Spectrum

With Redshift Spectrum, you can extend the analytic power of Amazon Redshift beyond data stored on local disks in your data warehouse to query vast amounts of unstructured data in your Amazon S3 “data lake” -- without having to load or transform any data.

Hadoop

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

See all alternatives

Related Comparisons