Amazon EMR vs Amazon Redshift vs Qubole

Amazon EMR
Amazon EMR

215
18
49
Amazon Redshift
Amazon Redshift

579
1.4K
86
Qubole
Qubole

12
217
50
- No public GitHub repository available -
- No public GitHub repository available -
- No public GitHub repository available -

What is Amazon EMR?

Amazon EMR is used in a variety of applications, including log analysis, web indexing, data warehousing, machine learning, financial analysis, scientific simulation, and bioinformatics. Customers launch millions of Amazon EMR clusters every year.

What is Amazon Redshift?

Redshift makes it simple and cost-effective to efficiently analyze all your data using your existing business intelligence tools. It is optimized for datasets ranging from a few hundred gigabytes to a petabyte or more and costs less than $1,000 per terabyte per year, a tenth the cost of most traditional data warehousing solutions.

What is Qubole?

Qubole is a cloud based service that makes big data easy for analysts and data engineers.

Want advice about which of these to choose?Ask the StackShare community!

Why do developers choose Amazon EMR?
Why do developers choose Amazon Redshift?
Why do developers choose Qubole?
What are the cons of using Amazon EMR?
What are the cons of using Amazon Redshift?
What are the cons of using Qubole?
    Be the first to leave a con
      Be the first to leave a con
        Be the first to leave a con
        What companies use Amazon EMR?
        What companies use Amazon Redshift?
        What companies use Qubole?
        What are some alternatives to Amazon EMR, Amazon Redshift, and Qubole?
        Amazon EC2
        Amazon Elastic Compute Cloud (Amazon EC2) is a web service that provides resizable compute capacity in the cloud. It is designed to make web-scale computing easier for developers.
        Amazon DynamoDB
        All data items are stored on Solid State Drives (SSDs), and are replicated across 3 Availability Zones for high availability and durability. With DynamoDB, you can offload the administrative burden of operating and scaling a highly available distributed database cluster, while paying a low price for only what you use.
        Hadoop
        The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.
        Google BigQuery
        Run super-fast, SQL-like queries against terabytes of data in seconds, using the processing power of Google's infrastructure. Load data with ease. Bulk load your data using Google Cloud Storage or stream it in. Easy access. Access BigQuery by using a browser tool, a command-line tool, or by making calls to the BigQuery REST API with client libraries such as Java, PHP or Python.
        Snowflake
        Snowflake eliminates the administration and management demands of traditional data warehouses and big data platforms. Snowflake is a true data warehouse as a service running on Amazon Web Services (AWS)—no infrastructure to manage and no knobs to turn.
        See all alternatives
        What tools integrate with Amazon EMR?
        What tools integrate with Amazon Redshift?
        What tools integrate with Qubole?
          No integrations found
            No integrations found
            Decisions about Amazon EMR, Amazon Redshift, and Qubole
            No stack decisions found
            Interest over time
            Reviews of Amazon EMR, Amazon Redshift, and Qubole
            No reviews found
            How developers use Amazon EMR, Amazon Redshift, and Qubole
            Avatar of Pinterest
            Pinterest uses QuboleQubole

            We ultimately migrated our Hadoop jobs to Qubole, a rising player in the Hadoop as a Service space. Given that EMR had become unstable at our scale, we had to quickly move to a provider that played well with AWS (specifically, spot instances) and S3. Qubole supported AWS/S3 and was relatively easy to get started on. After vetting Qubole and comparing its performance against alternatives (including managed clusters), we decided to go with Qubole

            Avatar of Olo
            Olo uses Amazon RedshiftAmazon Redshift

            Aggressive archiving of historical data to keep the production database as small as possible. Using our in-house soon-to-be-open-sourced ETL library, SharpShifter.

            Avatar of Andrew La Grange
            Andrew La Grange uses Amazon EMRAmazon EMR

            We use Amazon EMR for all our Hadoop workloads.

            Avatar of Christian Moeller
            Christian Moeller uses Amazon RedshiftAmazon Redshift

            Connected to BI (Pentaho)

            Avatar of Kovid Rathee
            Kovid Rathee uses Amazon RedshiftAmazon Redshift

            OLAP and BI

            How much does Amazon EMR cost?
            How much does Amazon Redshift cost?
            How much does Qubole cost?
            News about Qubole
            More news