Amazon EMR vs Amazon Redshift vs Cloudera Enterprise

Amazon EMR: 274, 175, 49
Amazon Redshift: 778, 486, 86
Cloudera Enterprise: 60, 57, 0

No public GitHub repository is available for any of the three tools.

What is Amazon EMR?

Amazon EMR is used in a variety of applications, including log analysis, data warehousing, machine learning, financial analysis, scientific simulation, and bioinformatics.

What is Amazon Redshift?

Amazon Redshift is optimized for data sets ranging from a few hundred gigabytes to a petabyte or more. It costs less than $1,000 per terabyte per year, a tenth the cost of most traditional data warehousing solutions.
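As a rough illustration of the pricing claim above, the arithmetic can be sketched in a few lines of Python. The function names and the constant are hypothetical; the $1,000 per terabyte per year ceiling and the "a tenth the cost" ratio are taken from the description itself, not from a current AWS price list, so treat the results as back-of-the-envelope bounds only.

```python
# Upper bound quoted in the description above (USD per terabyte per year).
# Assumption: this is the description's figure, not a current list price.
QUOTED_MAX_USD_PER_TB_YEAR = 1_000

# The description says Redshift costs "a tenth" of traditional warehouses.
QUOTED_COST_RATIO = 10


def annual_cost_ceiling(terabytes: float,
                        usd_per_tb_year: float = QUOTED_MAX_USD_PER_TB_YEAR) -> float:
    """Return the quoted upper bound on yearly cost for a data set of this size."""
    if terabytes < 0:
        raise ValueError("terabytes must be non-negative")
    return terabytes * usd_per_tb_year


def implied_traditional_cost(redshift_cost: float,
                             ratio: float = QUOTED_COST_RATIO) -> float:
    """Return the traditional-warehouse cost implied by the quoted ratio."""
    return redshift_cost * ratio


if __name__ == "__main__":
    size_tb = 50  # hypothetical data set size
    ceiling = annual_cost_ceiling(size_tb)
    print(f"{size_tb} TB: under ${ceiling:,.0f} per year")
    print(f"Implied traditional cost: about ${implied_traditional_cost(ceiling):,.0f} per year")
```

Under these quoted figures, a 50 TB data set would cost less than $50,000 per year, against an implied $500,000 per year for a traditional warehouse.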

What is Cloudera Enterprise?

Cloudera Enterprise includes CDH, the world’s most popular open source Hadoop-based platform, as well as advanced system management and data management tools plus dedicated support and community advocacy from our world-class team of Hadoop developers and experts.

What are some alternatives to Amazon EMR, Amazon Redshift, and Cloudera Enterprise?

Amazon EC2
Amazon EC2 is a web service that provides resizable compute capacity in the cloud. It is designed to make web-scale computing easier for developers.

Hadoop
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

Amazon DynamoDB
With Amazon DynamoDB, you can offload the administrative burden of operating and scaling a highly available distributed database cluster, while paying a low price for only what you use.

Azure HDInsight
Azure HDInsight is a cloud-based service from Microsoft for big data analytics that helps organizations process large amounts of streaming or historical data.

Google BigQuery
Run fast, SQL-like queries against terabytes of data in seconds, using the processing power of Google's infrastructure. Bulk load your data using Google Cloud Storage, or stream it in. Access BigQuery through a browser tool, a command-line tool, or calls to the BigQuery REST API with client libraries for languages such as Java, PHP, or Python.
Decisions about Amazon EMR, Amazon Redshift, and Cloudera Enterprise

Ankit Sobti, CTO at Postman Inc · 11 upvotes · 135.9K views

Looker, Stitch, Amazon Redshift, dbt

We recently moved our Data Analytics and Business Intelligence tooling to Looker. It's already helping us create a solid process for reusable SQL-based data modeling, with consistent definitions across the entire organization. Looker allows us to collaboratively build these version-controlled models and push the limits of what we've traditionally been able to accomplish with analytics with a lean team.

For Data Engineering, we're in the process of moving from maintaining our own ETL pipelines on AWS to a managed ELT system on Stitch. We're also evaluating the command-line tool dbt to manage data transformations. Our hope is that Stitch + dbt will streamline the ELT bit, allowing us to focus our energies on analyzing data, rather than managing it.
How developers use Amazon EMR, Amazon Redshift, and Cloudera Enterprise

Olo uses Amazon Redshift
Aggressive archiving of historical data to keep the production database as small as possible, using our in-house, soon-to-be-open-sourced ETL library, SharpShifter.

Andrew La Grange uses Amazon EMR
We use Amazon EMR for all our Hadoop workloads.

Christian Moeller uses Amazon Redshift
Connected to BI (Pentaho)

Kovid Rathee uses Amazon Redshift
OLAP and BI
