Amazon EMR vs Cloudera Enterprise vs Google BigQuery

Amazon EMR

Cloudera Enterprise

Google BigQuery

No public GitHub repository is available for any of the three tools.

What is Amazon EMR?

Amazon EMR is used in a variety of applications, including log analysis, web indexing, data warehousing, machine learning, financial analysis, scientific simulation, and bioinformatics. Customers launch millions of Amazon EMR clusters every year.

What is Cloudera Enterprise?

Cloudera Enterprise includes CDH, the world’s most popular open source Hadoop-based platform, as well as advanced system management and data management tools plus dedicated support and community advocacy from our world-class team of Hadoop developers and experts.

What is Google BigQuery?

Run super-fast, SQL-like queries against terabytes of data in seconds, using the processing power of Google's infrastructure. Load data with ease: bulk load your data using Google Cloud Storage or stream it in. Access BigQuery through a browser tool, a command-line tool, or calls to the BigQuery REST API using client libraries for languages such as Java, PHP, or Python.


What are some alternatives to Amazon EMR, Cloudera Enterprise, and Google BigQuery?
Amazon EC2
Amazon Elastic Compute Cloud (Amazon EC2) is a web service that provides resizable compute capacity in the cloud. It is designed to make web-scale computing easier for developers.
Amazon DynamoDB
All data items are stored on Solid State Drives (SSDs) and are replicated across 3 Availability Zones for high availability and durability. With DynamoDB, you can offload the administrative burden of operating and scaling a highly available distributed database cluster, while paying a low price for only what you use.
Apache Hadoop
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.
Amazon Redshift
Redshift makes it simple and cost-effective to analyze all your data using your existing business intelligence tools. It is optimized for datasets ranging from a few hundred gigabytes to a petabyte or more and costs less than $1,000 per terabyte per year, a tenth the cost of most traditional data warehousing solutions.
Snowflake
Snowflake eliminates the administration and management demands of traditional data warehouses and big data platforms. Snowflake is a true data warehouse as a service running on Amazon Web Services (AWS), with no infrastructure to manage and no knobs to turn.
How developers use Amazon EMR, Cloudera Enterprise, and Google BigQuery
ShareThis uses Google BigQuery

BigQuery allows our team to pull reports quickly using SQL-like queries against our large store of data about social sharing. We use the information throughout the company, for everything from making internal product decisions based on usage patterns to sharing certain kinds of custom reports with our publishers.

Lyndon Wong uses Google BigQuery

Aggregation of user events and traits across a marketing website, SaaS web application, user account provisioning backend, and Salesforce CRM. Enables full-funnel analysis of campaign ROI, customer acquisition, engagement, and retention at both the user and target account level.

Andrew La Grange uses Amazon EMR

We use Amazon EMR for all our Hadoop workloads.
