StackShareStackShare
Follow on
StackShare

Discover and share technology stacks from companies around the world.

Product

  • Stacks
  • Tools
  • Companies
  • Feed

Company

  • About
  • Blog
  • Contact

Legal

  • Privacy Policy
  • Terms of Service

© 2025 StackShare. All rights reserved.

API StatusChangelog
Qubole
ByQuboleQubole

Qubole

#156in Databases
Stacks35Discussions2
Followers104
OverviewDiscussions2

What is Qubole?

Qubole is a cloud based service that makes big data easy for analysts and data engineers.

Qubole is a tool in the Databases category of a tech stack.

Key Features

Intuitive GUIOptimized HiveImproved S3 PerformanceAuto ScalingSpot Instance PricingManaged ClustersCloud IntegrationCluster Lifecycle Management

Qubole Pros & Cons

Pros of Qubole

  • ✓Simple UI and autoscaling clusters
  • ✓Feature to use AWS Spot pricing
  • ✓Optimized Spark, Hive, Presto, Hadoop 2, HBase clusters
  • ✓Real-time data insights through Spark Notebook
  • ✓Easy to configure, deploy, and run Hadoop clusters
  • ✓Easy to manage costs
  • ✓Hyper elastic and scalable
  • ✓Backed by Amazon
  • ✓Gracefully Scale up & down with zero human intervention
  • ✓All-in-one platform

Cons of Qubole

No cons listed yet.

Qubole Alternatives & Comparisons

What are some alternatives to Qubole?

Google BigQuery

Google BigQuery

Run super-fast, SQL-like queries against terabytes of data in seconds, using the processing power of Google's infrastructure. Load data with ease. Bulk load your data using Google Cloud Storage or stream it in. Easy access. Access BigQuery by using a browser tool, a command-line tool, or by making calls to the BigQuery REST API with client libraries such as Java, PHP or Python.

Amazon Redshift

Amazon Redshift

It is optimized for data sets ranging from a few hundred gigabytes to a petabyte or more and costs less than $1,000 per terabyte per year, a tenth the cost of most traditional data warehousing solutions.

Snowflake

Snowflake

Snowflake eliminates the administration and management demands of traditional data warehouses and big data platforms. Snowflake is a true data warehouse as a service running on Amazon Web Services (AWS)—no infrastructure to manage and no knobs to turn.

Amazon EMR

Amazon EMR

It is used in a variety of applications, including log analysis, data warehousing, machine learning, financial analysis, scientific simulation, and bioinformatics.

Stitch

Stitch

Stitch is a simple, powerful ETL service built for software developers. Stitch evolved out of RJMetrics, a widely used business intelligence platform. When RJMetrics was acquired by Magento in 2016, Stitch was launched as its own company.

Cloudera Enterprise

Cloudera Enterprise

Cloudera Enterprise includes CDH, the world’s most popular open source Hadoop-based platform, as well as advanced system management and data management tools plus dedicated support and community advocacy from our world-class team of Hadoop developers and experts.

Qubole Integrations

Google Compute Engine, Google Compute Engine, Microsoft Azure, Redash, PeriscopeData are some of the popular tools that integrate with Qubole. Here's a list of all 5 tools that integrate with Qubole.

Google Compute Engine
Google Compute Engine
Google Compute Engine
Google Compute Engine
Microsoft Azure
Microsoft Azure
Redash
Redash
PeriscopeData
PeriscopeData

Qubole Discussions

Discover why developers choose Qubole. Read real-world technical decisions and stack choices from the StackShare community.Showing 2 of 3 discussions.

StackShare Editors
StackShare Editors

Jul 11, 2014

Big Data at with Hadoop, Hive, and Quoble

Needs adviceonPuppet LabsPuppet LabsHadoopHadoopQuboleQubole

By mid-2014, around the time of the Series F, Pinterest users had already created more than 30 billion Pins, and the company was logging around 20 terabytes of new data daily, with around 10 petabytes of data in S3. To drive personalization for its users, and to empower engineers to build big data applications quickly, the data team built a self-serve Hadoop platform.

To start, they decoupled compute from storage, which meant teams would have to worry less about loading or synchronizing data, allowing existing or future clusters to make use of the data across a single shared file system.

A centralized Hive metastore act as the source of truth. They chose Hive for most of their Hadoop jobs “primarily because the SQL interface is simple and familiar to people across the industry.”

Dependency management takes place across three layers: *** Baked AMIs**, which are large slow-loading dependencies pre-loaded on images; Automated Configurations (Masterless Puppets), which allows Puppet clients to “pull their configuration from S3 and set up a service that’s responsible for keeping S3 configurations in sync with the Puppet master;” and Runtime Staging on S3, which creates a working directory at runtime for each developer that pulls down its dependencies directly from S3.

Finally, they migrated their Hadoop jobs to Qubole, which “supported AWS/S3 and was relatively easy to get started on.”

0 views0
Comments
John Egan
John Egan

Feb 13, 2014

Needs adviceonQuboleQubole

We ultimately migrated our Hadoop jobs to Qubole, a rising player in the Hadoop as a Service space. Given that EMR had become unstable at our scale, we had to quickly move to a provider that played well with AWS (specifically, spot instances) and S3. Qubole supported AWS/S3 and was relatively easy to get started on. After vetting Qubole and comparing its performance against alternatives (including managed clusters), we decided to go with Qubole Qubole

0 views0
Comments
View all 3 discussions

Try It

Visit Website

Adoption

On StackShare

Companies
5
PMKSM
Developers
32
RLHBSD+26