Need advice about which tool to choose?Ask the StackShare community!

Google BigQuery

1.7K
1.5K
+ 1
152
Treasure Data

28
44
+ 1
5
Add tool

Google BigQuery vs Treasure Data: What are the differences?

Introduction

In this article, we will compare and highlight the key differences between Google BigQuery and Treasure Data. Both are powerful cloud-based data warehousing solutions, but they have distinct features and functionalities that make them unique.

  1. Scalability and Performance: Google BigQuery is known for its unmatched scalability and performance. It can handle massive datasets and process queries at lightning-fast speeds. It uses distributed architecture and parallel execution to deliver efficient query results. On the other hand, Treasure Data also offers scalability, but it may not provide the same level of performance as BigQuery when dealing with extremely large datasets.

  2. Data Integration and Flexibility: Google BigQuery seamlessly integrates with various Google Cloud Platform services, including data ingestion tools like Cloud Dataflow and Data Fusion. It also supports direct integration with popular data sources like Google Analytics, Google Ads, and more. Treasure Data, on the other hand, provides a flexible data pipeline infrastructure that can connect to a wide range of databases, data sources, and third-party tools, enabling easy data integration.

  3. SQL Dialect: Google BigQuery uses a modified version of SQL called "BigQuery SQL," which offers advanced analytical features like window functions, nested queries, and user-defined functions. It also provides support for standard SQL syntax. In contrast, Treasure Data primarily uses Presto, a SQL engine designed for distributed querying, which offers standard SQL functionalities.

  4. Pricing Model: Google BigQuery has a pricing model based on the amount of data processed in queries and storage usage. It offers different pricing tiers and options to suit different user requirements. Treasure Data, on the other hand, follows a different pricing model based on data volume ingested and retained, providing more flexibility when it comes to cost management.

  5. Managed vs. Self-Managed: Google BigQuery is a fully managed service, which means Google takes care of infrastructure maintenance, security, and updates. Users can focus on querying and analyzing data without worrying about underlying infrastructure. On the other hand, Treasure Data provides a self-managed data warehousing solution, giving users more control over their infrastructure and allowing customization according to their specific requirements.

  6. Ecosystem and Community: Google BigQuery has a robust ecosystem with strong community support. It provides comprehensive documentation, tutorials, and resources to help users get started quickly. It also has a wide range of partners offering integrations and extensions. Treasure Data, while also having a supportive community, may have a smaller ecosystem compared to BigQuery, which may limit the availability of ready-made connectors or extensions for specific use cases.

In summary, Google BigQuery offers exceptional scalability, performance, and integration capabilities with the Google Cloud Platform ecosystem. It has advanced analytical features and a fully managed infrastructure. On the other hand, Treasure Data provides flexibility in data integration, a self-managed infrastructure, and a pricing model based on data volume. The choice between the two depends on specific requirements and preferences.

Decisions about Google BigQuery and Treasure Data
Julien Lafont

Cloud Data-warehouse is the centerpiece of modern Data platform. The choice of the most suitable solution is therefore fundamental.

Our benchmark was conducted over BigQuery and Snowflake. These solutions seem to match our goals but they have very different approaches.

BigQuery is notably the only 100% serverless cloud data-warehouse, which requires absolutely NO maintenance: no re-clustering, no compression, no index optimization, no storage management, no performance management. Snowflake requires to set up (paid) reclustering processes, to manage the performance allocated to each profile, etc. We can also mention Redshift, which we have eliminated because this technology requires even more ops operation.

BigQuery can therefore be set up with almost zero cost of human resources. Its on-demand pricing is particularly adapted to small workloads. 0 cost when the solution is not used, only pay for the query you're running. But quickly the use of slots (with monthly or per-minute commitment) will drastically reduce the cost of use. We've reduced by 10 the cost of our nightly batches by using flex slots.

Finally, a major advantage of BigQuery is its almost perfect integration with Google Cloud Platform services: Cloud functions, Dataflow, Data Studio, etc.

BigQuery is still evolving very quickly. The next milestone, BigQuery Omni, will allow to run queries over data stored in an external Cloud platform (Amazon S3 for example). It will be a major breakthrough in the history of cloud data-warehouses. Omni will compensate a weakness of BigQuery: transferring data in near real time from S3 to BQ is not easy today. It was even simpler to implement via Snowflake's Snowpipe solution.

We also plan to use the Machine Learning features built into BigQuery to accelerate our deployment of Data-Science-based projects. An opportunity only offered by the BigQuery solution

See more
Manage your open source components, licenses, and vulnerabilities
Learn More
Pros of Google BigQuery
Pros of Treasure Data
  • 28
    High Performance
  • 25
    Easy to use
  • 22
    Fully managed service
  • 19
    Cheap Pricing
  • 16
    Process hundreds of GB in seconds
  • 12
    Big Data
  • 11
    Full table scans in seconds, no indexes needed
  • 8
    Always on, no per-hour costs
  • 6
    Good combination with fluentd
  • 4
    Machine learning
  • 1
    Easy to manage
  • 0
    Easy to learn
  • 2
    Scaleability, less overhead
  • 2
    Makes it easy to ingest all data from different inputs
  • 1
    Responsive to our business requirements, great support

Sign up to add or upvote prosMake informed product decisions

Cons of Google BigQuery
Cons of Treasure Data
  • 1
    You can't unit test changes in BQ data
  • 0
    Sdas
    Be the first to leave a con

    Sign up to add or upvote consMake informed product decisions

    What is Google BigQuery?

    Run super-fast, SQL-like queries against terabytes of data in seconds, using the processing power of Google's infrastructure. Load data with ease. Bulk load your data using Google Cloud Storage or stream it in. Easy access. Access BigQuery by using a browser tool, a command-line tool, or by making calls to the BigQuery REST API with client libraries such as Java, PHP or Python.

    What is Treasure Data?

    Treasure Data's Big Data as-a-Service cloud platform enables data-driven businesses to focus their precious development resources on their applications, not on mundane, time-consuming integration and operational tasks. The Treasure Data Cloud Data Warehouse service offers an affordable, quick-to-implement and easy-to-use big data option that does not require specialized IT resources, making big data analytics available to the mass market.

    Need advice about which tool to choose?Ask the StackShare community!

    Jobs that mention Google BigQuery and Treasure Data as a desired skillset
    What companies use Google BigQuery?
    What companies use Treasure Data?
    Manage your open source components, licenses, and vulnerabilities
    Learn More

    Sign up to get full access to all the companiesMake informed product decisions

    What tools integrate with Google BigQuery?
    What tools integrate with Treasure Data?

    Sign up to get full access to all the tool integrationsMake informed product decisions

    Blog Posts

    Aug 28 2019 at 3:10AM

    Segment

    PythonJavaAmazon S3+16
    7
    2623
    Jul 2 2019 at 9:34PM

    Segment

    Google AnalyticsAmazon S3New Relic+25
    10
    6880
    GitHubPythonNode.js+47
    55
    72769
    What are some alternatives to Google BigQuery and Treasure Data?
    Google Cloud Bigtable
    Google Cloud Bigtable offers you a fast, fully managed, massively scalable NoSQL database service that's ideal for web, mobile, and Internet of Things applications requiring terabytes to petabytes of data. Unlike comparable market offerings, Cloud Bigtable doesn't require you to sacrifice speed, scale, or cost efficiency when your applications grow. Cloud Bigtable has been battle-tested at Google for more than 10 years—it's the database driving major applications such as Google Analytics and Gmail.
    Amazon Redshift
    It is optimized for data sets ranging from a few hundred gigabytes to a petabyte or more and costs less than $1,000 per terabyte per year, a tenth the cost of most traditional data warehousing solutions.
    Hadoop
    The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.
    Snowflake
    Snowflake eliminates the administration and management demands of traditional data warehouses and big data platforms. Snowflake is a true data warehouse as a service running on Amazon Web Services (AWS)—no infrastructure to manage and no knobs to turn.
    Google Analytics
    Google Analytics lets you measure your advertising ROI as well as track your Flash, video, and social networking sites and applications.
    See all alternatives