Get Advice Icon

Need advice about which tool to choose?Ask the StackShare community!

Amazon DynamoDB

3.7K
3.2K
+ 1
195
Google BigQuery

1.7K
1.5K
+ 1
152
Add tool

Amazon DynamoDB vs Google BigQuery: What are the differences?

Introduction

In this article, we will compare Amazon DynamoDB and Google BigQuery, two popular data storage and analysis solutions, to understand their key differences and use cases.

  1. Data Structure and Querying Language: Amazon DynamoDB is a NoSQL database that supports key-value and document data models, making it suitable for unstructured and semi-structured data. On the other hand, Google BigQuery is a fully-managed, serverless data warehouse that excels in processing structured data using SQL-like queries.

  2. Scalability and Performance: DynamoDB offers automatic scaling with seamless data distribution, allowing it to handle massive workloads and provide high throughput with low latency. BigQuery also scales automatically and can enable petabyte-scale querying, but its performance may vary and require careful partitioning and optimization for optimal results.

  3. Storage and Cost Model: DynamoDB charges for provisioned capacity and additional features like backups and Global Tables. BigQuery follows a pay-per-query pricing model based on the amount of data processed, with separate charges for storage and queries. DynamoDB's pricing can be advantageous for steady or predictable workloads, while BigQuery's model is more suitable for ad-hoc and analytical workloads.

  4. Data Integration and Ecosystem: DynamoDB integrates tightly with other Amazon Web Services (AWS) services, allowing seamless integration and data ETL workflows. BigQuery has connectors with various Google Cloud Platform tools and external data sources, simplifying data ingestion and analysis from different systems.

  5. Security and Access Control: DynamoDB provides fine-grained access control with AWS Identity and Access Management (IAM) policies, allowing granular control over table and item-level permissions. BigQuery offers similar access control mechanisms but with Google Cloud IAM, enabling secure data sharing and collaboration within organizations.

  6. Data Analytics Capabilities: While DynamoDB provides basic querying capabilities, BigQuery offers more advanced analytics features like window functions, nested and repeated data handling, and machine learning integrations. BigQuery is optimized for complex analytical queries, making it suitable for data exploration, ad-hoc analysis, and business intelligence.

In Summary, Amazon DynamoDB is a NoSQL database with flexible data models, excellent scalability, and deep integration with AWS services. On the other hand, Google BigQuery is a versatile data warehouse that excels in structured data processing, ad-hoc querying, and advanced analytics capabilities.

Advice on Amazon DynamoDB and Google BigQuery

We are building a social media app, where users will post images, like their post, and make friends based on their interest. We are currently using Cloud Firestore and Firebase Realtime Database. We are looking for another database like Amazon DynamoDB; how much this decision can be efficient in terms of pricing and overhead?

See more
Replies (1)
William Frank
Data Science and Engineering at GeistM · | 2 upvotes · 114.8K views
Recommends

Hi, Akash,

I wouldn't make this decision without lots more information. Cloud Firestore has a much richer metamodel (document-oriented) than Dynamo (key-value), and Dynamo seems to be particularly restrictive. That is why it is so fast. There are many needs in most applications to get lightning access to the members of a set, one set at a time. Dynamo DB is a great choice. But, social media applications generally need to be able to make long traverses across a graph. While you can make almost any metamodel act like another one, with your own custom layers on top of it, or just by writing a lot more code, it's a long way around to do that with simple key-value sets. It's hard enough to traverse across networks of collections in a document-oriented database. So, if you are moving, I think a graph-oriented database like Amazon Neptune, or, if you might want built-in reasoning, Allegro or Ontotext, would take the least programming, which is where the most cost and bugs can be avoided. Also, managed systems are also less costly in terms of people's time and system errors. It's easier to measure the costs of managed systems, so they are often seen as more costly.

See more
Decisions about Amazon DynamoDB and Google BigQuery
Julien Lafont

Cloud Data-warehouse is the centerpiece of modern Data platform. The choice of the most suitable solution is therefore fundamental.

Our benchmark was conducted over BigQuery and Snowflake. These solutions seem to match our goals but they have very different approaches.

BigQuery is notably the only 100% serverless cloud data-warehouse, which requires absolutely NO maintenance: no re-clustering, no compression, no index optimization, no storage management, no performance management. Snowflake requires to set up (paid) reclustering processes, to manage the performance allocated to each profile, etc. We can also mention Redshift, which we have eliminated because this technology requires even more ops operation.

BigQuery can therefore be set up with almost zero cost of human resources. Its on-demand pricing is particularly adapted to small workloads. 0 cost when the solution is not used, only pay for the query you're running. But quickly the use of slots (with monthly or per-minute commitment) will drastically reduce the cost of use. We've reduced by 10 the cost of our nightly batches by using flex slots.

Finally, a major advantage of BigQuery is its almost perfect integration with Google Cloud Platform services: Cloud functions, Dataflow, Data Studio, etc.

BigQuery is still evolving very quickly. The next milestone, BigQuery Omni, will allow to run queries over data stored in an external Cloud platform (Amazon S3 for example). It will be a major breakthrough in the history of cloud data-warehouses. Omni will compensate a weakness of BigQuery: transferring data in near real time from S3 to BQ is not easy today. It was even simpler to implement via Snowflake's Snowpipe solution.

We also plan to use the Machine Learning features built into BigQuery to accelerate our deployment of Data-Science-based projects. An opportunity only offered by the BigQuery solution

See more
Manage your open source components, licenses, and vulnerabilities
Learn More
Pros of Amazon DynamoDB
Pros of Google BigQuery
  • 62
    Predictable performance and cost
  • 56
    Scalable
  • 35
    Native JSON Support
  • 21
    AWS Free Tier
  • 7
    Fast
  • 3
    No sql
  • 3
    To store data
  • 2
    Serverless
  • 2
    No Stored procedures is GOOD
  • 1
    ORM with DynamoDBMapper
  • 1
    Elastic Scalability using on-demand mode
  • 1
    Elastic Scalability using autoscaling
  • 1
    DynamoDB Stream
  • 28
    High Performance
  • 25
    Easy to use
  • 22
    Fully managed service
  • 19
    Cheap Pricing
  • 16
    Process hundreds of GB in seconds
  • 12
    Big Data
  • 11
    Full table scans in seconds, no indexes needed
  • 8
    Always on, no per-hour costs
  • 6
    Good combination with fluentd
  • 4
    Machine learning
  • 1
    Easy to manage
  • 0
    Easy to learn

Sign up to add or upvote prosMake informed product decisions

Cons of Amazon DynamoDB
Cons of Google BigQuery
  • 4
    Only sequential access for paginate data
  • 1
    Scaling
  • 1
    Document Limit Size
  • 1
    You can't unit test changes in BQ data
  • 0
    Sdas

Sign up to add or upvote consMake informed product decisions

272
14.1K
4.3K
26K

What is Amazon DynamoDB?

With it , you can offload the administrative burden of operating and scaling a highly available distributed database cluster, while paying a low price for only what you use.

What is Google BigQuery?

Run super-fast, SQL-like queries against terabytes of data in seconds, using the processing power of Google's infrastructure. Load data with ease. Bulk load your data using Google Cloud Storage or stream it in. Easy access. Access BigQuery by using a browser tool, a command-line tool, or by making calls to the BigQuery REST API with client libraries such as Java, PHP or Python.

Need advice about which tool to choose?Ask the StackShare community!

What companies use Amazon DynamoDB?
What companies use Google BigQuery?
Manage your open source components, licenses, and vulnerabilities
Learn More

Sign up to get full access to all the companiesMake informed product decisions

What tools integrate with Amazon DynamoDB?
What tools integrate with Google BigQuery?

Sign up to get full access to all the tool integrationsMake informed product decisions

Blog Posts

Aug 28 2019 at 3:10AM

Segment

PythonJavaAmazon S3+16
7
2680
Jul 2 2019 at 9:34PM

Segment

Google AnalyticsAmazon S3New Relic+25
10
6941
GitHubPythonNode.js+47
55
73004
GitGitHubSlack+30
27
18885
What are some alternatives to Amazon DynamoDB and Google BigQuery?
Google Cloud Datastore
Use a managed, NoSQL, schemaless database for storing non-relational data. Cloud Datastore automatically scales as you need it and supports transactions as well as robust, SQL-like queries.
MongoDB
MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema. MongoDB was also designed for high availability and scalability, with built-in replication and auto-sharding.
Amazon SimpleDB
Developers simply store and query data items via web services requests and Amazon SimpleDB does the rest. Behind the scenes, Amazon SimpleDB creates and manages multiple geographically distributed replicas of your data automatically to enable high availability and data durability. Amazon SimpleDB provides a simple web services interface to create and store multiple data sets, query your data easily, and return the results. Your data is automatically indexed, making it easy to quickly find the information that you need. There is no need to pre-define a schema or change a schema if new data is added later. And scale-out is as simple as creating new domains, rather than building out new servers.
MySQL
The MySQL software delivers a very fast, multi-threaded, multi-user, and robust SQL (Structured Query Language) database server. MySQL Server is intended for mission-critical, heavy-load production systems as well as for embedding into mass-deployed software.
Amazon S3
Amazon Simple Storage Service provides a fully redundant data storage infrastructure for storing and retrieving any amount of data, at any time, from anywhere on the web
See all alternatives