Apache Spark vs. Amazon Athena vs. AresDB

  • -
  • 564
  • 0
  • -
  • -
  • 767
  • -
  • 567
  • 0
No public GitHub repository stats available

What is Apache Spark?

Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters through YARN or Spark's standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both batch processing (similar to MapReduce) and new workloads like streaming, interactive queries, and machine learning.

What is Amazon Athena?

Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.

What is AresDB?

AresDB is a GPU-powered real-time analytics storage and query engine. It features low query latency, high data freshness and highly efficient in-memory and on disk storage management.

Want advice about which of these to choose?Ask the StackShare community!

Why do developers choose Apache Spark?
Why do you like Apache Spark?

Why do developers choose Amazon Athena?
Why do you like Amazon Athena?

Why do developers choose AresDB?
Why do you like AresDB?

What are the cons of using Apache Spark?
No Cons submitted yet for Apache Spark
Downsides of Apache Spark?

What are the cons of using Amazon Athena?
No Cons submitted yet for Amazon Athena
Downsides of Amazon Athena?

What are the cons of using AresDB?
No Cons submitted yet for AresDB
Downsides of AresDB?

What companies use Apache Spark?
327 companies on StackShare use Apache Spark
What companies use Amazon Athena?
47 companies on StackShare use Amazon Athena
What companies use AresDB?
1 companies on StackShare use AresDB
What tools integrate with Apache Spark?
12 tools on StackShare integrate with Apache Spark
What tools integrate with Amazon Athena?
7 tools on StackShare integrate with Amazon Athena
No integrations listed yet

What are some alternatives to Apache Spark, Amazon Athena, and AresDB?

  • Apache Flink - Fast and reliable large-scale data processing engine
  • Druid - Fast column-oriented distributed data store
  • Presto - Distributed SQL Query Engine for Big Data
  • Impala - Real-time Query for Hadoop

See all alternatives to Apache Spark

Interest Over Time