Hadoop vs. Riak vs. HBase

Get help choosing one of these Get news updates about these tools


Hadoop

Riak

HBase

Favorites

43

Favorites

16

Favorites

13

Hacker News, Reddit, Stack Overflow Stats

  • -
  • 29
  • 37.1K
  • 1.45K
  • 438
  • 752
  • -
  • -
  • 5.95K

GitHub Stats

Description

What is Hadoop?

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

What is Riak?

Riak is a distributed database designed to deliver maximum data availability by distributing data across multiple servers. As long as your client can reach one Riak server, it should be able to write data. In most failure scenarios, the data you want to read should be available, although it may not be the most up-to-date version of that data.

What is HBase?

Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google' Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Apache Hadoop.

Pros about this tool

Why do you like Hadoop?

Why do you like Riak?

Why do you like HBase?

Cons about this tool

Customers

Integrations

Latest News

Introducing Spark Structured Streaming Support in ES...
Spring for Apache Hadoop 2.5.0 GA released
Elasticsearch for Apache Hadoop 5.5.0


Interest Over Time


Get help choosing one of these