Hadoop vs. HBase vs. RocksDB

Get help choosing one of these Get news updates about these tools


Favorites

45

Favorites

13

Favorites

8

Hacker News, Reddit, Stack Overflow Stats

  • -
  • 29
  • 39.1K
  • -
  • -
  • 6.23K
  • 326
  • 30
  • 138

GitHub Stats

Description

What is Hadoop?

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

What is HBase?

Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google' Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Apache Hadoop.

What is RocksDB?

RocksDB is an embeddable persistent key-value store for fast storage. RocksDB can also be the foundation for a client-server database but our current focus is on embedded workloads. RocksDB builds on LevelDB to be scalable to run on servers with many CPU cores, to efficiently use fast storage, to support IO-bound, in-memory and write-once workloads, and to be flexible to allow for innovation.

Pros

Why do developers choose Hadoop?
Why do you like Hadoop?

Why do developers choose HBase?
Why do you like HBase?

Why do developers choose RocksDB?
Why do you like RocksDB?

Companies

What companies use Hadoop?
210 companies on StackShare use Hadoop
What companies use HBase?
47 companies on StackShare use HBase
What companies use RocksDB?
4 companies on StackShare use RocksDB

Integrations

What tools integrate with Hadoop?
14 tools on StackShare integrate with Hadoop
What tools integrate with HBase?
3 tools on StackShare integrate with HBase
No integrations listed yet

What are some alternatives to Hadoop, HBase, and RocksDB?

  • MySQL - The world's most popular open source database
  • PostgreSQL - A powerful, open source object-relational database system
  • MongoDB - The database for giant ideas
  • Microsoft SQL Server - A relational database management system developed by Microsoft

See all alternatives to Hadoop

Latest News

Elasticsearch for Apache Hadoop 6.2.0 Released
How to Get Hadoop Data into a Python Model
Elasticsearch for Apache Hadoop 6.0.0 GA is Released
All things RocksDB at Percona Live Europe 2017


Interest Over Time


Get help choosing one of these