Hadoop vs. HBase vs. TokuMX

Get help choosing one of these Get news updates about these tools


Hadoop

HBase

TokuMX

Favorites

45

Favorites

13

Favorites

3

Hacker News, Reddit, Stack Overflow Stats

  • -
  • 29
  • 38.7K
  • -
  • -
  • 6.19K
  • 538
  • 134
  • 0

GitHub Stats

Description

What is Hadoop?

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

What is HBase?

Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google' Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Apache Hadoop.

What is TokuMX?

TokuMX is a drop-in replacement for MongoDB, and offers 20X performance improvements, 90% reduction in database size, and support for ACID transactions with MVCC. TokuMX has the same binaries, supports the same drivers, data model, and features of MongoDB, because it shares much of its code with MongoDB.

Pros about this tool

Pros
Why do you like Hadoop?

Pros
Why do you like HBase?

Pros
Why do you like TokuMX?

Pricing

Companies

207 Companies Using Hadoop
48 Companies Using HBase
2 Companies Using TokuMX

Integrations

Hadoop Integrations
HBase Integrations
TokuMX Integrations

Latest News

Elasticsearch for Apache Hadoop 6.2.0 Released
How to Get Hadoop Data into a Python Model
Elasticsearch for Apache Hadoop 6.0.0 GA is Released


Interest Over Time


Get help choosing one of these