HBase vs RethinkDB vs TokuMX

Get Advice Icon

Need advice about which tool to choose?Ask the StackShare community!

HBase
HBase

209
176
+ 1
12
RethinkDB
RethinkDB

242
239
+ 1
297
TokuMX
TokuMX

6
8
+ 1
3

What is HBase?

Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google' Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Apache Hadoop.

What is RethinkDB?

RethinkDB is built to store JSON documents, and scale to multiple machines with very little effort. It has a pleasant query language that supports really useful queries like table joins and group by, and is easy to setup and learn.

What is TokuMX?

TokuMX is a drop-in replacement for MongoDB, and offers 20X performance improvements, 90% reduction in database size, and support for ACID transactions with MVCC. TokuMX has the same binaries, supports the same drivers, data model, and features of MongoDB, because it shares much of its code with MongoDB.
Get Advice Icon

Need advice about which tool to choose?Ask the StackShare community!

Why do developers choose HBase?
Why do developers choose RethinkDB?
Why do developers choose TokuMX?

Sign up to add, upvote and see more prosMake informed product decisions

    Be the first to leave a con
      Be the first to leave a con
        Be the first to leave a con
        What companies use HBase?
        What companies use RethinkDB?
        What companies use TokuMX?

        Sign up to get full access to all the companiesMake informed product decisions

        What tools integrate with HBase?
        What tools integrate with RethinkDB?
        What tools integrate with TokuMX?

        Sign up to get full access to all the tool integrationsMake informed product decisions

        What are some alternatives to HBase, RethinkDB, and TokuMX?
        Cassandra
        Partitioning means that Cassandra can distribute your data across multiple machines in an application-transparent matter. Cassandra will automatically repartition as machines are added and removed from the cluster. Row store means that like relational databases, Cassandra organizes data by rows and columns. The Cassandra Query Language (CQL) is a close relative of SQL.
        MongoDB
        MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema. MongoDB was also designed for high availability and scalability, with built-in replication and auto-sharding.
        Hadoop
        The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.
        Druid
        Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power exploratory dashboards in multi-tenant environments. Druid excels as a data warehousing solution for fast aggregate queries on petabyte sized data sets. Druid supports a variety of flexible filters, exact calculations, approximate algorithms, and other useful calculations.
        Apache Hive
        Hive facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. Structure can be projected onto data already in storage.
        See all alternatives
        Decisions about HBase, RethinkDB, and TokuMX
        No stack decisions found
        Interest over time
        Reviews of HBase, RethinkDB, and TokuMX
        No reviews found
        How developers use HBase, RethinkDB, and TokuMX
        Avatar of Pinterest
        Pinterest uses HBaseHBase

        The final output is inserted into HBase to serve the experiment dashboard. We also load the output data to Redshift for ad-hoc analysis. For real-time experiment data processing, we use Storm to tail Kafka and process data in real-time and insert metrics into MySQL, so we could identify group allocation problems and send out real-time alerts and metrics.

        Avatar of Sine Wave Entertainment
        Sine Wave Entertainment uses RethinkDBRethinkDB

        High-speed update-aware storage used in our region server infrastructure; provides a good middle layer for storage of rapidly modified information.

        Avatar of Runbook
        Runbook uses RethinkDBRethinkDB

        Main database, using it in multiple datacenters in an active-active configuration.

        Avatar of Tobe O
        Tobe O uses RethinkDBRethinkDB

        Angel includes support for multiple databases, out-of-the-box.

        Avatar of Mike MacCana
        Mike MacCana uses RethinkDBRethinkDB

        As a boring document oriented database with safe defaults.

        Avatar of Axibase
        Axibase uses HBaseHBase
        • Raw storage engine
        • Replication
        • Fault-tolerance
        Avatar of Domraider
        Domraider uses RethinkDBRethinkDB

        Sharded and replicated storage, NoSQL with joins

        Avatar of Mehdi TAZI
        Mehdi TAZI uses HBaseHBase

        Range scan and HDFS Buffering system

        Avatar of anerudhbalaji
        anerudhbalaji uses HBaseHBase

        Primary datastore

        How much does HBase cost?
        How much does RethinkDB cost?
        How much does TokuMX cost?
        Pricing unavailable
        Pricing unavailable
        Pricing unavailable
        News about HBase
        More news
        News about TokuMX
        More news