HBase vs Riak

Overview

Riak: Stacks 103, Followers 137, Votes 44, GitHub Stars 4.0K, Forks 535

HBase: Stacks 512, Followers 498, Votes 15, GitHub Stars 5.5K, Forks 3.4K

HBase vs Riak: What are the differences?

HBase: The Hadoop database, a distributed, scalable, big data store. Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google's Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Apache Hadoop.

Riak: A distributed, decentralized data storage system. Riak is a distributed database designed to deliver maximum data availability by distributing data across multiple servers. As long as your client can reach one Riak server, it should be able to write data. In most failure scenarios, the data you want to read should be available, although it may not be the most up-to-date version of that data.
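The availability trade-off described for Riak (a write succeeds if any server is reachable, while a read may return an older version) can be pictured with a small toy model. This is only a sketch under stated assumptions: `Node`, `ToyCluster`, and the single version counter are hypothetical names invented for illustration, and none of this is Riak's client API or its actual replication protocol.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One hypothetical server holding its own copy of the data."""
    name: str
    reachable: bool = True
    store: dict = field(default_factory=dict)  # key -> (version, value)


class ToyCluster:
    """Toy model only: not Riak's API or its real replication protocol."""

    def __init__(self, names):
        self.nodes = [Node(n) for n in names]
        self.clock = 0  # simplistic global version counter (an assumption)

    def put(self, key, value):
        """Accept the write if at least one node is reachable."""
        live = [n for n in self.nodes if n.reachable]
        if not live:
            raise ConnectionError("no reachable node")
        self.clock += 1
        for node in live:
            node.store[key] = (self.clock, value)
        return len(live)  # number of copies updated

    def get(self, key, via):
        """Read from one named node; its copy may be out of date."""
        node = next(n for n in self.nodes if n.name == via)
        if not node.reachable:
            raise ConnectionError(f"{via} is unreachable")
        return node.store.get(key)


cluster = ToyCluster(["a", "b", "c"])
cluster.put("user:1", "v1")          # all three nodes store version 1
cluster.nodes[2].reachable = False   # node "c" drops out
cluster.put("user:1", "v2")          # still succeeds via "a" and "b"
cluster.nodes[2].reachable = True    # "c" returns without catching up
fresh = cluster.get("user:1", via="a")
stale = cluster.get("user:1", via="c")
```

In this toy run, the read through node "a" sees the second write while the read through node "c" still sees the first one. A real system would also need some way to reconcile the diverged copies, which this sketch leaves out.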

HBase and Riak belong to the "Databases" category of the tech stack.

"Performance" is the primary reason developers consider HBase over its competitors, whereas "High Performance" was stated as the key factor in picking Riak.

HBase and Riak are both open source tools. By the GitHub figures listed on this page, HBase (5.5K stars, 3.4K forks) appears to be more popular than Riak (4.0K stars, 535 forks).

According to the StackShare community, HBase has broader approval, being mentioned in 54 company stacks and 18 developer stacks, compared to Riak, which is listed in 15 company stacks and 10 developer stacks.

Advice on Riak, HBase

Feb 9, 2022. Needs advice on Milvus, HBase, RocksDB

I am researching different querying solutions to handle ~1 trillion records of data (in the realm of a petabyte). The data is mostly textual. I have identified a few options: Milvus, HBase, RocksDB, and Elasticsearch. I was wondering if there is a good way to compare the performance of these options (or if anyone has already done something like this). I want to be able to compare the speed of ingesting and querying textual data from these tools. Does anyone have information on this or know where I can find some? Thanks in advance!

Detailed Comparison

	Riak	HBase
Description	Riak is a distributed database designed to deliver maximum data availability by distributing data across multiple servers. As long as your client can reach one Riak server, it should be able to write data. In most failure scenarios, the data you want to read should be available, although it may not be the most up-to-date version of that data.	Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google's Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Apache Hadoop.
Statistics
GitHub Stars	4.0K	5.5K
GitHub Forks	535	3.4K
Stacks	103	512
Followers	137	498
Votes	44	15
Pros & Cons
Pros (votes)	High Performance (14), High Availability (11), Easy Scalability (9), Flexible (5), Reliable (1)	Performance (9), OLTP (5), Fast Point Queries (1)
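
The "versioned, column-oriented" description of HBase in the comparison above can be pictured as a nested map: row key, then a "family:qualifier" column, then timestamped values. The following is a minimal sketch of that idea only. `ToyTable` and its methods are hypothetical names for illustration, not the HBase client API, and storage details such as regions and on-disk files are ignored.

```python
from collections import defaultdict


class ToyTable:
    """Toy model only: not the HBase client API."""

    def __init__(self):
        # row key -> "family:qualifier" -> {timestamp: value}
        self.rows = defaultdict(lambda: defaultdict(dict))

    def put(self, row, column, value, ts):
        """Store one version of a cell under an explicit timestamp."""
        self.rows[row][column][ts] = value

    def get(self, row, column, max_versions=1):
        """Return up to max_versions (timestamp, value) pairs, newest first."""
        cells = self.rows.get(row, {}).get(column, {})
        newest_first = sorted(cells.items(), reverse=True)
        return newest_first[:max_versions]


table = ToyTable()
table.put("row1", "info:email", "old@example.com", ts=1)
table.put("row1", "info:email", "new@example.com", ts=2)
latest = table.get("row1", "info:email")
history = table.get("row1", "info:email", max_versions=2)
```

Here `latest` holds only the newest version of the cell, while `history` returns both versions with the most recent first.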

What are some alternatives to Riak, HBase?

MongoDB

MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema. MongoDB was also designed for high availability and scalability, with built-in replication and auto-sharding.

MySQL

The MySQL software delivers a very fast, multi-threaded, multi-user, and robust SQL (Structured Query Language) database server. MySQL Server is intended for mission-critical, heavy-load production systems as well as for embedding into mass-deployed software.

PostgreSQL

PostgreSQL is an advanced object-relational database management system that supports an extended subset of the SQL standard, including transactions, foreign keys, subqueries, triggers, user-defined types and functions.

Microsoft SQL Server

Microsoft® SQL Server is a database management and analysis system for e-commerce, line-of-business, and data warehousing solutions.

SQLite

SQLite is an embedded SQL database engine. Unlike most other SQL databases, SQLite does not have a separate server process. SQLite reads and writes directly to ordinary disk files. A complete SQL database with multiple tables, indices, triggers, and views, is contained in a single disk file.

Cassandra

Partitioning means that Cassandra can distribute your data across multiple machines in an application-transparent manner. Cassandra will automatically repartition as machines are added and removed from the cluster. Row store means that, like relational databases, Cassandra organizes data by rows and columns. The Cassandra Query Language (CQL) is a close relative of SQL.

Memcached

Memcached is an in-memory key-value store for small chunks of arbitrary data (strings, objects) from results of database calls, API calls, or page rendering.

MariaDB

Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry. MariaDB is designed as a drop-in replacement of MySQL(R) with more features, new storage engines, fewer bugs, and better performance.

RethinkDB

RethinkDB is built to store JSON documents and scale to multiple machines with very little effort. It has a pleasant query language that supports really useful queries like table joins and group by, and is easy to set up and learn.

ArangoDB

A distributed free and open-source database with a flexible data model for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.
