HBase vs Oracle

Overview

Oracle

Stacks2.6K

Followers1.8K

Votes113

HBase

Stacks514

Followers498

Votes15

GitHub Stars5.5K

Forks3.4K

HBase vs Oracle: What are the differences?

Introduction

HBase and Oracle are both popular database systems used for storing and managing large amounts of data. However, there are several key differences between the two.

Data Model: HBase is a NoSQL database, while Oracle is a relational database. This means that HBase stores data in a distributed and scalable manner, organized in columns, column families, and rows. On the other hand, Oracle stores data in tables and enforces a predefined schema, allowing for complex relationships between different tables.
Scalability: HBase is designed to scale horizontally, meaning that it can easily handle large amounts of data by adding more servers to the cluster. Oracle, on the other hand, can scale vertically by upgrading the hardware of a single server. This difference in scalability makes HBase a better choice for big data applications that require massive amounts of storage and processing power.
Consistency: HBase provides eventual consistency, which means that reads may not always return the most up-to-date data. This is because HBase is designed for high availability and fault tolerance, allowing for concurrent updates from multiple clients. In contrast, Oracle provides strong consistency, ensuring that every read operation returns the most recent data.
Transaction Support: Oracle is known for its strong transaction support, allowing for ACID (Atomicity, Consistency, Isolation, Durability) properties. It provides features like rollback, commit, and concurrency control, ensuring data integrity. HBase, on the other hand, does not natively support complex transactions, and only offers atomicity on a per-row basis.
Query Language: Oracle uses SQL (Structured Query Language) for querying and manipulating data, which is a widely adopted standard language for relational databases. HBase, on the other hand, provides a Java API for accessing and manipulating data. While SQL allows for complex queries and joins, the HBase API requires developers to write code to perform similar operations.
Cost: Oracle is a commercial database that requires licensing fees, while HBase is an open-source project that is free to use. This cost difference can be a significant factor for organizations with budget constraints, especially when dealing with large-scale data storage and processing requirements.

In Summary, HBase and Oracle differ in their data models, scalability, consistency, transaction support, query language, and cost. The choice between the two depends on the specific requirements of the application, such as the size of the dataset, the need for strong consistency, the complexity of transactions, and the available budget.

Share your Stack

Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.

View Docs

CLI (Node.js)

Manual

Advice on Oracle, HBase

Feb 9, 2022

Needs adviceon

Milvus

HBase

RocksDB

I am researching different querying solutions to handle ~1 trillion records of data (in the realm of a petabyte). The data is mostly textual. I have identified a few options: Milvus, HBase, RocksDB, and Elasticsearch. I was wondering if there is a good way to compare the performance of these options (or if anyone has already done something like this). I want to be able to compare the speed of ingesting and querying textual data from these tools. Does anyone have information on this or know where I can find some? Thanks in advance!

174k views174k

Comments

Daniel

Data Engineer at Dimensigon

Jul 18, 2020

Decided

We have chosen Tibero over Oracle because we want to offer a PL/SQL-as-a-Service that the users can deploy in any Cloud without concerns from our website at some standard cost. With Oracle Database, developers would have to worry about what they implement and the related costs of each feature but the licensing model from Tibero is just 1 price and we have all features included, so we don't have to worry and developers using our SQLaaS neither. PostgreSQL would be open source. We have chosen Tibero over Oracle because we want to offer a PL/SQL that you can deploy in any Cloud without concerns. PostgreSQL would be the open source option but we need to offer an SQLaaS with encryption and more enterprise features in the background and best value option we have found, it was Tibero Database for PL/SQL-based applications.

496k views496k

Comments

Abigail

Dec 6, 2019

Decided

In the field of bioinformatics, we regularly work with hierarchical and unstructured document data. Unstructured text data from PDFs, image data from radiographs, phylogenetic trees and cladograms, network graphs, streaming ECG data... none of it fits into a traditional SQL database particularly well. As such, we prefer to use document oriented databases.

MongoDB is probably the oldest component in our stack besides Javascript, having been in it for over 5 years. At the time, we were looking for a technology that could simply cache our data visualization state (stored in JSON) in a database as-is without any destructive normalization. MongoDB was the perfect tool; and has been exceeding expectations ever since.

Trivia fact: some of the earliest electronic medical records (EMRs) used a document oriented database called MUMPS as early as the 1960s, prior to the invention of SQL. MUMPS is still in use today in systems like Epic and VistA, and stores upwards of 40% of all medical records at hospitals. So, we saw MongoDB as something as a 21st century version of the MUMPS database.

540k views540k

Comments

Detailed Comparison

Oracle	HBase
Oracle Database is an RDBMS. An RDBMS that implements object-oriented features such as user-defined types, inheritance, and polymorphism is called an object-relational database management system (ORDBMS). Oracle Database has extended the relational model to an object-relational model, making it possible to store complex business models in a relational database.	Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google' Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Apache Hadoop.
Statistics
GitHub Stars -	GitHub Stars 5.5K
GitHub Forks -	GitHub Forks 3.4K
Stacks 2.6K	Stacks 514
Followers 1.8K	Followers 498
Votes 113	Votes 15
Pros & Cons
Pros 44 Reliable 33 Enterprise 15 High Availability 5 Expensive 5 Hard to maintain Cons 14 Expensive	Pros 9 Performance 5 OLTP 1 Fast Point Queries

What are some alternatives to Oracle, HBase?

MongoDB

MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema. MongoDB was also designed for high availability and scalability, with built-in replication and auto-sharding.

MySQL

The MySQL software delivers a very fast, multi-threaded, multi-user, and robust SQL (Structured Query Language) database server. MySQL Server is intended for mission-critical, heavy-load production systems as well as for embedding into mass-deployed software.

PostgreSQL

PostgreSQL is an advanced object-relational database management system that supports an extended subset of the SQL standard, including transactions, foreign keys, subqueries, triggers, user-defined types and functions.

Microsoft SQL Server

Microsoft® SQL Server is a database management and analysis system for e-commerce, line-of-business, and data warehousing solutions.

SQLite

SQLite is an embedded SQL database engine. Unlike most other SQL databases, SQLite does not have a separate server process. SQLite reads and writes directly to ordinary disk files. A complete SQL database with multiple tables, indices, triggers, and views, is contained in a single disk file.

Cassandra

Partitioning means that Cassandra can distribute your data across multiple machines in an application-transparent matter. Cassandra will automatically repartition as machines are added and removed from the cluster. Row store means that like relational databases, Cassandra organizes data by rows and columns. The Cassandra Query Language (CQL) is a close relative of SQL.

Memcached

Memcached is an in-memory key-value store for small chunks of arbitrary data (strings, objects) from results of database calls, API calls, or page rendering.

MariaDB

Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry. MariaDB is designed as a drop-in replacement of MySQL(R) with more features, new storage engines, fewer bugs, and better performance.

RethinkDB

RethinkDB is built to store JSON documents, and scale to multiple machines with very little effort. It has a pleasant query language that supports really useful queries like table joins and group by, and is easy to setup and learn.

ArangoDB

A distributed free and open-source database with a flexible data model for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.

Related Comparisons

HBase vs Oracle: What are the differences?

Introduction

HBase and Oracle are both popular database systems used for storing and managing large amounts of data. However, there are several key differences between the two.

Data Model: HBase is a NoSQL database, while Oracle is a relational database. This means that HBase stores data in a distributed and scalable manner, organized in columns, column families, and rows. On the other hand, Oracle stores data in tables and enforces a predefined schema, allowing for complex relationships between different tables.
Scalability: HBase is designed to scale horizontally, meaning that it can easily handle large amounts of data by adding more servers to the cluster. Oracle, on the other hand, can scale vertically by upgrading the hardware of a single server. This difference in scalability makes HBase a better choice for big data applications that require massive amounts of storage and processing power.
Consistency: HBase provides eventual consistency, which means that reads may not always return the most up-to-date data. This is because HBase is designed for high availability and fault tolerance, allowing for concurrent updates from multiple clients. In contrast, Oracle provides strong consistency, ensuring that every read operation returns the most recent data.
Transaction Support: Oracle is known for its strong transaction support, allowing for ACID (Atomicity, Consistency, Isolation, Durability) properties. It provides features like rollback, commit, and concurrency control, ensuring data integrity. HBase, on the other hand, does not natively support complex transactions, and only offers atomicity on a per-row basis.
Query Language: Oracle uses SQL (Structured Query Language) for querying and manipulating data, which is a widely adopted standard language for relational databases. HBase, on the other hand, provides a Java API for accessing and manipulating data. While SQL allows for complex queries and joins, the HBase API requires developers to write code to perform similar operations.
Cost: Oracle is a commercial database that requires licensing fees, while HBase is an open-source project that is free to use. This cost difference can be a significant factor for organizations with budget constraints, especially when dealing with large-scale data storage and processing requirements.

HBase vs Oracle

Overview

HBase vs Oracle: What are the differences?