HBase vs Oracle

Need advice about which tool to choose?Ask the StackShare community!

HBase

372
441
+ 1
15
Oracle

1.7K
1.4K
+ 1
108
Add tool

HBase vs Oracle: What are the differences?

HBase: The Hadoop database, a distributed, scalable, big data store. Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google' Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Apache Hadoop; Oracle: An RDBMS that implements object-oriented features such as user-defined types, inheritance, and polymorphism. Oracle Database is an RDBMS. An RDBMS that implements object-oriented features such as user-defined types, inheritance, and polymorphism is called an object-relational database management system (ORDBMS). Oracle Database has extended the relational model to an object-relational model, making it possible to store complex business models in a relational database.

HBase and Oracle belong to "Databases" category of the tech stack.

"Performance" is the primary reason why developers consider HBase over the competitors, whereas "Reliable" was stated as the key factor in picking Oracle.

HBase is an open source tool with 2.91K GitHub stars and 2.01K GitHub forks. Here's a link to HBase's open source repository on GitHub.

Netflix, ebay, and LinkedIn are some of the popular companies that use Oracle, whereas HBase is used by Pinterest, HubSpot, and Yammer. Oracle has a broader approval, being mentioned in 106 company stacks & 92 developers stacks; compared to HBase, which is listed in 54 company stacks and 18 developer stacks.

Decisions about HBase and Oracle
Daniel Moya
Data Engineer at Dimensigon · | 4 upvotes · 241.4K views

We have chosen Tibero over Oracle because we want to offer a PL/SQL-as-a-Service that the users can deploy in any Cloud without concerns from our website at some standard cost. With Oracle Database, developers would have to worry about what they implement and the related costs of each feature but the licensing model from Tibero is just 1 price and we have all features included, so we don't have to worry and developers using our SQLaaS neither. PostgreSQL would be open source. We have chosen Tibero over Oracle because we want to offer a PL/SQL that you can deploy in any Cloud without concerns. PostgreSQL would be the open source option but we need to offer an SQLaaS with encryption and more enterprise features in the background and best value option we have found, it was Tibero Database for PL/SQL-based applications.

See more

We wanted a JSON datastore that could save the state of our bioinformatics visualizations without destructive normalization. As a leading NoSQL data storage technology, MongoDB has been a perfect fit for our needs. Plus it's open source, and has an enterprise SLA scale-out path, with support of hosted solutions like Atlas. Mongo has been an absolute champ. So much so that SQL and Oracle have begun shipping JSON column types as a new feature for their databases. And when Fast Healthcare Interoperability Resources (FHIR) announced support for JSON, we basically had our FHIR datalake technology.

See more

In the field of bioinformatics, we regularly work with hierarchical and unstructured document data. Unstructured text data from PDFs, image data from radiographs, phylogenetic trees and cladograms, network graphs, streaming ECG data... none of it fits into a traditional SQL database particularly well. As such, we prefer to use document oriented databases.

MongoDB is probably the oldest component in our stack besides Javascript, having been in it for over 5 years. At the time, we were looking for a technology that could simply cache our data visualization state (stored in JSON) in a database as-is without any destructive normalization. MongoDB was the perfect tool; and has been exceeding expectations ever since.

Trivia fact: some of the earliest electronic medical records (EMRs) used a document oriented database called MUMPS as early as the 1960s, prior to the invention of SQL. MUMPS is still in use today in systems like Epic and VistA, and stores upwards of 40% of all medical records at hospitals. So, we saw MongoDB as something as a 21st century version of the MUMPS database.

See more
Get Advice from developers at your company using Private StackShare. Sign up for Private StackShare.
Learn More
Pros of HBase
Pros of Oracle
  • 9
    Performance
  • 5
    OLTP
  • 1
    Fast Point Queries
  • 42
    Reliable
  • 31
    Enterprise
  • 15
    High Availability
  • 5
    Expensive
  • 5
    Hard to maintain
  • 4
    Maintainable
  • 3
    High complexity
  • 3
    Hard to use

Sign up to add or upvote prosMake informed product decisions

Cons of HBase
Cons of Oracle
    Be the first to leave a con
    • 13
      Expensive

    Sign up to add or upvote consMake informed product decisions

    - No public GitHub repository available -

    What is HBase?

    Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google' Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Apache Hadoop.

    What is Oracle?

    Oracle Database is an RDBMS. An RDBMS that implements object-oriented features such as user-defined types, inheritance, and polymorphism is called an object-relational database management system (ORDBMS). Oracle Database has extended the relational model to an object-relational model, making it possible to store complex business models in a relational database.

    Need advice about which tool to choose?Ask the StackShare community!

    What companies use HBase?
    What companies use Oracle?
    See which teams inside your own company are using HBase or Oracle.
    Sign up for Private StackShareLearn More

    Sign up to get full access to all the companiesMake informed product decisions

    What tools integrate with HBase?
    What tools integrate with Oracle?

    Sign up to get full access to all the tool integrationsMake informed product decisions

    Blog Posts

    Jun 24 2020 at 4:42PM

    Pinterest

    Amazon S3KafkaHBase+4
    4
    1090
    MySQLKafkaApache Spark+6
    2
    1721
    What are some alternatives to HBase and Oracle?
    Cassandra
    Partitioning means that Cassandra can distribute your data across multiple machines in an application-transparent matter. Cassandra will automatically repartition as machines are added and removed from the cluster. Row store means that like relational databases, Cassandra organizes data by rows and columns. The Cassandra Query Language (CQL) is a close relative of SQL.
    Google Cloud Bigtable
    Google Cloud Bigtable offers you a fast, fully managed, massively scalable NoSQL database service that's ideal for web, mobile, and Internet of Things applications requiring terabytes to petabytes of data. Unlike comparable market offerings, Cloud Bigtable doesn't require you to sacrifice speed, scale, or cost efficiency when your applications grow. Cloud Bigtable has been battle-tested at Google for more than 10 years—it's the database driving major applications such as Google Analytics and Gmail.
    MongoDB
    MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema. MongoDB was also designed for high availability and scalability, with built-in replication and auto-sharding.
    Hadoop
    The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.
    Druid
    Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power exploratory dashboards in multi-tenant environments. Druid excels as a data warehousing solution for fast aggregate queries on petabyte sized data sets. Druid supports a variety of flexible filters, exact calculations, approximate algorithms, and other useful calculations.
    See all alternatives