Hadoop vs MariaDB

Overview

Hadoop

Stacks2.7K

Followers2.3K

Votes56

GitHub Stars15.3K

Forks9.1K

MariaDB

Stacks16.6K

Followers12.8K

Votes468

GitHub Stars6.6K

Forks1.9K

Hadoop vs MariaDB: What are the differences?

What is Hadoop? Open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

What is MariaDB? An enhanced, drop-in replacement for MySQL. Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry. MariaDB is designed as a drop-in replacement of MySQL(R) with more features, new storage engines, fewer bugs, and better performance.

Hadoop and MariaDB belong to "Databases" category of the tech stack.

"Great ecosystem" is the top reason why over 34 developers like Hadoop, while over 150 developers mention "Drop-in mysql replacement" as the leading cause for choosing MariaDB.

Hadoop and MariaDB are both open source tools. Hadoop with 9.26K GitHub stars and 5.78K forks on GitHub appears to be more popular than MariaDB with 2.82K GitHub stars and 864 GitHub forks.

Grooveshark, Shutterstock, and Geocodio are some of the popular companies that use MariaDB, whereas Hadoop is used by Airbnb, Uber Technologies, and Spotify. MariaDB has a broader approval, being mentioned in 496 company stacks & 461 developers stacks; compared to Hadoop, which is listed in 237 company stacks and 127 developer stacks.

Share your Stack

Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.

View Docs

CLI (Node.js)

Manual

Advice on Hadoop, MariaDB

Maxim

student at USI

Aug 25, 2020

Needs adviceon

Node.js

Mongoose

PostgreSQL

Hi all. I am an informatics student, and I need to realise a simple website for my friend. I am planning to realise the website using Node.js and Mongoose, since I have already done a project using these technologies. I also know SQL, and I have used PostgreSQL and MySQL previously.

The website will show a possible travel destination and local transportation. The database is used to store information about traveling, so only admin will manage the content (especially photos). While clients will see the content uploaded by the admin. I am planning to use Mongoose because it is very simple and efficient for this project. Please give me your opinion about this choice.

321k views321k

Comments

Omran

CTO & Co-founder at Bonton Connect

Jun 19, 2020

Needs advice

We actually use both Mongo and SQL databases in production. Mongo excels in both speed and developer friendliness when it comes to geospatial data and queries on the geospatial data, but we also like ACID compliance hence most of our other data (except on-site logs) are stored in a SQL Database (MariaDB for now)

582k views582k

Comments

SVP CTO

Apr 22, 2021

Needs adviceon

MarkLogic

Hadoop

Snowflake

For a property and casualty insurance company, we currently use MarkLogic and Hadoop for our raw data lake. Trying to figure out how snowflake fits in the picture. Does anybody have some good suggestions/best practices for when to use and what data to store in Mark logic versus Snowflake versus a hadoop or all three of these platforms redundant with one another?

136k views136k

Comments

Detailed Comparison

Hadoop	MariaDB
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.	Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry. MariaDB is designed as a drop-in replacement of MySQL(R) with more features, new storage engines, fewer bugs, and better performance.
-	Replication;Insert Delayed;Events;Dynamic;Columns;Full-text;Search;GIS;Locale;Settings;subqueries;Timezones;Triggers;XML;Functions;Views;SSL;Show Profile
Statistics
GitHub Stars 15.3K	GitHub Stars 6.6K
GitHub Forks 9.1K	GitHub Forks 1.9K
Stacks 2.7K	Stacks 16.6K
Followers 2.3K	Followers 12.8K
Votes 56	Votes 468
Pros & Cons
Pros 39 Great ecosystem 11 One stack to rule them all 4 Great load balancer 1 Java syntax 1 Amazon aws	Pros 149 Drop-in mysql replacement 100 Great performance 74 Open source 55 Free 44 Easy setup

What are some alternatives to Hadoop, MariaDB?

MongoDB

MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema. MongoDB was also designed for high availability and scalability, with built-in replication and auto-sharding.

MySQL

The MySQL software delivers a very fast, multi-threaded, multi-user, and robust SQL (Structured Query Language) database server. MySQL Server is intended for mission-critical, heavy-load production systems as well as for embedding into mass-deployed software.

PostgreSQL

PostgreSQL is an advanced object-relational database management system that supports an extended subset of the SQL standard, including transactions, foreign keys, subqueries, triggers, user-defined types and functions.

Microsoft SQL Server

Microsoft® SQL Server is a database management and analysis system for e-commerce, line-of-business, and data warehousing solutions.

SQLite

SQLite is an embedded SQL database engine. Unlike most other SQL databases, SQLite does not have a separate server process. SQLite reads and writes directly to ordinary disk files. A complete SQL database with multiple tables, indices, triggers, and views, is contained in a single disk file.

Cassandra

Partitioning means that Cassandra can distribute your data across multiple machines in an application-transparent matter. Cassandra will automatically repartition as machines are added and removed from the cluster. Row store means that like relational databases, Cassandra organizes data by rows and columns. The Cassandra Query Language (CQL) is a close relative of SQL.

Memcached

Memcached is an in-memory key-value store for small chunks of arbitrary data (strings, objects) from results of database calls, API calls, or page rendering.

RethinkDB

RethinkDB is built to store JSON documents, and scale to multiple machines with very little effort. It has a pleasant query language that supports really useful queries like table joins and group by, and is easy to setup and learn.

ArangoDB

A distributed free and open-source database with a flexible data model for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.

InfluxDB

InfluxDB is a scalable datastore for metrics, events, and real-time analytics. It has a built-in HTTP API so you don't have to write any server side code to get up and running. InfluxDB is designed to be scalable, simple to install and manage, and fast to get data in and out.

Related Comparisons

"Great ecosystem" is the top reason why over 34 developers like Hadoop, while over 150 developers mention "Drop-in mysql replacement" as the leading cause for choosing MariaDB.

Hadoop vs MariaDB