Cassandra vs Galera Cluster

Overview

Cassandra

Stacks: 3.6K

Followers: 3.5K

Votes: 507

GitHub Stars: 9.5K

Forks: 3.8K

Galera Cluster

Stacks: 54

Followers: 102

Votes: 0

Cassandra vs Galera Cluster: What are the differences?

1. Scalability: Cassandra is designed to scale horizontally. Data is partitioned across the nodes of a cluster, so read and write throughput grows roughly linearly as nodes are added. Galera Cluster is a multi-master synchronous replication solution in which every node must certify and apply every transaction, so adding nodes increases read capacity but not write capacity.

2. Consistency: Cassandra offers tunable consistency levels, chosen per request (for example ONE, QUORUM, or ALL), so users can trade consistency against availability and latency. Galera Cluster replicates each transaction to all nodes at commit time (virtually synchronous replication), which keeps the nodes consistent but adds commit latency that depends on the network round trip between nodes.

3. Data Replication: In Cassandra, the replication factor determines how many copies of each row are stored, and the consistency level determines how many of those replicas must acknowledge a read or write. Together they control durability and availability. Galera Cluster replicates every write to all nodes, so each node holds a full copy of the data.

4. Partitioning: Cassandra uses consistent hashing of the partition key to distribute rows across nodes, and it rebalances automatically when nodes join or leave. Galera Cluster does not partition data: every node stores the full dataset, so horizontal partitioning has to be implemented as sharding at the application or proxy level.

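5. Conflict Resolution: Cassandra resolves conflicting writes with last-write-wins. Each cell carries a write timestamp, assigned by the coordinator or supplied by the client, and the value with the latest timestamp is kept. Galera Cluster detects conflicts through certification at commit time: when two nodes modify the same row concurrently, the transaction ordered first commits and the other is rolled back with a deadlock error, which the application can retry.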

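6. High Availability: Cassandra provides fault tolerance through replication plus repair mechanisms such as hinted handoff, read repair, and anti-entropy repair. Galera Cluster provides high availability through synchronous replication: any node can serve traffic because all nodes hold the same data, at the cost of added write latency, and the cluster needs a majority of nodes (a quorum) to keep accepting writes.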
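The consistent-hashing placement described in item 4 can be sketched in a few lines of Python. This is an illustration of the general technique, not Cassandra's implementation: the `HashRing` class, the node names, the virtual-node count, and the use of SHA-256 as the hash are all assumptions made for the example (Cassandra's default partitioner uses a different hash function).

```python
import bisect
import hashlib


def token(key: str) -> int:
    """Map a string to a ring position (SHA-256 is only a stand-in hash here)."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


class HashRing:
    """Hypothetical ring: each node owns several virtual-node tokens."""

    def __init__(self, nodes, vnodes=8):
        # One (token, node) entry per virtual node, sorted by token.
        self._ring = sorted(
            (token(f"{node}#{i}"), node) for node in nodes for i in range(vnodes)
        )
        self._tokens = [t for t, _ in self._ring]

    def replicas(self, key: str, replication_factor: int = 3):
        """Walk clockwise from the key's token and collect distinct nodes."""
        start = bisect.bisect_right(self._tokens, token(key))
        owners = []
        for offset in range(len(self._ring)):
            node = self._ring[(start + offset) % len(self._ring)][1]
            if node not in owners:
                owners.append(node)
            if len(owners) == replication_factor:
                break
        return owners


ring = HashRing(["node-a", "node-b", "node-c", "node-d"])
owners = ring.replicas("user:42", replication_factor=3)
print(owners)  # three distinct node names; which ones depends on the hash
```

Because a key's owners depend only on its token and the sorted token list, adding or removing a node moves only the keys adjacent to that node's tokens. A Galera node, by contrast, would appear as an owner of every key.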

In summary, Cassandra focuses on scalability and flexibility in data distribution, while Galera Cluster prioritizes strong consistency and data integrity in a multi-master setup.
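The tunable consistency mentioned in item 2 comes down to simple arithmetic: a read is guaranteed to touch at least one replica that acknowledged the latest write when the number of replicas written plus the number read exceeds the replication factor. The sketch below works through that rule for a replication factor of 3. The function names are hypothetical, and the rule describes replica overlap only, not every failure scenario.

```python
def quorum(replication_factor: int) -> int:
    """Size of a majority of replicas: floor(RF / 2) + 1."""
    return replication_factor // 2 + 1


def read_overlaps_write(replication_factor: int, write_acks: int, read_acks: int) -> bool:
    """True when every read set must share at least one replica with every write set."""
    return write_acks + read_acks > replication_factor


RF = 3
LEVELS = {"ONE": 1, "QUORUM": quorum(RF), "ALL": RF}

# Print which write/read level pairs give overlapping replica sets.
for write_name, w in LEVELS.items():
    for read_name, r in LEVELS.items():
        verdict = "overlap" if read_overlaps_write(RF, w, r) else "may read stale data"
        print(f"write={write_name:<6} read={read_name:<6} -> {verdict}")
```

With RF = 3, writing and reading at QUORUM (2 + 2 > 3) overlaps, while ONE/ONE (1 + 1 <= 3) does not. Galera has no equivalent knob, since every committed transaction is replicated to all nodes.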


Advice on Cassandra, Galera Cluster

Vinay

Head of Engineering

Sep 19, 2019

Needs advice

The problem I have is this: we need to process and change (update/insert) 55M records every 2 minutes, and the updated data must be available to a REST API for filtering and selection. The REST API response time should be under 1 second.

The most important factor for me is the 2-minute window for processing and storing. There need to be two views of the data: one for selection and one for the changed data.
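To put the requirement in perspective, here is a back-of-the-envelope calculation of the sustained write rate it implies. It assumes "55M" means 55 million records, all written within each 2-minute window. The per-node write rate and the replication factor in the second half are made-up placeholder numbers, not benchmarks. Measure your own hardware before sizing anything.

```python
import math

ROWS_PER_BATCH = 55_000_000   # from the question (assumed to mean records)
WINDOW_SECONDS = 2 * 60       # from the question


def required_rows_per_second(rows: int = ROWS_PER_BATCH, window: int = WINDOW_SECONDS) -> float:
    """Sustained write rate needed to finish one batch inside the window."""
    return rows / window


def cassandra_nodes_needed(per_node_writes_per_s: float, replication_factor: int) -> int:
    """Rough node count: every row is written once per replica."""
    replica_writes = required_rows_per_second() * replication_factor
    return math.ceil(replica_writes / per_node_writes_per_s)


rate = required_rows_per_second()
print(f"required rate: {rate:,.0f} rows/s")  # about 458,333 rows/s

# Hypothetical figures for illustration only.
print(cassandra_nodes_needed(per_node_writes_per_s=20_000, replication_factor=3))  # 69
```

The same arithmetic reads differently for Galera Cluster: because every node applies every write, each node would have to sustain the full rate by itself, and adding nodes does not spread the write load.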


Detailed Comparison

Cassandra	Galera Cluster
Partitioning means that Cassandra can distribute your data across multiple machines in an application-transparent manner. Cassandra will automatically repartition as machines are added and removed from the cluster. Row store means that, like relational databases, Cassandra organizes data by rows and columns. The Cassandra Query Language (CQL) is a close relative of SQL.	It’s an easy-to-use, high-availability solution that provides high system uptime, no data loss, and scalability for future growth. You can keep it up and running 24/7. Putting our expertise to use will help you avoid trial and error.
-	True multi-master: read and write to any node at any time; Synchronous replication: no slave lag, no data lost at node crash; Tightly coupled: all nodes hold the same state; Multi-threaded slave: for better performance.
Statistics
GitHub Stars: 9.5K	GitHub Stars: -
GitHub Forks: 3.8K	GitHub Forks: -
Stacks: 3.6K	Stacks: 54
Followers: 3.5K	Followers: 102
Votes: 507	Votes: 0
Pros & Cons
Pros: Distributed (119), High performance (98), High availability (81), Easy scalability (74), Replication (53). Cons: Reliability of replication (3), Size (1), Updates (1).	No community feedback yet
Integrations
No integrations available	MongoDB, PostgreSQL, Oracle, MySQL, SQLFlow, MariaDB

What are some alternatives to Cassandra, Galera Cluster?

MongoDB

MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema. MongoDB was also designed for high availability and scalability, with built-in replication and auto-sharding.

MySQL

The MySQL software delivers a very fast, multi-threaded, multi-user, and robust SQL (Structured Query Language) database server. MySQL Server is intended for mission-critical, heavy-load production systems as well as for embedding into mass-deployed software.

PostgreSQL

PostgreSQL is an advanced object-relational database management system that supports an extended subset of the SQL standard, including transactions, foreign keys, subqueries, triggers, user-defined types and functions.

dbForge Studio for MySQL

It is a universal MySQL and MariaDB client for database management, administration, and development, intended to make working with data and code easier. It provides utilities to compare, synchronize, and back up MySQL databases on a schedule, and to analyze and report on MySQL table data.

Microsoft SQL Server

Microsoft® SQL Server is a database management and analysis system for e-commerce, line-of-business, and data warehousing solutions.

SQLite

SQLite is an embedded SQL database engine. Unlike most other SQL databases, SQLite does not have a separate server process. SQLite reads and writes directly to ordinary disk files. A complete SQL database with multiple tables, indices, triggers, and views, is contained in a single disk file.

Memcached

Memcached is an in-memory key-value store for small chunks of arbitrary data (strings, objects) from results of database calls, API calls, or page rendering.

MariaDB

Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry. MariaDB is designed as a drop-in replacement of MySQL(R) with more features, new storage engines, fewer bugs, and better performance.

dbForge Studio for Oracle

It is an integrated development environment (IDE) that helps Oracle SQL developers increase PL/SQL coding speed and provides versatile data editing tools for managing in-database and external data.

dbForge Studio for PostgreSQL

It is a GUI tool for database development and management. The IDE for PostgreSQL allows users to create, develop, and execute queries, edit and adjust the code to their requirements in a convenient and user-friendly interface.
