Cassandra vs RethinkDB

Overview

RethinkDB

Stacks292

Followers406

Votes307

GitHub Stars27.0K

Forks1.9K

Cassandra

Stacks3.6K

Followers3.5K

Votes507

GitHub Stars9.5K

Forks3.8K

Cassandra vs RethinkDB: What are the differences?

# Introduction
Cassandra and RethinkDB are both popular NoSQL databases known for their scalability, high availability, and fault tolerance. However, there are key differences between the two that make them suitable for different use cases.

1. **Data Model**:
   Cassandra follows a wide column data model, allowing flexible schema design and column-based storage. In contrast, RethinkDB employs a JSON document model, providing a more structured and nested data representation.

2. **Query Language**:
   Cassandra uses CQL (Cassandra Query Language), which is SQL-like and optimized for data retrieval. RethinkDB utilizes ReQL (RethinkDB Query Language), a functional query language with support for real-time queries via changefeeds.

3. **Consistency Level**:
   Cassandra offers tunable consistency levels, allowing users to balance between consistency and availability. RethinkDB maintains strong consistency by default, ensuring that all nodes see the same data at the same time.

4. **Horizontal Scalability**:
   Cassandra is designed for linear scalability, making it ideal for large-scale distributed deployments. RethinkDB supports sharding for horizontal scalability, enabling a distributed architecture but may require more management overhead.

5. **Data Durability**:
   Cassandra provides durable writes through its commit log and storage engine, ensuring data persistence even in the event of node failures. RethinkDB offers strong durability guarantees by synchronously writing data to disk on multiple nodes for fault tolerance.

6. **Real-time Updates**:
   RethinkDB excels in real-time applications with its built-in support for changefeeds, allowing applications to receive updates as soon as they occur. Cassandra lacks native support for real-time updates, requiring additional tools or integration for similar functionality.

In Summary, while both Cassandra and RethinkDB offer scalability and fault tolerance, they differ in data models, query languages, consistency levels, scalability options, durability mechanisms, and real-time update capabilities, making them suitable for distinct use cases.

Share your Stack

Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.

View Docs

CLI (Node.js)

Manual

Advice on RethinkDB, Cassandra

Micha

CEO & Co-Founder at Dechea

May 27, 2022

Decided

Fauna is a serverless database where you store data as JSON. Also, you have build in a HTTP GraphQL interface with a full authentication & authorization layer. That means you can skip your Backend and call it directly from the Frontend. With the power, that you can write data transformation function within Fauna with her own language called FQL, we're getting a blazing fast application.

Also, Fauna takes care about scaling and backups (All data are sharded on three different locations on the globe). That means we can fully focus on writing business logic and don't have to worry anymore about infrastructure.

93.1k views93.1k

Comments

Krishna Chaitanya

Head of Technology at Adonmo

Jun 27, 2021

Review

For such a more realtime-focused, data-centered application like an exchange, it's not the frontend or backend that matter much. In fact for that, they can do away with any of the popular frameworks like React/Vue/Angular for the frontend and Go/Python for the backend. For example uniswap's frontend (although much simpler than binance) is built in React. The main interesting part here would be how they are able to handle updating data so quickly. In my opinion, they might be heavily reliant on realtime processing systems like Kafka+Kafka Streams, Apache Flink or Apache Spark Stream or similar. For more processing heavy but not so real-time processing, they might be relying on OLAP and/or warehousing tools like Cassandra/Redshift. They could have also optimized few high frequent queries using NoSQL stores like mongodb (for persistance) and in-memory cache like Redis (for further perfomance boost to get millisecond latencies).

53.8k views53.8k

Comments

Feb 24, 2022

Decided

I’m newbie I was developing a pouchdb and couchdb app cause if the sync. Lots of learning very little code available. I dropped the project cause it consumed my life. Yeats later I’m back into it. I researched other db and came across rethinkdb and mongo for the subscription features. With socketio I should be able to create and similar sync feature. Attempted to use mongo. I attempted to use rethink. Rethink for the win. Super clear l. I had it running in minutes on my local machine and I believe it’s supposed to scale easy. Mongo wasn’t as easy and there free online db is so slow what’s the point. Very easy to find mongo code examples and use rethink code in its place. I wish I went this route years ago. All that corporate google Amazon crap get bent. The reason they have so much power in the world is cause you guys are giving it to them.

79.7k views79.7k

Comments

Detailed Comparison

RethinkDB	Cassandra
RethinkDB is built to store JSON documents, and scale to multiple machines with very little effort. It has a pleasant query language that supports really useful queries like table joins and group by, and is easy to setup and learn.	Partitioning means that Cassandra can distribute your data across multiple machines in an application-transparent matter. Cassandra will automatically repartition as machines are added and removed from the cluster. Row store means that like relational databases, Cassandra organizes data by rows and columns. The Cassandra Query Language (CQL) is a close relative of SQL.
JSON data model and immediate consistency.;Distributed joins, subqueries, aggregation, atomic updates.;Secondary, compound, and arbitrarily computed indexes.;Hadoop-style map/reduce.;Friendly web and command-line administration tools.;Takes care of machine failures and network interrupts.;Multi-datacenter replication and failover.;Sharding and replication to multiple nodes.;Queries are automatically parallelized and distributed.;Lock-free operation via MVCC concurrency.	-
Statistics
GitHub Stars 27.0K	GitHub Stars 9.5K
GitHub Forks 1.9K	GitHub Forks 3.8K
Stacks 292	Stacks 3.6K
Followers 406	Followers 3.5K
Votes 307	Votes 507
Pros & Cons
Pros 48 Powerful query language 46 Excellent dashboard 42 JSON 41 Distributed database 38 Open source	Pros 119 Distributed 98 High performance 81 High availability 74 Easy scalability 53 Replication Cons 3 Reliability of replication 2 Size 1 Updates
Integrations
Amazon EC2	No integrations available

What are some alternatives to RethinkDB, Cassandra?

MongoDB

MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema. MongoDB was also designed for high availability and scalability, with built-in replication and auto-sharding.

MySQL

The MySQL software delivers a very fast, multi-threaded, multi-user, and robust SQL (Structured Query Language) database server. MySQL Server is intended for mission-critical, heavy-load production systems as well as for embedding into mass-deployed software.

PostgreSQL

PostgreSQL is an advanced object-relational database management system that supports an extended subset of the SQL standard, including transactions, foreign keys, subqueries, triggers, user-defined types and functions.

Microsoft SQL Server

Microsoft® SQL Server is a database management and analysis system for e-commerce, line-of-business, and data warehousing solutions.

SQLite

SQLite is an embedded SQL database engine. Unlike most other SQL databases, SQLite does not have a separate server process. SQLite reads and writes directly to ordinary disk files. A complete SQL database with multiple tables, indices, triggers, and views, is contained in a single disk file.

Memcached

Memcached is an in-memory key-value store for small chunks of arbitrary data (strings, objects) from results of database calls, API calls, or page rendering.

MariaDB

Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stable, and sanely licensed open SQL server in the industry. MariaDB is designed as a drop-in replacement of MySQL(R) with more features, new storage engines, fewer bugs, and better performance.

ArangoDB

A distributed free and open-source database with a flexible data model for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.

InfluxDB

InfluxDB is a scalable datastore for metrics, events, and real-time analytics. It has a built-in HTTP API so you don't have to write any server side code to get up and running. InfluxDB is designed to be scalable, simple to install and manage, and fast to get data in and out.

CouchDB

Apache CouchDB is a database that uses JSON for documents, JavaScript for MapReduce indexes, and regular HTTP for its API. CouchDB is a database that completely embraces the web. Store your data with JSON documents. Access your documents and query your indexes with your web browser, via HTTP. Index, combine, and transform your documents with JavaScript.

Related Comparisons

Cassandra vs RethinkDB: What are the differences?

# Introduction
Cassandra and RethinkDB are both popular NoSQL databases known for their scalability, high availability, and fault tolerance. However, there are key differences between the two that make them suitable for different use cases.

1. **Data Model**:
   Cassandra follows a wide column data model, allowing flexible schema design and column-based storage. In contrast, RethinkDB employs a JSON document model, providing a more structured and nested data representation.

2. **Query Language**:
   Cassandra uses CQL (Cassandra Query Language), which is SQL-like and optimized for data retrieval. RethinkDB utilizes ReQL (RethinkDB Query Language), a functional query language with support for real-time queries via changefeeds.

3. **Consistency Level**:
   Cassandra offers tunable consistency levels, allowing users to balance between consistency and availability. RethinkDB maintains strong consistency by default, ensuring that all nodes see the same data at the same time.

4. **Horizontal Scalability**:
   Cassandra is designed for linear scalability, making it ideal for large-scale distributed deployments. RethinkDB supports sharding for horizontal scalability, enabling a distributed architecture but may require more management overhead.

5. **Data Durability**:
   Cassandra provides durable writes through its commit log and storage engine, ensuring data persistence even in the event of node failures. RethinkDB offers strong durability guarantees by synchronously writing data to disk on multiple nodes for fault tolerance.

6. **Real-time Updates**:
   RethinkDB excels in real-time applications with its built-in support for changefeeds, allowing applications to receive updates as soon as they occur. Cassandra lacks native support for real-time updates, requiring additional tools or integration for similar functionality.

In Summary, while both Cassandra and RethinkDB offer scalability and fault tolerance, they differ in data models, query languages, consistency levels, scalability options, durability mechanisms, and real-time update capabilities, making them suitable for distinct use cases.

Cassandra vs RethinkDB

Overview

Cassandra vs RethinkDB: What are the differences?

Share your Stack

Advice on RethinkDB, Cassandra

Detailed Comparison

What are some alternatives to RethinkDB, Cassandra?

MongoDB

MySQL

PostgreSQL

Microsoft SQL Server

SQLite

Memcached

MariaDB

ArangoDB

InfluxDB

CouchDB

Related Comparisons

Bootstrap vs Materialize

Django vs Laravel vs Node.js

Bootstrap vs Foundation vs Material UI

Node.js vs Spring-Boot

Flyway vs Liquibase

Cassandra vs RethinkDB

Overview

Cassandra vs RethinkDB: What are the differences?

Share your Stack

Advice on RethinkDB, Cassandra

Detailed Comparison

What are some alternatives to RethinkDB, Cassandra?

MongoDB

MySQL

PostgreSQL

Microsoft SQL Server

SQLite

Memcached

MariaDB

ArangoDB

InfluxDB

CouchDB

Related Comparisons

Bootstrap vs Materialize

Django vs Laravel vs Node.js

Bootstrap vs Foundation vs Material UI

Node.js vs Spring-Boot

Flyway vs Liquibase