Cloudera Enterprise vs Redis

Overview

Cloudera Enterprise

Stacks126

Followers172

Votes5

Redis

Stacks61.9K

Followers46.5K

Votes3.9K

GitHub Stars42

Forks6

Cloudera Enterprise vs Redis: What are the differences?

Developers describe Cloudera Enterprise as "Enterprise Platform for Big Data". Cloudera Enterprise includes CDH, the world’s most popular open source Hadoop-based platform, as well as advanced system management and data management tools plus dedicated support and community advocacy from our world-class team of Hadoop developers and experts. On the other hand, Redis is detailed as "An in-memory database that persists on disk". Redis is an open source, BSD licensed, advanced key-value store. It is often referred to as a data structure server since keys can contain strings, hashes, lists, sets and sorted sets.

Cloudera Enterprise can be classified as a tool in the "Big Data as a Service" category, while Redis is grouped under "In-Memory Databases".

Redis is an open source tool with 37.1K GitHub stars and 14.3K GitHub forks. Here's a link to Redis's open source repository on GitHub.

reddit, Instacart, and Slack are some of the popular companies that use Redis, whereas Cloudera Enterprise is used by Hammer Lab, JPush, and Jobrapido. Redis has a broader approval, being mentioned in 3239 company stacks & 1732 developers stacks; compared to Cloudera Enterprise, which is listed in 4 company stacks and 7 developer stacks.

Share your Stack

Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.

View Docs

CLI (Node.js)

Manual

Detailed Comparison

Cloudera Enterprise	Redis
Cloudera Enterprise includes CDH, the world’s most popular open source Hadoop-based platform, as well as advanced system management and data management tools plus dedicated support and community advocacy from our world-class team of Hadoop developers and experts.	Redis is an open source (BSD licensed), in-memory data structure store, used as a database, cache, and message broker. Redis provides data structures such as strings, hashes, lists, sets, sorted sets with range queries, bitmaps, hyperloglogs, geospatial indexes, and streams.
Unified – one integrated system, bringing diverse users and application workloads to one pool of data on common infrastructure; no data movement required;Secure – perimeter security, authentication, granular authorization, and data protection;Governed – enterprise-grade data auditing, data lineage, and data discovery;Managed – native high-availability, fault-tolerance and self-healing storage, automated backup and disaster recovery, and advanced system and data management;Open – Apache-licensed open source to ensure your data and applications remain yours, and an open platform to connect with all of your existing investments in technology and skills	-
Statistics
GitHub Stars -	GitHub Stars 42
GitHub Forks -	GitHub Forks 6
Stacks 126	Stacks 61.9K
Followers 172	Followers 46.5K
Votes 5	Votes 3.9K
Pros & Cons
Pros 1 Cheeper 1 Scalability 1 Easily management 1 Multicloud 1 Hybrid cloud	Pros 888 Performance 542 Super fast 514 Ease of use 444 In-memory cache 324 Advanced key-value cache Cons 15 Cannot query objects directly 3 No secondary indexes for non-numeric data types 1 No WAL

What are some alternatives to Cloudera Enterprise, Redis?

Google BigQuery

Run super-fast, SQL-like queries against terabytes of data in seconds, using the processing power of Google's infrastructure. Load data with ease. Bulk load your data using Google Cloud Storage or stream it in. Easy access. Access BigQuery by using a browser tool, a command-line tool, or by making calls to the BigQuery REST API with client libraries such as Java, PHP or Python.

Amazon Redshift

It is optimized for data sets ranging from a few hundred gigabytes to a petabyte or more and costs less than $1,000 per terabyte per year, a tenth the cost of most traditional data warehousing solutions.

Qubole

Qubole is a cloud based service that makes big data easy for analysts and data engineers.

Hazelcast

With its various distributed data structures, distributed caching capabilities, elastic nature, memcache support, integration with Spring and Hibernate and more importantly with so many happy users, Hazelcast is feature-rich, enterprise-ready and developer-friendly in-memory data grid solution.

Amazon EMR

It is used in a variety of applications, including log analysis, data warehousing, machine learning, financial analysis, scientific simulation, and bioinformatics.

Aerospike

Aerospike is an open-source, modern database built from the ground up to push the limits of flash storage, processors and networks. It was designed to operate with predictable low latency at high throughput with uncompromising reliability – both high availability and ACID guarantees.

MemSQL

MemSQL converges transactions and analytics for sub-second data processing and reporting. Real-time businesses can build robust applications on a simple and scalable infrastructure that complements and extends existing data pipelines.

Apache Ignite

It is a memory-centric distributed database, caching, and processing platform for transactional, analytical, and streaming workloads delivering in-memory speeds at petabyte scale

Altiscale

we run Apache Hadoop for you. We not only deploy Hadoop, we monitor, manage, fix, and update it for you. Then we take it a step further: We monitor your jobs, notify you when something’s wrong with them, and can help with tuning.

Snowflake

Snowflake eliminates the administration and management demands of traditional data warehouses and big data platforms. Snowflake is a true data warehouse as a service running on Amazon Web Services (AWS)—no infrastructure to manage and no knobs to turn.