Kudu logo
Fast Analytics on Fast Data. A columnar storage manager developed for the Hadoop platform

What is Kudu?

A new addition to the open source Apache Hadoop ecosystem, Kudu completes Hadoop's storage layer to enable fast analytics on fast data.
Kudu is a tool in the Big Data Tools category of a tech stack.
Kudu is an open source tool with 794 GitHub stars and 266 GitHub forks. Here’s a link to Kudu's open source repository on GitHub

Who uses Kudu?

5 companies reportedly use Kudu in their tech stacks, including Sensel Telematics, Data Pipeline, and HIS.

21 developers on StackShare have stated that they use Kudu.

Kudu Integrations

Why developers like Kudu?

Here’s a list of reasons why companies and developers use Kudu
Top Reasons

Kudu Alternatives & Comparisons

What are some alternatives to Kudu?
Apache Spark
Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters through YARN or Spark's standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both batch processing (similar to MapReduce) and new workloads like streaming, interactive queries, and machine learning.
Apache Flink
Apache Flink is an open source system for fast and versatile data analytics in clusters. Flink supports batch and streaming analytics, in one system. Analytical programs can be written in concise and elegant APIs in Java and Scala.
Amazon Athena
Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.
Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power exploratory dashboards in multi-tenant environments. Druid excels as a data warehousing solution for fast aggregate queries on petabyte sized data sets. Druid supports a variety of flexible filters, exact calculations, approximate algorithms, and other useful calculations.
Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes.
See all alternatives

Kudu's Stats

Kudu's Followers
36 developers follow Kudu to keep up with related blogs and decisions.
Himansu Sekhar
Alexander Adam
Irtiza Ali
JiaGen Zhou
Praneeth Ramesh
Dinh Man Nguyen
Ruslan Dautkhanov