Get Advice Icon

Need advice about which tool to choose?Ask the StackShare community!

Airflow
Airflow

309
240
+ 1
16
Apache Impala
Apache Impala

58
55
+ 1
8
Add tool

What is Airflow?

Use Airflow to author workflows as directed acyclic graphs (DAGs) of tasks. The Airflow scheduler executes your tasks on an array of workers while following the specified dependencies. Rich command lines utilities makes performing complex surgeries on DAGs a snap. The rich user interface makes it easy to visualize pipelines running in production, monitor progress and troubleshoot issues when needed.

What is Apache Impala?

Impala is a modern, open source, MPP SQL query engine for Apache Hadoop. Impala is shipped by Cloudera, MapR, and Amazon. With Impala, you can query data, whether stored in HDFS or Apache HBase – including SELECT, JOIN, and aggregate functions – in real time.
Get Advice Icon

Need advice about which tool to choose?Ask the StackShare community!

Why do developers choose Airflow?
Why do developers choose Apache Impala?

Sign up to add, upvote and see more prosMake informed product decisions

    Be the first to leave a con
      Be the first to leave a con
      What companies use Airflow?
      What companies use Apache Impala?

      Sign up to get full access to all the companiesMake informed product decisions

      What tools integrate with Airflow?
      What tools integrate with Apache Impala?
        No integrations found

        Sign up to get full access to all the tool integrationsMake informed product decisions

        What are some alternatives to Airflow and Apache Impala?
        Luigi
        It is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
        Apache NiFi
        An easy to use, powerful, and reliable system to process and distribute data. It supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic.
        Jenkins
        In a nutshell Jenkins CI is the leading open-source continuous integration server. Built with Java, it provides over 300 plugins to support building and testing virtually any project.
        Apache Beam
        It implements batch and streaming data processing jobs that run on any execution engine. It executes pipelines on multiple execution environments.
        Apache Oozie
        It is a server-based workflow scheduling system to manage Hadoop jobs. Workflows in it are defined as a collection of control flow and action nodes in a directed acyclic graph. Control flow nodes define the beginning and the end of a workflow as well as a mechanism to control the workflow execution path.
        See all alternatives
        Decisions about Airflow and Apache Impala
        No stack decisions found
        Interest over time
        Reviews of Airflow and Apache Impala
        No reviews found
        How developers use Airflow and Apache Impala
        Avatar of Eugene Ivanchenko
        Eugene Ivanchenko uses AirflowAirflow

        Manage the calculation pipeline and data distribution procedures.

        Avatar of Christopher Davison
        Christopher Davison uses AirflowAirflow

        Used for scheduling ETL jobs

        How much does Airflow cost?
        How much does Apache Impala cost?
        Pricing unavailable
        Pricing unavailable
        News about Apache Impala
        More news