It implements batch and streaming data processing jobs as pipelines that can run on multiple execution engines.
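
The idea of one pipeline definition running on more than one execution environment can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the project's actual API: `Pipeline`, `run_batch`, and `run_streaming` are hypothetical names invented here, and the two "engines" are simple stand-ins for real execution backends.

```python
from typing import Callable, Iterable, Iterator, List


class Pipeline:
    """Hypothetical engine-independent job: an ordered list of per-element steps."""

    def __init__(self) -> None:
        self.steps: List[Callable[[int], int]] = []

    def map(self, fn: Callable[[int], int]) -> "Pipeline":
        # Record the step; nothing runs until an engine executes the pipeline.
        self.steps.append(fn)
        return self


def apply_steps(pipeline: Pipeline, element: int) -> int:
    # Push one element through every recorded step, in order.
    for step in pipeline.steps:
        element = step(element)
    return element


def run_batch(pipeline: Pipeline, data: List[int]) -> List[int]:
    # Stand-in "batch engine": the whole bounded input is available up front.
    return [apply_steps(pipeline, x) for x in data]


def run_streaming(pipeline: Pipeline, source: Iterable[int]) -> Iterator[int]:
    # Stand-in "streaming engine": elements are processed one at a time as they arrive.
    for x in source:
        yield apply_steps(pipeline, x)


# The same pipeline definition is handed to both engines.
pipeline = Pipeline().map(lambda x: x + 1).map(lambda x: x * 2)
batch_result = run_batch(pipeline, [1, 2, 3])
stream_result = list(run_streaming(pipeline, iter([1, 2, 3])))
```

The point of the sketch is the separation: the pipeline only describes the work, and each runner decides how to execute it, so both produce the same results from the same definition.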