AWS Glue vs Mara: What are the differences?

Developers describe AWS Glue as a "fully managed extract, transform, and load (ETL) service" that makes it easy for customers to prepare and load their data for analytics. Mara, by contrast, is described as "a lightweight ETL framework" with a focus on transparency and complexity reduction.

AWS Glue and Mara are both classified primarily as "Big Data" tools.

Some of the features offered by AWS Glue are:

  • Easy - AWS Glue automates much of the effort in building, maintaining, and running ETL jobs. AWS Glue crawls your data sources, identifies data formats, and suggests schemas and transformations. AWS Glue automatically generates the code to execute your data transformations and loading processes.
  • Integrated - AWS Glue is integrated across a wide range of AWS services.
  • Serverless - AWS Glue is serverless. There is no infrastructure to provision or manage. AWS Glue handles provisioning, configuration, and scaling of the resources required to run your ETL jobs on a fully managed, scale-out Apache Spark environment. You pay only for the resources used while your jobs are running.
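To make the "crawls your data sources ... and suggests schemas" idea concrete, here is a minimal sketch of schema suggestion from sample records. It is not AWS Glue's algorithm or API: the function names, the type labels (`bigint`, `double`, `string`), and the widening rule are all assumptions chosen for illustration.

```python
def infer_type(value):
    """Map a raw string value to a coarse, illustrative column type."""
    for caster, name in ((int, "bigint"), (float, "double")):
        try:
            caster(value)
            return name
        except ValueError:
            continue
    return "string"


def suggest_schema(records):
    """Suggest one type per column, widening when samples disagree."""
    # Hypothetical widening order: bigint < double < string.
    rank = {"bigint": 0, "double": 1, "string": 2}
    schema = {}
    for record in records:
        for column, value in record.items():
            candidate = infer_type(value)
            current = schema.get(column, candidate)
            schema[column] = max(current, candidate, key=rank.get)
    return schema


# Hypothetical sample rows, as a crawler might read them from a CSV file.
sample = [
    {"id": "1", "price": "9.99", "name": "widget"},
    {"id": "2", "price": "12", "name": "gadget"},
]
schema = suggest_schema(sample)
print(schema)
```

The `price` column shows why widening matters: one sample parses as an integer and another only as a float, so the suggested type has to cover both.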

On the other hand, Mara provides the following key features:

  • Data integration pipelines as code: pipelines, tasks and commands are created using declarative Python code.
  • PostgreSQL as a data processing engine.
  • Extensive web UI: the web browser is the main tool for inspecting, running, and debugging pipelines.
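The "pipelines as code" idea from the list above can be sketched in plain Python. This is not Mara's actual API: the `Command`, `Task`, and `Pipeline` classes below are hypothetical stand-ins that only mirror the pipeline/task/command vocabulary, and the run logic is an assumption made for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Command:
    """One unit of work; here just a Python callable returning success."""
    run: Callable[[], bool]


@dataclass
class Task:
    id: str
    description: str
    commands: List[Command] = field(default_factory=list)


@dataclass
class Pipeline:
    id: str
    description: str
    tasks: List[Task] = field(default_factory=list)

    def add(self, task: Task) -> "Pipeline":
        self.tasks.append(task)
        return self

    def run(self) -> List[str]:
        """Run tasks in declaration order; stop at the first failing command."""
        completed = []
        for task in self.tasks:
            if not all(command.run() for command in task.commands):
                break
            completed.append(task.id)
        return completed


# The pipeline is declared as data; nothing executes until run() is called.
pipeline = Pipeline(id="demo", description="Load and transform orders")
pipeline.add(Task(id="extract", description="Fetch raw rows",
                  commands=[Command(run=lambda: True)]))
pipeline.add(Task(id="transform", description="Clean rows",
                  commands=[Command(run=lambda: True)]))
completed = pipeline.run()
print(completed)
```

Because the definition is ordinary Python objects, it can be version-controlled, reviewed, and inspected (for example by a web UI) before anything runs.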

Mara is an open source tool with 1.24K GitHub stars and 51 GitHub forks. Its source repository is hosted on GitHub.

What are some alternatives to AWS Glue and Mara?

  • AWS Data Pipeline - A web service that provides a simple management system for data-driven workflows. Using AWS Data Pipeline, you define a pipeline composed of the "data sources" that contain your data, the "activities" or business logic such as EMR jobs or SQL queries, and the "schedule" on which your business logic executes. For example, you could define a job that, every hour, runs an Amazon Elastic MapReduce (Amazon EMR)-based analysis on that hour's Amazon Simple Storage Service (Amazon S3) log data, loads the results into a relational database for future lookup, and then automatically sends you a daily summary email.
  • Airflow - Use Airflow to author workflows as directed acyclic graphs (DAGs) of tasks. The Airflow scheduler executes your tasks on an array of workers while following the specified dependencies. Rich command line utilities make performing complex surgeries on DAGs a snap. The rich user interface makes it easy to visualize pipelines running in production, monitor progress, and troubleshoot issues when needed.
  • Talend - An open source software integration platform that helps you turn data into business insights. It uses native code generation that lets you run your data pipelines across all cloud providers and get optimized performance on all platforms.
  • Apache Spark - Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters through YARN or Spark's standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both batch processing (similar to MapReduce) and new workloads like streaming, interactive queries, and machine learning.
  • Alooma - Get the power of big data in minutes with Alooma and Amazon Redshift. Simply build your pipelines and map your events using Alooma's friendly mapping interface. Query, analyze, visualize, and predict now.
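
Several of these tools, Airflow most explicitly, model a workflow as a directed acyclic graph of tasks executed in dependency order. The sketch below shows that ordering step using Python's standard `graphlib` module (Python 3.9+). It does not use Airflow's API, and the task names are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical task names; each key maps to the set of tasks it depends on.
dependencies = {
    "extract": set(),
    "transform": {"extract"},
    "load": {"transform"},
    "report": {"load"},
    "notify": {"load"},
}

# static_order() yields every task only after all of its dependencies.
order = list(TopologicalSorter(dependencies).static_order())
print(order)
```

A real scheduler would also handle retries, parallel workers, and timing; this only shows how dependencies constrain the execution order.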