Amazon EMR

I use AWS Glue because I thought it was worth all they hype Fall 2018. However, you had to use Python 2.7 with no pandas support, and cold starts lasted as long as 15 minutes. Also, setting up a dev environment for iterative development was near impossible at the time.

It was a terrible experience for me. I recommend using Amazon EMR instead. Even talking with a friend that works at Amazon, they use EMR instead of Glue for internal spark workloads. Just because a company makes something doesn't mean they use that something :/

Amazon EMR

What is Amazon EMR?

Key Features

Amazon EMR Pros & Cons

Pros of Amazon EMR

Cons of Amazon EMR

Amazon EMR Integrations

Amazon EMR Discussions

Amazon EMR Alternatives & Comparisons

Google BigQuery

Amazon Redshift

Snowflake

Stitch

Cloudera Enterprise

Dremio

Try It

Adoption

Amazon EMR Integrations

Amazon EMR Discussions