What is Amazon EMR?
What is Amazon Redshift?
Want advice about which of these to choose?Ask the StackShare community!
What are the cons of using Amazon EMR?
What are the cons of using Amazon Redshift?
What tools integrate with Amazon EMR?
What tools integrate with Amazon Redshift?
We ultimately migrated our Hadoop jobs to Qubole, a rising player in the Hadoop as a Service space. Given that EMR had become unstable at our scale, we had to quickly move to a provider that played well with AWS (specifically, spot instances) and S3. Qubole supported AWS/S3 and was relatively easy to get started on. After vetting Qubole and comparing its performance against alternatives (including managed clusters), we decided to go with Qubole
Aggressive archiving of historical data to keep the production database as small as possible. Using our in-house soon-to-be-open-sourced ETL library, SharpShifter.