Writing Scala is an enjoyable experience. What's more satisfying then using map to process your data in an array.
Spark is good at parallel data processing management. We wrote a neat program to handle the TBs data we get everyday.
The system will be deployed to our customers' data warehouses with no Internet connection.
Therefore, a simple deployment tool is necessary.