Needs adviceThe system will be deployed to our customers' data warehouses with no Internet connection. Therefore, a simple deployment tool is necessary. Docker1 upvote10 views0CommentsCopy link
Needs adviceSpark is good at parallel data processing management. We wrote a neat program to handle the TBs data we get everyday. Apache Spark1 upvote10 views0CommentsCopy link
Needs adviceWriting Scala is an enjoyable experience. What's more satisfying then using map to process your data in an array. Scala1 upvote10 views0CommentsCopy link