Pig vs OpenRefine: What are the differences?
What is Pig? Platform for analyzing large data sets. Pig is a dataflow programming environment for processing very large files. Pig's language is called Pig Latin. A Pig Latin program consists of a directed acyclic graph where each node represents an operation that transforms data Operations are of two flavors: (1) relational-algebra style operations such as join, filter, project; (2) functional-programming style operators such as map, reduce. .
What is OpenRefine? Desktop application for data cleanup and transformation. It is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web services and external data.
Pig and OpenRefine can be primarily classified as "Big Data" tools.
Pig and OpenRefine are both open source tools. It seems that OpenRefine with 6.54K GitHub stars and 1.15K forks on GitHub has more adoption than Pig with 583 GitHub stars and 447 GitHub forks.