Pig

Pros of Pig
  • Finer-grained control on parallelization
  • Proven at petabyte scale
  • Open-source
  • Join optimizations for highly skewed data

What is Pig?

Pig is a dataflow programming environment for processing very large files. Its language is called Pig Latin. A Pig Latin program describes a directed acyclic graph in which each node represents an operation that transforms data. Operations come in two flavors: (1) relational-algebra-style operations such as join, filter, and project; (2) functional-programming-style operators such as map and reduce.
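
To make the dataflow idea concrete, here is a minimal sketch in plain Python, not Pig Latin and not how Pig executes anything. It only mimics the shape of such a program: each assignment is one node in the graph, consuming the output of earlier nodes. The tables `users` and `visits`, their fields, and the helpers `filter_rows`, `project`, and `join` are all hypothetical, made up for illustration.

```python
from functools import reduce

# Hypothetical in-memory "relations"; a real Pig job would read large files.
users = [
    {"id": 1, "name": "ann", "age": 34},
    {"id": 2, "name": "bob", "age": 17},
    {"id": 3, "name": "cat", "age": 52},
]
visits = [
    {"user_id": 1, "seconds": 30},
    {"user_id": 3, "seconds": 45},
    {"user_id": 1, "seconds": 10},
    {"user_id": 2, "seconds": 99},
]


def filter_rows(rows, predicate):
    """Relational filter: keep only the rows that satisfy the predicate."""
    return [row for row in rows if predicate(row)]


def project(rows, fields):
    """Relational project: keep only the named fields of each row."""
    return [{f: row[f] for f in fields} for row in rows]


def join(left, right, left_key, right_key):
    """Relational inner join, using a hash index built on the right side."""
    index = {}
    for row in right:
        index.setdefault(row[right_key], []).append(row)
    return [
        {**l_row, **r_row}
        for l_row in left
        for r_row in index.get(l_row[left_key], [])
    ]


# Each line below is one node of the dataflow graph.
adults = filter_rows(users, lambda row: row["age"] >= 18)    # filter
adult_names = project(adults, ["id", "name"])                # project
joined = join(adult_names, visits, "id", "user_id")          # join
seconds = list(map(lambda row: row["seconds"], joined))      # map
total_seconds = reduce(lambda a, b: a + b, seconds, 0)       # reduce
```

The first three nodes are the relational-algebra-style operations and the last two are the functional-style ones. Because no node depends on its own output, the chain of assignments forms a directed acyclic graph.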