Manage your open source components, licenses, and vulnerabilities
Learn MorePros of Pig
Pros of Pig
- Finer-grained control on parallelization2
- Proven at Petabyte scale1
- Open-source1
- Join optimizations for highly skewed data1
What is Pig?
Pig is a dataflow programming environment for processing very large files. Pig's
language is called Pig Latin. A Pig Latin program consists of a directed
acyclic graph where each node represents an operation that transforms data.
Operations are of two flavors: (1) relational-algebra style operations such as
join, filter, project; (2) functional-programming style operators such as map,
reduce.
What companies use Pig?
Manage your open source components, licenses, and vulnerabilities
Learn MoreWhat tools integrate with Pig?