Pig Tutorials/concepts

Posted Posted in Hadoop, Pig

What is Pig ? Pig is an engine for running data flows in Hadoop. Pig uses a language called Pig Latin. Pig runs on Hadoop. It uses HDFS for storage and internally uses MapReduce for data flow operations. Examples of pig operations include join, filter, group by, order by etc.   Why Pig ? Does […]