Pig is a data-flow language for working with Big Data. It lets you express your processing requirements as a series of transformations, with the result of one flowing into the next. Pig then translates your specification into map and reduce tasks.
Pig is similar to LINQ; once you have played around with a few samples, you will have a good idea of how it works. If you need a complete introduction to Pig, we recommend Programming Pig by Alan Gates. The publisher makes an online version of the book available. We also recommend reviewing the information on the Apache Pig website. You can skip the installation information there and simply follow the samples and explanations provided.
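The data-flow idea described above can be illustrated with a small sketch. The Python below is not Pig and is not how Pig compiles a script; it only mimics a word-count flow in which each step consumes the output of the previous one, with comments naming the Pig Latin operator each step loosely corresponds to. The function names and the sample input are hypothetical and chosen for illustration only.

```python
from collections import defaultdict


def load(lines):
    """Roughly LOAD: yield one record per input line."""
    for line in lines:
        yield line.rstrip("\n")


def tokenize(records):
    """Roughly FOREACH ... GENERATE FLATTEN(TOKENIZE(line)): map-side work."""
    for record in records:
        for word in record.split():
            yield word.lower()


def group_by_word(words):
    """Roughly GROUP ... BY word: bring equal keys together (the shuffle)."""
    groups = defaultdict(list)
    for word in words:
        groups[word].append(word)
    return groups


def count(groups):
    """Roughly FOREACH ... GENERATE group, COUNT(...): reduce-side work."""
    return {word: len(members) for word, members in groups.items()}


def word_count(lines):
    # Each transformation feeds the next, which is the essence of a data flow.
    return count(group_by_word(tokenize(load(lines))))


if __name__ == "__main__":
    sample = ["pig runs on hadoop", "Pig compiles to map and reduce"]
    for word, n in sorted(word_count(sample).items()):
        print(word, n)
```

In a real Pig script you would write only the sequence of transformations; splitting the work into map and reduce tasks is handled by Pig.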
The Pig tab provides a user-friendly interface for managing and running Pig scripts. It provides the following features.
Interactively run Pig scripts
Pig scripts can be run interactively from within Big Data Studio by directly typing into the provided console.
Execute a complete Pig script
You can execute the complete script file loaded in the Editor by clicking the “Execute” button.
When you execute a script file using the “Execute” button, the output is displayed in a separate “Result” tab, in either grid or plain view depending on the “Result View” selection under the Pig tab.
Logs generated during execution are displayed under the Logs tab.
The history of Pig jobs submitted by clicking the “Execute” button is maintained separately and can be accessed through the History tab.
You can run all commands in the script file loaded in the Editor through the interactive console, one by one, by clicking the Run All button or by choosing the “Run in Console” option in the context menu.
You can run selected commands in the script file through the interactive console, one by one, by clicking the Run Selection button or by choosing the “Run Selection in Console” option in the context menu.
The Editor includes an autocomplete feature. It suggests keywords as you type, and you can accept the suggestion or choose a different one by pressing the “down arrow” key.
Manage script files
You can create a new script file or load an existing file using the “Script” button.
You can save the script to a file using the “Save As” button.
You can import scripts from a folder, create new scripts, and delete scripts listed in the tree view.
We ship several samples that you can use to get started.
Running Pig scripts in Tez mode
You can run Pig scripts in Tez mode by switching the mode to Tez using the dropdown button in the ribbon menu.
Working with HBase
You can work directly, through Pig, with data residing in the HBase instance shipped with our SDK. We also ship samples that showcase this.