Apache Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment.
Hadoop makes it possible to run applications on systems with thousands of nodes involving thousands of terabytes. Its distributed file system facilitates rapid data transfer rates among nodes and allows the system to continue operating uninterrupted in case of a node failure.
Hadoop tab provides user friendly interface to manage and run Hadoop commands at ease. It provides following features.
Interactively run Hadoop scripts
Hadoop scripts can be run interactively from within Big Data Studio by directly typing Hadoop commands into the provided console.
You can run all queries in the script file loaded in Editor through interactive console one by one by clicking “Run All” button or by choosing the “Run in Console” option in context menu.
You can run selected queries in the script file through interactive console one by one by clicking “Run Selection” button or by choosing the “Run Selection in Console” option in context menu.
Autocomplete feature is added in the Editor. It will provide suggestion for the keywords based on user typing and allows the user to accept the suggestion or select by pressing “down arrow” key.
Manage script files
You can create new script file and load a file using “Script” button.
You can save as a file using “Save As” button.
You have option to import scripts from folder, create new script and delete scripts present in the tree view.
We ship several samples which you can use it for getting started.
Command Shell shortcuts:
Big Data Studio provides short cuts to Command shell and PowerShell. By clicking the shortcuts you can open command line interfaces pre-configured with Hadoop environment and ready to interact with Hadoop cluster.