Big Data Hadoop Administration and Developer Course This course is designed to understand and implement the concepts of Big data and Hadoop. This will cover right from setting up Hadoop environment in Linux system followed up setting programming environment. This course also covers the development aspect of Hadoop Map Reduce and its eco-systems like Pig, Hive, Hbase, Sqoop etc. Module 1 - Introduction to Big Data and Hadoop Introduction to Big data Why Big data? Case Studies Introduction to Hadoop Understanding Features of Hadoop Module 2 - Hadoop Architecture & Ecosystem Hadoop Architecture Hadoop Services Hadoop Ecosystem Components Module 3 Essentials for Hadoop Re-visiting Ubuntu Server 12.04 Installation and Configuration of Ubuntu Server 12.04 Revisiting Ubuntu Desktop 12.04 Installation and Configuration of Ubuntu Desktop 12.04 Module 4 - Hadoop Installation (Single Node) & Configuration Getting Hadoop Starting installation and configuration of Hadoop
Verifying the installation and testing Hadoop Module 5 - Multi-node Cluster Hadoop Installation & Configuration Creating multiple Hadoop single node instances Setting up multi-node cluster Testing the multi-node cluster Module 6 HDFS Understanding HDFS Exploring Files -Handling Commands in HDFS Exploring Admin commands in HDFS Exploring Hadoop Web UI for HDFS Module 7 - Introduction to Map Reduce What is map reduce? Understanding the flow of Map Reduce Realtime examples Exploring Hadoop UI for Map Reduce operations Module 8 - Map Reduce Development using Java Setting up eclipse environment Writing First Program in Map Reduce Mappers Reducers
Module 9 Hadoop Administration and Troubleshooting Understanding and Exploring Administrating Hadoop Logs Common Troubleshooting Techniques Module 10 - Introduction to Pig What is Pig? Why Pig? Pig Features Installation and Configuration Module 11 - Development using PigLATIN Understanding Pig Environment Pig LATIN Syntax Loading Data Simple Data types Viewing the schema Filtering and Sorting the data Grouping the Data User Defined Functions Troubleshooting Pig Using Hadoop Web UI Module 12- Introduction to Hive What is Hive?
Hive Schema and Storage Comparison between Hive and SQL Installation and Configuration of Hive Module 13 - Developement using Hive Relational data analysis using Hive Hive Databases and Syntax Basic Hive Query Language Syntax Data Types Joining Data Sets Loading data in to hive Altering databases in hive Self Managed tables Storing Query Results Examples Module 14 - Introduction to Hbase. What is Hbase? Why use Hbase? Hbase and RDBMS Hbase Concepts Hbase Architecture Installation and Configuration of Hbase Hbase Administration API Accessing data with Hbase API
Module 15 - Introduction to Sqoop Why Sqoop? What Sqoop provides? Sqoop processing methods Sqoop Import Sqoop Export Installation and configuration of SQOOP Module 16 - Introduction to Zookeeper What is Zookeeper? Why Zookeeper? Zookeeper goals Zookeeper Architecture Zookeeper installation and configuration Module 17 - Introduction and working with Cloudera VM What is Cloudera? Features of Cloudera Downloading the Cloudera VM Starting the Cloudera VM Exploring Hue Exploring Cloudera Administration