Welcome to the HadoopExam Hadoop/BigData Professional Training Course. Please follow the steps below to view the training contents step by step.
Covered Syllabus:

Module 1 : Introduction to BigData, Hadoop (HDFS and MapReduce) : Available (Length 35 Minutes)
  1. BigData Introduction
Module 2 : Deep Dive in HDFS : Available (Length 48 Minutes) + Useful for CCA175
  1. HDFS Design
Module 2A : HDFS File Operation Lifecycle (Supplementary) : Available (Length 45 Minutes)
  1. File Read Cycle from HDFS
Module 3 : Understanding MapReduce : Available (Length 60 Minutes)
  1. JobTracker and TaskTracker
Module 4 : MapReduce Internals -1 (In Detail) : Available (Length 57 Minutes)
  1. How MapReduce Works
Module 5 : MapReduce-2 (YARN : Yet Another Resource Negotiator, Hadoop 2.x.x) : Available (Length 52 Minutes)
  1. Limitation of Current Architecture (Classic)
Module 6 : Advanced Topic for MapReduce (Performance and Optimization) : Available (Length 58 Minutes)
Module 7 : Advanced MapReduce Algorithm : Available (Length 87 Minutes)
  1. MapReduce Joining
     - Reduce Side Join
     - MapSide Join
     - Semi Join
  2. MapReduce Job Chaining
     - MapReduce Sequence Chaining
     - MapReduce Complex Chaining
Module 9 : Features of MapReduce : Available : Private (Length 61 Minutes)
  1. Introduction to MapReduce Counters
Module 10 : MapReduce DataTypes and Formats : Available : Private (Length 77 Minutes)
Module 11 : Apache Pig : Available (Length 52 Minutes)
  1. What is Pig?
  1. Working with Grunt shell
  2. Create word count application
  3. Execute word count application
  4. Accessing HDFS from Grunt shell
Module 11B : Hands On : Apache Pig Complex Datatypes : Available (Length 14 Minutes)
  1. Understand Map, Tuple and Bag
  2. Create Outer Bag and Inner Bag
  3. Defining Pig Schema
Module 11C : Hands On : Apache Pig Data Loading : Available (Length 14 Minutes)
  1. Understand Load statement
  2. Loading CSV file
  3. Loading CSV file with schema
  4. Loading tab-separated file
  5. Storing data back to HDFS
Module 11D : Hands On : Apache Pig Statements : Available (Length 8 Minutes)
  1. ForEach statement
  2. Example 1 : Data projection and ForEach statement
  3. Example 2 : Projection using schema
  4. Example 3 : Another way of selecting columns using two dots (..)
Module 11E : Hands On : Apache Pig Complex Datatype Practice : Available (Length 16 Minutes)
  1. Example 1 : Loading complex datatypes
  2. Example 2 : Loading compressed files
  3. Example 3 : Store relation as compressed files
  4. Example 4 : Nested FOREACH statements to solve the same problem
Module 12 : Fundamentals of Apache Hive Part-1 : Available (Length 60 Minutes) + Useful for CCA175
  1. What is Hive?
Module 14 : Understanding NGram Algorithm : Available (Length 14 Minutes) : Newly Replaced
Module 15 : Hands On : Step by Step Process of Creating and Configuring Eclipse for Writing MapReduce Code : Available (Length 29 Minutes) : Newly Replaced
Module 16 : Hands On : Analyzing the Result by Running NGram Application (UniGram, BiGram, TriGram etc.) : Available (Length 19 Minutes) : Newly Replaced
Module 17 : NoSQL Introduction and Implementation : Available (Length 56 Minutes) : New
  1. What is NoSQL?
Module 18 : HBase Introduction : Available (Part-1 Length 48 Minutes and Part-2 Length 37 Minutes) : New
  1. Fundamentals of HBase
  Video : Private (Part-1 and Part-2)
Module 19 : Hands On : Creating MapReduce Application and Deploying on Hadoop Cluster : Available (Length 33 Minutes) : Newly Replaced
  1. Creating MapReduce Program
Module 20 : Apache Cassandra : Available (Length 63 Minutes) : New
  1. BigData and Apache Cassandra
Module 21 : Hands On : MRUnit (MapReduce Testing Framework) : Available (Length 48 Minutes) : New
  1. Practice Basic MapReduce Without Installing Hadoop Framework
Module 22 : Apache Sqoop (SQL To Hadoop) : Available (Length 66 Minutes) : New + Useful for CCA175
  1. Sqoop Tutorial
Module 23 : Apache Flume : Available (Length 28 Minutes) : New
  1. Data Acquisition : Apache Flume Introduction
Module 24 : Advanced Apache Flume : Available (Length 48 Minutes) : New
  1. Sample Twitter Feed Configuration
Module 25 : YARN Introduction (Length 52 Mins) : Available : Hadoop 2.x YARN Training
  1. Why to Think Beyond MapReduce
  2. New Components of YARN
  3. Revisit Hadoop 1.0
  4. How YARN Fits in Hadoop Framework
  5. Hadoop MR1 Components Revisit
  6. Need for Non-MapReduce
  7. YARN Components Introduction
Module 26 : Fundamental Overview of YARN (Length 40 Mins) : Available : Hadoop 2.x YARN Training
  1. YARN Functional Component
  2. YARN Architecture Overview
  3. Claiming and Re-claiming Resources
  4. Functional Properties of Resource Manager, Node Manager, Application Master
  5. YARN Scheduling Component
  6. Introduction to FIFO Scheduler
  7. Introduction to Capacity Scheduler
Module 27 : Powerful Hadoop 2.0 Framework (Length 27 Mins) : Available : Hadoop 2.x YARN Training
  1. HDFS 1.0 Versus Hadoop 2.0
  2. Resource Manager : Subcomponent
  3. Details About Fair Share Scheduler
  4. Hierarchical Queues in Scheduler
  5. Containers
  6. Node Manager and Its Responsibility
  7. Role of Application Master while Submitting Jobs
Module 28 : Submitting the Application to YARN Hadoop Cluster (Length 27 Mins) : Available : Hadoop 2.x YARN Training
  1. Submitting the Application to YARN Hadoop Cluster
  2. Managing Application Dependencies
  3. Writing a YARN Application : Bird's-Eye View
Module 29 : LocalResources of the Application : Available : Hadoop 2.x YARN Training
  1. Understanding of YARN Application/Job Dependencies
  2. Types of LocalResource
  3. Visibilities of Local Resources
  4. Lifetime of Local Resources
  5. Good and Bad Local Resources
  6. Target Directories of Local Resources
Module 30 : Deep Dive in Capacity Scheduler (Length 39 Mins) : Available : Hadoop 2.x YARN Training
  1. Introduction and Enabling Capacity Scheduler
  2. Setting Up Queues in the CS
  3. Access Control List Setup
  4. Managing Cluster Capacity with Queues
  5. Resource Distribution Workflow Example
Module 31 : Managing Capacity Scheduler (Length 39 Mins) : Available : Hadoop 2.x YARN Training
  1. Managing Capacity with Queues
  2. Resource Distribution Example
  3. Understanding User Limits
  4. Application Reservation
  5. Understanding the Preemption
Module 32 : Hadoop Security : Kerberos Authentication (Length 23 Mins) : Available : Hadoop Security Training
  1. Kerberos Authentication
  2. Important Entities of Kerberos Authorization
  3. How the Kerberos Process Works
Module 33 : Apache Spark : Introduction to Apache Spark (Length 48 Mins) : Available : 100 Times Faster Data Processing + Useful for CCA175
  1. Introduction to Apache Spark
  2. Features of Apache Spark
  3. Apache Spark Stack
  4. Introduction to RDDs
  5. RDD Transformations
  6. What is Good and Bad in MapReduce
  7. Why to Use Apache Spark
Module 34 : Cloudera QuickStart VM Step By Step Installation (Length 19 Mins) : Available + Steps in PDF + Hands On Lab
  1. It Includes Hadoop 2.0
  2. YARN
  3. Hive
  4. Pig
  5. Hue
  6. Apache Spark
  7. Workflow
Module 35 : Load Data in HDFS Using the HDFS Commands (Length 35 Mins) : Available + Steps in PDF + Hands On Lab + Useful for CCA175
Module 36 : Importing Data from RDBMS to HDFS (Length 21 Mins) : Available + Steps in PDF + Hands On Lab + Useful for CCA175
  1. Without Specifying Directory
  2. With Target Directory
  3. With Warehouse Directory
Module 37 : Sqoop Import Module (Length 41 Mins) : Available + Steps in PDF + Hands On Lab + Useful for CCA175
  1. Importing Subset of Data from RDBMS
  2. Changing the Delimiter during Import
  3. Encoding Null Values
  4. Importing Entire Schema or All Tables
Module 38 : Importing Data to Hive Using Sqoop (Length 41 Mins) : Available + Steps in PDF + Hands On Lab + Useful for CCA175
Module 39 : Apache Avro Introduction (Length 26 Mins) : Available + PDF Download + Useful for CCA175
  1. Why Avro Files
  2. Avro File Serialization and Deserialization
  3. Adding Fields
  4. Deleting Fields
Module 40 : Apache Avro Schema In Depth (Length 12 Mins) : Available + PDF Download + Useful for CCA175
  1. Avro Schema Example
  2. Avro Embedded Schema
  3. Avro Schema Primitive Data Types
  4. Avro Schema Complex Data Types : Record, Map, Array, Union, Enum, Fixed etc.
Module 41 : Apache Avro Schema Evolution (Length 16 Mins) : Available + PDF Download + Useful for CCA175
  1. Understand Avro Schema Evolution
  2. Reader Schema and Writer Schema
  3. JSON Schema : Adding New Fields
  4. JSON Schema : Removing a Field

All of the above 41 modules are available and ready to watch and learn.