Module-7

Content 

Module 7 : Advanced MapReduce Algorithm : Available (Length 87 Minutes) 

File Based Data Structure

- Sequence File

- MapFile

Default Sorting In MapReduce

- Data Filtering (Map-only jobs)

- Partial Sorting

Data Lookup Stratgies

- In MapFiles

Sorting Algorithm

- Total Sort (Globally Sorted Data)

- InputSampler

- Secondary Sort

Spark Specialization Components

1. Oreilly Apache Spark Certification

2. Apache Spark Training

3. Cloudera CCA175 Hadoop and Spark Certification

4. Apache Hadoop Professional Training