Tuesday 23 July 2013

Big data training in CANADA

Big Data Platform and Analysis Service reduce data processing time and operational costs, improves computational capability, and enhances data analysis, coverage, and depth. It also enables customers to regain access to data stored over a long timeframe, which was previously inaccessible due to the sheer volume of data.

Visit: www.magnifictraining.com

Hadoop Training Course Content:

1. Understanding Big Data – What is Big Data ?

  • Real world issues with BIG Data – Ex: How facebook manage peta bytes of data.

  • Will regular traditional approach works?

2. How Hadoop Evolved

  • Back to Hadoop evolution.

  • The ecosystem and stack: HDFS, MapReduce, Hive, Pig…

  • Cluster architecture overview

3. Environment for Hadoop development

  • Hadoop distribution and basic commands

  • Eclipse development

4. Understanding HDFS

  • Command line and web interfaces for HDFS

  • Exercises on HDFS Java API

5. Understanding MapReduce

  • Core Logic: move computation, not data

  • Base concepts: Mappers, reducers, drivers

  • The MapReduce Java API (lab).



6. Real-World MapReduce

  • Optimizing with Combiners and Partitioners (lab)

  • More common algorithms: sorting, indexing and searching (lab)

  • Relational manipulation: map-side and reduce-side joins (lab)

  • Chaining Jobs

  • Testing with MRUnit

7. Higher-level Tools

  • Patterns to abstract “thinking in MapReduce”

  • The Cascading library (lab)

  • The Hive database (lab)

Interested ? Enroll into our online Apache Hadoop training program now.


0 comments:

Post a Comment