skip to main |
skip to left sidebar
skip to right sidebar
Big data training in CANADA
Big Data Platform and Analysis Service reduce data processing time and operational costs, improves computational capability, and enhances data analysis, coverage, and depth. It also enables customers to regain access to data stored over a long timeframe, which was previously inaccessible due to the sheer volume of data.
Hadoop Training Course Content:
1. Understanding Big Data – What is Big Data ?
Real world issues with BIG Data – Ex: How facebook manage peta bytes of data.
Will regular traditional approach works?
2. How Hadoop Evolved
Back to Hadoop evolution.
The ecosystem and stack: HDFS, MapReduce, Hive, Pig…
Cluster architecture overview
3. Environment for Hadoop development
Hadoop distribution and basic commands
Eclipse development
4. Understanding HDFS
Command line and web interfaces for HDFS
Exercises on HDFS Java API
5. Understanding MapReduce
Core Logic: move computation, not data
Base concepts: Mappers, reducers, drivers
The MapReduce Java API (lab).
6. Real-World MapReduce
Optimizing with Combiners and Partitioners (lab)
More common algorithms: sorting, indexing and searching (lab)
Relational manipulation: map-side and reduce-side joins (lab)
Chaining Jobs
Testing with MRUnit
7. Higher-level Tools
Patterns to abstract “thinking in MapReduce”
The Cascading library (lab)
The Hive database (lab)
Interested ? Enroll into our online Apache Hadoop training program now.
0 comments:
Post a Comment