Cloudera"Developer"Training"
for"Apache"Hadoop"
©"Copyright"2010/2012"Cloudera."All"rights"reserved."Not"to"be"reproduced"without"prior"wri=en"consent."
01#1$
201210"
IntroducDon"
Chapter"1"
©"Copyright"2010/2012"Cloudera."All"rights"reserved."Not"to"be"reproduced"without"prior"wri=en"consent."
01#2$
Course"Chapters"
! $Introduc/on$
! "The"MoDvaDon"for"Hadoop"
! "Hadoop:"Basic"Concepts"
! "WriDng"a"MapReduce"Program"
! "Unit"TesDng"MapReduce"Programs"
! "Delving"Deeper"into"the"Hadoop"API"
! "PracDcal"Development"Tips"and"Techniques"
! "Data"Input"and"Output"
! "Common"MapReduce"Algorithms"
! "Joining"Data"Sets"in"MapReduce"Jobs"
! "IntegraDng"Hadoop"into"the"Enterprise"Workflow"
! "Machine"Learning"and"Mahout"
! "An"IntroducDon"to"Hive"and"Pig"
! "An"IntroducDon"to"Oozie"
! "Conclusion"
! "Cloudera"Enterprise"
! "Graph"ManipulaDon"in"MapReduce"""
Course$Introduc/on$
IntroducDon"to"Apache"Hadoop"
and"its"Ecosystem"
Basic"Programming"with"the"
Hadoop"Core"API"
Problem"Solving"with"MapReduce"
The"Hadoop"Ecosystem"
Course"Conclusion"and"Appendices"
©"Copyright"2010/2012"Cloudera."All"rights"reserved."Not"to"be"reproduced"without"prior"wri=en"consent."
01#3$
Chapter"Topics"
Course$Introduc/on$
! About$this$course$
! About"Cloudera"
! Course"logisDcs"
Introduc/on$
©"Copyright"2010/2012"Cloudera."All"rights"reserved."Not"to"be"reproduced"without"prior"wri=en"consent."
01#4$
Course"ObjecDves"
During$this$course,$you$will$learn:$
! The$core$technologies$of$Hadoop$
! How$HDFS$and$MapReduce$work$
! How$to$develop$MapReduce$applica/ons$
! How$to$unit$test$MapReduce$applica/ons$
! How$to$use$MapReduce$combiners,$par//oners,$and$the$distributed$cache$
! Best$prac/ces$for$developing$and$debugging$MapReduce$applica/ons$
! How$to$implement$data$input$and$output$in$MapReduce$applica/ons$
©"Copyright"2010/2012"Cloudera."All"rights"reserved."Not"to"be"reproduced"without"prior"wri=en"consent."
01#5$
Course"ObjecDves"(cont’d)"
! Algorithms$for$common$MapReduce$tasks$
! How$to$join$data$sets$in$MapReduce$
! How$Hadoop$integrates$into$the$data$center$
! How$to$use$Mahout’s$Machine$Learning$algorithms$
! How$Hive$and$Pig$can$be$used$for$rapid$applica/on$development$
! How$to$create$large$workflows$using$Oozie$
©"Copyright"2010/2012"Cloudera."All"rights"reserved."Not"to"be"reproduced"without"prior"wri=en"consent."
01#6$
Chapter"Topics"
Course$Introduc/on$
Introduc/on$
! About"This"Course"
! About$Cloudera$
! Course"LogisDcs"
©"Copyright"2010/2012"Cloudera."All"rights"reserved."Not"to"be"reproduced"without"prior"wri=en"consent."
01#7$
About"Cloudera"
! Cloudera$is$“The$commercial$Hadoop$company”$
! Founded$by$leading$experts$on$Hadoop$from$Facebook,$Google,$Oracle$
and$Yahoo$
! Provides$services$and$products$for$Hadoop$users$
– ConsulDng"and"training"services"
– Management"tools"
! Staff$includes$commi]ers$to$virtually$all$Hadoop$projects$
©"Copyright"2010/2012"Cloudera."All"rights"reserved."Not"to"be"reproduced"without"prior"wri=en"consent."
01#8$