Cloudera Administrator
Training for Apache Hadoop
Introduction
Chapter 1
Course Chapters
§ Introduction
§ The Case for Apache Hadoop
§ Hadoop Cluster Installation
§ The Hadoop Distributed File System (HDFS)
§ MapReduce and Spark on YARN
§ Hadoop Configuration and Daemon Logs
§ Getting Data Into HDFS
§ Planning Your Hadoop Cluster
§ Installing and Configuring Hive, Impala, and Pig
§ Hadoop Clients Including Hue
§ Advanced Cluster Configuration
§ Hadoop Security
§ Managing Resources
§ Cluster Maintenance
§ Cluster Monitoring and Troubleshooting
§ Conclusion
© Copyright 2010-2017 Cloudera. All rights reserved. Not to be reproduced or shared without prior written consent from Cloudera.
01-3
Trademark Information
§ The names and logos of Apache products mentioned in Cloudera training courses,
including those listed below, are trademarks of the Apache Software Foundation
– Apache Accumulo
– Apache Avro
– Apache Bigtop
– Apache Crunch
– Apache Flume
– Apache Hadoop
– Apache HBase
– Apache HCatalog
– Apache Hive
– Apache Impala (incubating)
– Apache Kafka
– Apache Kudu
– Apache Lucene
– Apache Mahout
– Apache Oozie
– Apache Parquet
– Apache Pig
– Apache Sentry
– Apache Solr
– Apache Spark
– Apache Sqoop
– Apache Tika
– Apache Whirr
– Apache ZooKeeper
§ All other product names, logos, and brands cited herein are the property of their
respective owners
© Copyright 2010-2017 Cloudera. All rights reserved. Not to be reproduced or shared without prior written consent from Cloudera.
01-4
Chapter Topics
Introduction
§ About This Course
§ About Cloudera
§ Course Logistics
§ Introductions
© Copyright 2010-2017 Cloudera. All rights reserved. Not to be reproduced or shared without prior written consent from Cloudera.
01-5
Course Objectives (1)
During this course, you will learn:
§ The functions of the core technologies of Hadoop
§ How Cloudera Manager simplifies Hadoop installation and administration
§ How to deploy a Hadoop cluster using Cloudera Manager
§ How to run YARN applications, including MapReduce and Spark
§ How to populate HDFS from external sources using Sqoop and Flume
§ How to plan your Hadoop cluster hardware and software
§ What issues to consider when installing Hive and Impala
§ What issues to consider when deploying Hadoop clients
§ How to configure HDFS for high availability
© Copyright 2010-2017 Cloudera. All rights reserved. Not to be reproduced or shared without prior written consent from Cloudera.
01-6
Course Objectives (2)
§ What issues to consider when implementing Hadoop security
§ How to configure resource scheduling on the cluster
§ How to maintain your cluster
§ How to monitor, troubleshoot, and optimize the cluster
© Copyright 2010-2017 Cloudera. All rights reserved. Not to be reproduced or shared without prior written consent from Cloudera.
01-7
Chapter Topics
Introduction
§ About This Course
§ About Cloudera
§ Course Logistics
§ Introductions
© Copyright 2010-2017 Cloudera. All rights reserved. Not to be reproduced or shared without prior written consent from Cloudera.
01-8