公開課
| 培訓(xùn)對象
系統(tǒng)管理員或者任何需要管理Apache Hadoop機(jī)群的人員(包括產(chǎn)品及開發(fā)環(huán)境)。
企業(yè)管理者、CIO、CTO、政府信息部門官員、項(xiàng)目(開發(fā))經(jīng)理、咨詢顧問
IT經(jīng)理,IT咨詢顧問,IT支持專家
系統(tǒng)工程師、數(shù)據(jù)中心管理員、云計(jì)算管理員及想加入云計(jì)算隊(duì)伍的您
| 學(xué)員基礎(chǔ)
需要具備和掌握Linux系統(tǒng)管理和網(wǎng)絡(luò)相關(guān)技能和經(jīng)驗(yàn)。無需具備Hadoop基礎(chǔ)和經(jīng)驗(yàn)。
| 培訓(xùn)時(shí)間
4天
| 認(rèn)證考試
參加培訓(xùn)的學(xué)員將獲得Cloudera Certified Administrator for Apache Hadoop (CCAH) 認(rèn)證, 考試代碼:CCA-410
| 學(xué)習(xí)內(nèi)容
Hadoop分布式文件系統(tǒng)和MapReduce工作原理
Hadoop集群硬件配置規(guī)劃
Hadoop集群網(wǎng)絡(luò)配置規(guī)劃
Hadoop集群配置及優(yōu)化
如何配置NameNode HA
任何配置NameNode Federation
任何配置FairScheduler為多用戶共享Hadoop集群
任何為Hadoop集群安裝和實(shí)現(xiàn)基于Kerberos的安全性
如何維護(hù)和監(jiān)測Hadoop集群
如何使用Flume加載動(dòng)態(tài)產(chǎn)生的文件以及使用Sqoop連接關(guān)系數(shù)據(jù)庫進(jìn)行數(shù)據(jù)導(dǎo)入導(dǎo)出
Hive、Pig和HBase等Hadoop生態(tài)系統(tǒng)工具相關(guān)的系統(tǒng)管理工作
模塊 | 內(nèi)容 |
The Case for Apache Hadoop | Why Hadoop? A Brief History of Hadoop Core Hadoop Components Fundamental Concepts |
HDFS
| HDFS Features Writing and Reading Files NameNode Considerations Overview of HDFS Security Using the Namenode Web UI Using the Hadoop File Shell |
Getting Data into HDFS | Ingesting Data from External Sources with Flume Ingesting Data from Relational Databases with Sqoop REST Interfaces Best Practices for Importing Data |
MapReduce | What Is MapReduce? Features of MapReduce Basic Concepts Architectural Overview MapReduce Version 2 Failure Recovery Using the JobTracker Web UI |
Planning Your Hadoop Cluster
| General Planning Considerations Choosing the Right Hardware Network Considerations Configuring Nodes Planning for Cluster Management |
Hadoop Installation and Initial Configuration
| Deployment Types Installing Hadoop Specifying the Hadoop Configuration Performing Initial HDFS Configuration Performing Initial MapReduce Configuration Log File Locations |
Installing and Configuring Hive, Impala, and Pig
| Hive Impala Pig |
Hadoop Clients
| What is a Hadoop Client? Installing and Configuring Hadoop Clients Installing and Configuring Hue Hue Authentication and Configuration |
Cloudera Manager
| The Motivation for Cloudera Manager Cloudera Manager Features Standard and Enterprise Versions Cloudera Manager Topology Installing Cloudera Manager Installing Hadoop Using Cloudera Manager Performing Basic Administration Tasks Advanced Cluster Configuration Advanced Configuration Parameters Configuring Hadoop Ports Explicitly Including and Excluding Hosts Configuring HDFS for Rack Awareness Configuring HDFS High Availability |
Hadoop Security
| Why Hadoop Security Is Important Hadoop’s Security System Concepts What Kerberos Is and How it Works Securing a Hadoop Cluster with Kerberos |
Managing and Scheduling Jobs
| Managing Running Jobs Scheduling Hadoop Jobs Configuring the FairScheduler Cluster Maintenance Checking HDFS Status Copying Data Between Clusters Adding and Removing Cluster Nodes Rebalancing the Cluster NameNode Metadata Backup Cluster Upgrading |
Cluster Monitoring and Troubleshooting
| General System Monitoring Managing Hadoop’s Log Files Monitoring Hadoop Clusters Common Troubleshooting Issues |