Are you interested in this course? Please let us know.
 Book nowWaitinglist
Prices are displayed without VAT by default.
  • Quick Contact Form

Cloudera Data Analyst Training

This four days hands-on data analyst training, focusing on Apache Pig and Hive and Cloudera Impala, will teach you to apply traditional data analytics and business intelligence skills to Big Data. Learn the tools data professionals need to access, manipulate, and analyze complex data sets using SQL and familiar scripting languages.

Programme and Course Overview

Xebia University (based in Hilversum, Amsterdam area) is an official training partner of Cloudera, the leader in Apache Hadoop-based software and services.

You will benefit from this course if..

  • You are a data analyst, business analyst, developer or administrator,
  • You have experience with SQL and basic UNIX or Linux commands.

Prior knowledge of Java and Apache Hadoop is not required.

What you will achieve:

This training alternates between instructional sessions and hands-on labs.
After completing this 4 days training:

You will know:

  • The fundamentals of Apache Hadoop and data ETL (extract, transform, load), ingestion, and processing with Hadoop tools,
  • How to apply the fundamentals of familiar scripting languages to the Hadoop cluster with Apache Pig.

You will have hands-on experience in: 

  • Joining multiple data sets and analyzing disparate data with Pig,
  • Organizing data into tables, performing transformations, and simplifying complex queries with Hive,
  • Making multi-structures data accessible with Hive.

You will have the skills to:

  • Perform real-time interactive analyses on massive data sets stored in HDFS or HBase using SQL with Impala,
  • Pick the best analysis tool for a given task in Hadoop
  • Enable real-time interactive analysis of the data stored in Hadoop via a native SQL environment with Cloudera Impala.

Please note, that you need to bring your own laptop for this training. This laptop should meet the following requirements:

  • At least 4GB RAM;
  • 15GB of free hard disk space;
  • VMware Player 5.x or above (Windows)/ VMware Fusion 4.x or above (Mac);
  • Your laptop must support a 64-bit VMware guest image. If the machines are running a 64-bit version of WIndows, or Mac OS X on a Core DUO 2 processor or later, not other test is required. Otherwise, VMware provides a tool to check compatibility, which can be downloaded from;
  • Your laptop must have VT-x virtualization support enabled in the BIOS;
  • If running Windows XP: 7-Zip or WinZip  is needed (due to a bug in Windows XP's built-in Zip utility).

Target Group & Prerequisites:

This course is best suited to data analysts, business analysts, developers and administrators who have experience with SQL and basic UNIX or Linux commands.

Prior knowledge of Java and Apache Hadoop is not required.