e-learning: 70-775: Perform Data Engineering on Microsoft Azure HDInsight
Description:This e-learning covers how to plan and implement big data workflows on HDInsight. It covers the administration and provisioning of HDInsight clusters, the implementation of a big data batch, and interactive and real-time processing solutions. It will help prepare you for exam 70-775: Perform Data Engineering on Microsoft Azure HDInsight.
Prerequisites:In addition to their professional experience, students who attend this course should have:
Programming experience using R, and familiarity with common R packages
Knowledge of common statistical methods and data analysis best practices.
Basic knowledge of the Microsoft Windows operating system and its core functionality.
Working knowledge of relational databases.
Deploy HDInsight Clusters.
Authorizing Users to Access Resources.
Loading Data into HDInsight.
Implement Batch Solutions.
Design Batch ETL Solutions for Big Data with Spark
Analyze Data with Spark SQL.
Analyze Data with Hive and Phoenix.
Describe Stream Analytics.
Implement Spark Streaming Using the DStream API.
Develop Big Data Real-Time Processing Solutions with Apache Storm.
Build Solutions that use Kafka and HBase.
Who should attend:The primary audience for this course is data engineers, data architects, data scientists, and data developers who plan to implement big data engineering workflows on HDInsight..
Getting Started with Microsoft Azure HDInsight and Administering clusters
Working with HDInsight Clusters
Managing HDInsight Data, Jobs, and Security
Batch Solutions with Hive and Apache Pig
Operationalize and Design with Spark
Interactive Queries with Spark SQL and Interactive Hive
Data Analysis Using Spark SQL and Hive
Interactive Processing using Apache Phoenix on HBase
Create Spark Streaming Applications
Develop Real-time Processing Solutions with Apache Storm
Building Solutions using Kafka and HBase
Tips & Tricks
Optional: MeasureUp Exam simulation