BLUEGRANITE LAB

Create an Analytic Pipeline with Azure Data Factory

LP Arrow

Please note: this lab content is in the process of being updated and may contain out of date information. If you have any questions, please do not hesitate to contact us

This lab explores creating a big data analytics pipeline with Azure Data Factory. Data Factory is a key component of the Cortana Intelligence Suite that orchestrates data movement through different storage layers and computer processes, turning raw data into intelligent actions.

In this lab, you will upload a file to Blob storage and use Data Factory to create a pipeline that will automate processing using Hive scripts in an HDInsight cluster.

Download the lab documentation and you’ll learn about:

  • Creating an instance of the Azure Data Factory service
  • Exploring key concepts such as linked services, datasets, pipelines, and activities
  • Creating linked services, datasets, and pipelines in the Azure Portal
  • Deploying and scheduling a pipeline
  • Monitoring and managing a pipeline using the Monitor & Manage tool

BlueGranite developed this material in conjuction with Microsoft, and the lab and its contents are property of Microsoft. Microsoft holds no legal obligations on quality or performance of the lab material.   Fill out the form below to get access to the step-by-step lab document.  Click here for additional hands-on labs.

Download our Azure Data Factory LAB