My DP-201 Completion Badge

Azure Data engineer DP-201 /DP-203 Certification Tips

Cloud & Data Science
4 min readFeb 27, 2021

In this story, we are going to see

How to prepare for Azure Data Engineer DP-201 Certification

Exam overview and rough breakup of questions

Important concepts and links

Exam Taking Tips

Recently I started you tube channel to share my knowledge on cloud and data science. I am going to upload more videos !! please subscribe for getting notification Link: https://youtu.be/ZOvFifemhHM

How to prepare for Azure Data Engineer DP-201 Certification

I have cleared Azure data Engineer DP-201certification with 882/1000 marks. If you read this story, you can also clear this exam with good marks.I took the exam from my home.

I prepared for a week and refreshed my knowledge in Azure gained from my current project on Microsoft website and practiced free mock test.

Understanding the design decisions for Batch ad Stream data pipelines is key to this DP-201 exam.

Exam overview and Rough breakup of Questions

The exam duration is 180 minutes
The number of questions that I got in exam are around 45. The following is the high-level break of the questions in the exam.

Lambda and kappa architectures :
Hot path and Cold Path — 5 to 6 questions.

Azure storage solutions :

Azure Blob storage and ADLS Gen2 — 5 to 6 questions

Azure Cosmos db — 4 to 5 questions
Azure SQL db — 4 to 5 questions
Azure synopsis — 4 to 5 questions

Data processing solutions :

Azure databricks — 5 to 6 questions

Azure stream analytics — 3 to 4

Azure data factory — 4 to 5

Data security and compliance :

Data encryption — 2 to 3 questions

Key Vault — 2 to 3 questions

Backup and disaster recovery –1or 2

Use case — around 8 questions

Important Concepts and Links

ARCHITECTURE

What are the various layers/paths in Lambda Architecture

-Hot path, cold path, batch layer, speed layer, serving layer Reference:

What is the difference between Lambda and Kappa architecture

Kappa uses same tech stack for both real time and batch processing

Reference:

https://hazelcast.com/glossary/kappa-architecture/

COSMOS DB

When to use Azure Cosmos db

-low latency in ms, high availability -99.999, High throughout, multi regional

https://docs.microsoft.com/en-us/azure/cosmos-db/use-cases

What are the various types of API

-Core (SQL) api, Gremlin (graph) api, Mongo DB api, Cassandra api, Table api

https://docs.microsoft.com/en-us/learn/modules/choose-api-for-cosmos-db/

AZURE DATABRICKS

What are the various types of Data bricks cluster types

Interactive cluster ,Automatic Job cluster and concurrency cluster https://docs.databricks.com/clusters/index.html

When to use Stream analytics

eg:Real time alerts, dashboards, job service

e.g. https://docs.microsoft.com/en-us/azure/stream-analytics/streaming-technologies

What are various window types in stream analytics?

Hopping, Tumbling, Sliding, Session

https://docs.microsoft.com/en-us/azure/stream-analytics/stream-analytics-window-functions

AZURE DATA FACTORY

What are various Integration Runtimes
Self hosted, Azure hosted, Azure SSIS

https://docs.microsoft.com/en-us/azure/data-factory/concepts-integration-runtime

AZURE SYNOPSE

What are the various partitions and when to choose what partition type

Hash, Round Robin and Replicated

https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/massively-parallel-processing-mpp-architecture

When to use Azure SQL db and when to use Azure Synopsis

Azure SQL DB is for OLTP and Azure synopsis is for OLAP (Massive Parallel processing)

Azure storage accounts: Comparison of various storage accounts

E.g. Blob, Gen 1, Gen 2

What are the various Storage account tiers and life cycle management

Hot, Cold and Archive

Various Redundancy support for different storage accounts

E.g. LRS (Locally Redundant Storage)and GRS(Geo Redundant Storage)

AZURE SQL DB

What are the various data migration tools from on premise to azure. E.g. Data base migration service(DMS)

https://docs.microsoft.com/en-us/azure/azure-sql/migration-guides/database/sql-server-to-sql-database-overview

What are various back up policies for disaster recovery.

Azure SQL db default Back up retention is last 7 days

https://docs.microsoft.com/en-us/azure/azure-sql/database/automated-backups-overview?tabs=single-database

What is Elastic pool. What are various purchasing models.

DTU and vCore model

DATA SECURITY

What are various data encryption and data masking techniques?

Always Encrypted, Encryption at Rest

https://azure.microsoft.com/en-us/blog/transparent-data-encryption-or-always-encrypted/

When to use what masking technique? Eg:Credit card

How to store secrets and keys in azure?

Azure Key Vault

https://docs.microsoft.com/en-us/azure/key-vault/general/basic-concepts

TIPS AND PRACTICE QUESTIONS

  • On the day of exam, I completed all the 46 questions in 2 hours and reviewed the questions for last 10 min

Use elimination rule to eliminate the wrong answers and focus on key words like stream/real time/cold path/hot path/un-structured

  • Skip the unknown questions and answer at the end
  • Practice mock questions before actual exam
  • You can watch my video on “AZURE DATA ENGINEER DP-201 Mock Questions” Link: https://youtu.be/ZOvFifemhHM

--

--

Cloud & Data Science

Passionate about Data Science, AI, ML, & Cloud Computing. Please check my channel for latest IT videos: www.youtube.com/c/CloudDataScience