×

Special Offer! November Sale at DumpsCity! Get 20% Off on All Certification Exam Questions. Use Code: DC20OFF

Free Microsoft DP-203 Exam Questions

Try our Free Demo Practice Tests for Comprehensive DP-203 Exam Preparation

  • Microsoft DP-203 Exam Questions
  • Provided By: Microsoft
  • Exam: Data Engineering on Microsoft Azure
  • Certification: Azure Data Engineer Associate
  • Total Questions: 373
  • Updated On: Nov 12, 2024
  • Rated: 4.9 |
  • Online Users: 746
Page No. 1 of 75
Add To Cart
  • Question 1
    • You are creating an Azure Data Factory data flow that will ingest data from a CSV file, cast columns to specified types of data, and insert the data into a table in an

      Azure Synapse Analytic dedicated SQL pool. The CSV file contains three columns named username, comment, and date.

      The data flow already contains the following:

      ✑ A source transformation.

      ✑ A Derived Column transformation to set the appropriate types of data.

      ✑ A sink transformation to land the data in the pool.

      You need to ensure that the data flow meets the following requirements:

      ✑ All valid rows must be written to the destination table.

      ✑ Truncation errors in the comment column must be avoided proactively.

      ✑ Any rows containing comment values that will cause truncation errors upon insert must be written to a file in blob storage.

      Which two actions should you perform? Each correct answer presents part of the solution.

      NOTE: Each correct selection is worth one point.


      Answer: A,B
  • Question 2
    • You are implementing a batch dataset in the Parquet format.

      Data files will be produced be using Azure Data Factory and stored in Azure Data Lake Storage Gen2. The files will be consumed by an Azure Synapse Analytics serverless SQL pool.

      You need to minimize storage costs for the solution.

      What should you do?


      Answer: C
  • Question 3
    • You plan to perform batch processing in Azure Databricks once daily.

      Which type of Databricks cluster should you use?


      Answer: B
  • Question 4
    • You plan to build a structured streaming solution in Azure Databricks. The solution will count new events in five-minute intervals and report only events that arrive during the interval. The output will be sent to a Delta Lake table.

      Which output mode should you use?


      Answer: C
  • Question 5
    • You are designing a financial transactions table in an Azure Synapse Analytics dedicated SQL pool. The table will have a clustered columnstore index and will include the following columns:

      ✑ TransactionType: 40 million rows per transaction type

      ✑ CustomerSegment: 4 million per customer segment

      ✑ TransactionMonth: 65 million rows per month

      AccountType: 500 million per account type

      You have the following query requirements:

      ✑ Analysts will most commonly analyze transactions for a given month.

      ✑ Transactions analysis will typically summarize transactions by transaction type, customer segment, and/or account type

      You need to recommend a partition strategy for the table to minimize query times.

      On which column should you recommend partitioning the table?


      Answer: D
PAGE: 1 - 75
Add To Cart

© Copyrights Dumpscity 2024. All Rights Reserved

We use cookies to ensure your best experience. So we hope you are happy to receive all cookies on the Dumpscity.