Databricks Databricks-Certified-Professional-Data-Engineer Exam (page: 4)
Databricks Certified Data Engineer Professional
Updated on: 02-Jan-2026

A table is registered with the following code:


Both users and orders are Delta Lake tables. Which statement describes the results of querying recent_orders?

  1. All logic will execute at query time and return the result of joining the valid versions of the source tables at the time the query finishes.
  2. All logic will execute when the table is defined and store the result of joining tables to the DBFS; this stored data will be returned when the table is queried.
  3. Results will be computed and cached when the table is defined; these cached results will incrementally update as new records are inserted into source tables.
  4. All logic will execute at query time and return the result of joining the valid versions of the source tables at the time the query began.
  5. The versions of each source table will be stored in the table transaction log; query results will be saved to DBFS with each query.

Answer(s): B



A production workload incrementally applies updates from an external Change Data Capture feed to a Delta Lake table as an always-on Structured Stream job. When data was initially migrated for this table, OPTIMIZE was executed and most data files were resized to 1 GB. Auto Optimize and Auto Compaction were both turned on for the streaming production job. Recent review of data files shows that most data files are under 64 MB, although each partition in the table contains at least 1 GB of data and the total table size is over 10 TB.
Which of the following likely explains these smaller file sizes?

  1. Databricks has autotuned to a smaller target file size to reduce duration of MERGE operations
  2. Z-order indices calculated on the table are preventing file compaction
  3. Bloom filter indices calculated on the table are preventing file compaction
  4. Databricks has autotuned to a smaller target file size based on the overall size of data in the table
  5. Databricks has autotuned to a smaller target file size based on the amount of data in each partition

Answer(s): A



Which statement regarding stream-static joins and static Delta tables is correct?

  1. Each microbatch of a stream-static join will use the most recent version of the static Delta table as of each microbatch.
  2. Each microbatch of a stream-static join will use the most recent version of the static Delta table as of the job's initialization.
  3. The checkpoint directory will be used to track state information for the unique keys present in the join.
  4. Stream-static joins cannot use static Delta tables because of consistency issues.
  5. The checkpoint directory will be used to track updates to the static Delta table.

Answer(s): A



A junior data engineer has been asked to develop a streaming data pipeline with a grouped aggregation using DataFramedf. The pipeline needs to calculate the average humidity and average temperature for each non-overlapping five-minute interval. Events are recorded once per minute per device.

Streaming DataFramedf has the following schema:
"device_id INT, event_time TIMESTAMP, temp FLOAT, humidity FLOAT"
Code block:


Choose the response that correctly fills in the blank within the code block to complete this task.

  1. to_interval("event_time", "5 minutes").alias("time")
  2. window("event_time", "5 minutes").alias("time")
  3. "event_time"
  4. window("event_time", "10 minutes").alias("time")
  5. lag("event_time", "10 minutes").alias("time")

Answer(s): B



A data architect has designed a system in which two Structured Streaming jobs will concurrently write to a single bronze Delta table. Each job is subscribing to a different topic from an Apache Kafka source, but they will write data with the same schema. To keep the directory structure simple, a data engineer has decided to nest a checkpoint directory to be shared by both streams.
The proposed directory structure is displayed below:


Which statement describes whether this checkpoint directory structure is valid for the given scenario and why?

  1. No; Delta Lake manages streaming checkpoints in the transaction log.
  2. Yes; both of the streams can share a single checkpoint directory.
  3. No; only one stream can write to a Delta Lake table.
  4. Yes; Delta Lake supports infinite concurrent writers.
  5. No; each of the streams needs to have its own checkpoint directory.

Answer(s): E



Viewing Page 4 of 37



Share your comments for Databricks Databricks-Certified-Professional-Data-Engineer exam with other users:

Saravana Kumar TS 12/8/2023 9:49:00 AM

question: 93 which statement is true regarding the result? sales contain 6 columns and values contain 7 columns so c is not right answer.
INDIA


Lue 3/30/2023 11:43:00 PM

highly recommend just passed my exam.
CANADA


DC 1/7/2024 10:17:00 AM

great practice! thanks
UNITED STATES


Anonymus 11/9/2023 5:41:00 AM

anyone who wrote this exam recently?
SOUTH AFRICA


Khalid Javid 11/17/2023 3:46:00 PM

kindly share the dump
Anonymous


Na 8/9/2023 8:39:00 AM

could you please upload cfe fraud prevention and deterrence questions? it will be very much helpful.
Anonymous


shime 10/23/2023 10:03:00 AM

this is really very very helpful for mcd level 1
ETHIOPIA


Vnu 6/3/2023 2:39:00 AM

very helpful!
Anonymous


Steve 8/17/2023 2:19:00 PM

question #18s answer should be a, not d. this should be corrected. it should be minvalidityperiod
CANADA


RITEISH 12/24/2023 4:33:00 AM

thanks for the exact solution
Anonymous


SB 10/15/2023 7:58:00 AM

need to refer the questions and have to give the exam
INDIA


Mike Derfalem 7/16/2023 7:59:00 PM

i need it right now if it was possible please
Anonymous


Isak 7/6/2023 3:21:00 AM

i need it very much please share it in the fastest time.
Anonymous


Maria 6/23/2023 11:40:00 AM

correct answer is d for student.java program
IRELAND


Nagendra Pedipina 7/12/2023 9:10:00 AM

q:37 c is correct
INDIA


John 9/16/2023 9:37:00 PM

q6 exam topic: terramearth, c: correct answer: copy 1petabyte to encrypted usb device ???
GERMANY


SAM 12/4/2023 12:56:00 AM

explained answers
INDIA


Andy 12/26/2023 9:35:00 PM

plan to take theaws certified developer - associate dva-c02 in the next few weeks
SINGAPORE


siva 5/17/2023 12:32:00 AM

very helpfull
Anonymous


mouna 9/27/2023 8:53:00 AM

good questions
Anonymous


Bhavya 9/12/2023 7:18:00 AM

help to practice csa exam
Anonymous


Malik 9/28/2023 1:09:00 PM

nice tip and well documented
Anonymous


rodrigo 6/22/2023 7:55:00 AM

i need the exam
Anonymous


Dan 6/29/2023 1:53:00 PM

please upload
Anonymous


Ale M 11/22/2023 6:38:00 PM

prepping for fsc exam
AUSTRALIA


ahmad hassan 9/6/2023 3:26:00 AM

pd1 with great experience
Anonymous


Žarko 9/5/2023 3:35:00 AM

@t it seems like azure service bus message quesues could be the best solution
UNITED KINGDOM


Shiji 10/15/2023 1:08:00 PM

helpful to check your understanding.
INDIA


Da Costa 8/27/2023 11:43:00 AM

question 128 the answer should be static not auto
Anonymous


bot 7/26/2023 6:45:00 PM

more comments here
UNITED STATES


Kaleemullah 12/31/2023 1:35:00 AM

great support to appear for exams
Anonymous


Bsmaind 8/20/2023 9:26:00 AM

useful dumps
Anonymous


Blessious Phiri 8/13/2023 8:37:00 AM

making progress
Anonymous


Nabla 9/17/2023 10:20:00 AM

q31 answer should be d i think
FRANCE