Fast review map for this domain:
| Exam signal | First object to inspect | Correct-answer pattern |
|---|---|---|
| A pipeline, notebook, or dataflow fails | Run history, refresh history, activity output | Diagnose the failed boundary before retrying or scaling |
| A shortcut cannot read files | Shortcut target, credential, permission, path | Fix the referenced-data dependency before changing transformation code |
| Query or job performance is slow | Metrics, query evidence, Spark stage details, table layout | Optimize the measured bottleneck, not a generic adjacent feature |
| Need to know who changed or accessed an item | Microsoft Purview audit evidence | Use audit logs for actor, action, item, and timestamp questions |
| Need production readiness | Run success, freshness, duration, quality checks, access review | Combine execution, data quality, performance, and governance evidence |
flowchart TD
A[Symptom reported] --> B{What kind of signal?}
B -->|Execution failed| C[Run or refresh history]
B -->|Access or change investigation| D[Audit and permission evidence]
B -->|Slow workload| E[Metrics and engine evidence]
B -->|Bad data| F[Validation and quality checks]
C --> G[Fix owner dependency]
D --> G
E --> G
F --> G
Practice Question: A pipeline fails only when invoking a notebook from the pipeline. The notebook succeeds when run manually with hardcoded values. What should the data engineer inspect first?
A. The pipeline activity parameter mapping and run output.
B. The workspace sensitivity label.
C. The deployment pipeline stage comparison.
D. The endorsement status of the lakehouse.
Correct Answer: A.
Explanation: A is correct because the failure appears only through orchestration, so parameter binding and activity output are the nearest evidence. B, C, and D do not control notebook runtime parameters. Exam Takeaway: Diagnose at the boundary where success turns into failure; distractors use valid Fabric features outside the failing execution path.
Error resolution begins with run evidence. Pipeline run history identifies the failed activity and output payload. Dataflow refresh history shows connector, schema, transformation, or destination errors. Notebook output shows cell-level exceptions and Spark behavior. Shortcut status reveals path, credential, or external location issues. Permission views show whether the identity can access the workspace, item, or data object.
The why-layer is dependency localization. Every Fabric failure has an owner: trigger, activity, notebook, dataflow, shortcut, credential, schema, permission, or engine. Fixing the wrong owner wastes time and may introduce risk. A schema drift error needs schema handling, not capacity scaling. A shortcut credential error needs path or permission correction, not notebook code tuning.
| Object | Attribute | Value Range | Default State | Dependency | Failure State |
|---|---|---|---|---|---|
| Pipeline run | Activity status and output | Succeeded, failed, canceled, skipped | No run until triggered | Activity configuration and credentials | Failed activity blocks downstream process |
| Dataflow Gen2 refresh | Step and connector error | Refresh success or failure details | No refresh evidence until run | Source credential and destination | Refresh stops at connector or transformation step |
| Notebook run | Cell exception and Spark state | Completed, failed, canceled | Manual or pipeline run context | Spark runtime, parameters, data access | Cell fails or writes incomplete output |
| OneLake shortcut | Resolution and credential state | Healthy or error state | Not validated until accessed | Target path and permission | Files cannot be browsed or read |
| Permission assignment | Identity and scope | Workspace, item, data object | No access unless assigned | Microsoft Entra identity | 401, 403, hidden item, or query denial |
Use Fabric run history, refresh logs, notebook output, shortcut browse behavior, item permission panels, SQL/KQL error messages, and capacity metrics as evidence.
A scheduled or manual operation creates a service execution context. That context resolves identity, item configuration, parameters, source path, schema, and target write permissions. If any dependency fails, the owning runtime returns an error at the nearest observable boundary. Reading that boundary first prevents symptom-only remediation. A retry without fixing credential, path, schema, or parameter state simply repeats the same dependency failure.
| Task | Precise Command or Path | Verification Standard |
|---|---|---|
| Validate failed activity evidence | Fabric portal > Pipeline > Run history > Failed activity output | Error message identifies activity and dependency |
| Validate Dataflow error step | Fabric portal > Dataflow Gen2 > Refresh history > Details | Failed connector, step, or destination is visible |
| Validate notebook exception | Notebook run output or pipeline notebook activity output | Failing cell and parameter context are visible |
| Validate shortcut access | Fabric portal > Lakehouse > Shortcuts > Browse target | Shortcut opens expected path without credential error |
Practice Question: Queries over a lakehouse Delta table scan many small files and take longer after frequent incremental writes. What is the most aligned optimization target?
A. Optimize the lakehouse table layout and file organization.
B. Endorse the item as promoted.
C. Add a row-level security rule.
D. Create a new workspace domain.
Correct Answer: A.
Explanation: A is correct because the symptom is file layout and scan cost after incremental writes. B affects trust signal. C affects access filtering. D affects organization. Exam Takeaway: Let performance evidence name the object; distractors often improve governance rather than execution.
Optimization starts with measurement. Lakehouse table optimization focuses on file size, table maintenance, partitioning, and scan reduction. Pipeline optimization focuses on activity order, parallelism, copy settings, dependency paths, and retry behavior. Warehouse optimization focuses on query shape, distribution of work, statistics or supported tuning features, and relational design. Eventstreams and Eventhouses focus on ingestion throughput, backlog, retention, and query efficiency. Spark optimization focuses on partitioning, shuffle, skew, caching, and runtime settings. Query optimization focuses on filters, joins, projections, and engine-specific execution plans.
The why-layer is bottleneck ownership. A slow query over a poorly maintained table will not be fixed by adding a pipeline retry. A Spark shuffle skew issue will not be fixed by a sensitivity label. Optimization must reduce the measured cost at the point where the system spends time, memory, or throughput.
| Object | Attribute | Value Range | Default State | Dependency | Failure State |
|---|---|---|---|---|---|
| Lakehouse Delta table | File layout and maintenance state | Small files, compacted files, partitioned data | Depends on writes | Table maintenance support and workload pattern | Slow scans and high metadata overhead |
| Pipeline | Activity concurrency and dependency graph | Sequential or parallel where supported | Authored by designer | Source and target throughput | Long wall-clock runtime or avoidable waits |
| Warehouse query | Predicate, join, aggregation, plan behavior | Engine-supported SQL patterns | Query text as submitted | Table design and statistics-like metadata where supported | Excessive scan or slow join |
| Spark job | Partitioning and shuffle behavior | Balanced or skewed partitions | Derived from data and code | Spark runtime and data distribution | Executor spill, skew, long stage runtime |
| Eventhouse or Eventstream | Ingestion and query throughput | Healthy, delayed, backlogged | Depends on source rate | Capacity and configuration | Late data, query latency, dropped or delayed processing |
Use supported Fabric monitoring views, run history, Spark UI or notebook metrics where exposed, warehouse query monitoring, Eventstream/Eventhouse monitoring, and table inspection evidence.
The workload reads data, schedules compute, executes transformations or query operators, and writes or returns results. Each stage consumes time and resources. Small files increase metadata and scan overhead. Bad joins increase shuffle or relational work. Sequential pipeline dependencies increase wall-clock time. Streaming backlog grows when ingestion exceeds processing. The optimization changes the cost driver at the measured stage; if the change targets a different object, the original cost remains.
| Task | Precise Command or Path | Verification Standard |
|---|---|---|
| Validate lakehouse table scan issue | Fabric Lakehouse table details or supported notebook table inspection | File count, partition layout, or scan pattern explains slow query |
| Validate pipeline bottleneck | Fabric portal > Pipeline > Run history > Activity durations | Longest activity or avoidable wait path is identified |
| Validate warehouse query behavior | Warehouse query monitoring or supported execution evidence | Slow query, scan, join, or wait pattern is visible |
| Validate Spark skew | Notebook Spark execution details or Spark UI where available | One or more stages or partitions dominate runtime |
Practice Question: A manager asks which user deleted a Fabric item last week. Which evidence source should the data engineer use?
A. Microsoft Purview audit logs for Fabric activity.
B. Spark workspace settings.
C. Lakehouse table optimization history only.
D. A Dataflow Gen2 transformation step.
Correct Answer: A.
Explanation: A is correct because audit logs record user activity and operations when available and permitted. B is configuration. C is table maintenance evidence. D is transformation logic. Exam Takeaway: Use audit evidence for user actions; distractors often name runtime or transformation features that cannot answer who performed an operation.
Monitoring a Fabric analytics solution requires evidence separation. Run history tells whether a pipeline, notebook, or dataflow ran and where it failed. Metrics show capacity pressure, throughput, latency, or resource patterns. Audit logs show user and administrative activity. Query and engine evidence show execution details. Readiness criteria combine these signals into operational standards such as successful scheduled runs, acceptable duration, controlled access, validated row counts, and known recovery steps.
The why-layer is operational confidence. A solution is not production-ready because it ran once manually. It must produce repeatable run evidence, expose failures, show acceptable performance, and provide auditability for sensitive operations. Without monitoring, failures become user-discovered incidents rather than controlled engineering events.
| Object | Attribute | Value Range | Default State | Dependency | Failure State |
|---|---|---|---|---|---|
| Run history | Status, duration, activity output | Succeeded, failed, canceled, skipped | Available after run | Pipeline, notebook, dataflow execution | Unknown failure point or missed SLA |
| Capacity or workload metric | Utilization, latency, throttling-style signal | Normal, elevated, saturated | Depends on workload | Monitoring access and capacity telemetry | Slow workloads without root evidence |
| Audit log | User, operation, item, timestamp | Searchable events where enabled | Requires audit capability | Microsoft Purview permissions and retention | Cannot prove who changed or accessed item |
| Data-quality checkpoint | Count, freshness, exception rows | Pass, warn, fail | Not present unless designed | Validation logic and target metadata | Bad data reaches consumers |
| Operational readiness criteria | SLA, recovery, validation, access review | Met or unmet | Undefined unless documented | Monitoring and ownership | Production handoff lacks measurable standard |
Use Fabric portal run history, Fabric monitoring views, Microsoft Purview audit search, query result checks, validation tables, and documented operational runbooks as evidence.
A scheduled analytics solution runs under service control and emits activity state. Engines and workloads emit performance signals as they consume capacity and process data. Governance systems emit audit events for user and administrative actions. Validation logic emits data-quality evidence. Monitoring connects these signals to operational decisions. If the wrong signal is used, the team may know that something is slow but not why, or know that an item changed but not who changed it.
| Task | Precise Command or Path | Verification Standard |
|---|---|---|
| Validate scheduled process health | Fabric portal > Pipeline/Dataflow/Notebook > Run or refresh history | Latest scheduled run succeeded within expected duration |
| Validate performance baseline | Fabric monitoring or workload metrics view | Current duration or utilization is compared against baseline |
| Validate audit investigation | Microsoft Purview audit search filtered to Fabric item and date | Event shows actor, action, item, and timestamp |
| Validate data-quality readiness | Query validation table or quality-check output | Freshness, counts, and exception thresholds meet criteria |
What should be inspected first when a Fabric pipeline fails during ingestion?
Inspect the pipeline run history, failed activity details, input and output parameters, and connector or permission errors.
Pipeline run history identifies the failing activity and provides the operational evidence needed to isolate the cause. Many ingestion failures come from connection settings, credentials, schema drift, missing files, timeout behavior, or incorrect parameters. DP-700 scenarios expect a data engineer to start with execution evidence before changing unrelated downstream artifacts.
Demand Score: 94
Exam Relevance Score: 98
How should a Dataflow Gen2 refresh failure be approached?
Review refresh history, transformation step errors, source credentials, gateway or connection status, and schema changes in the source data.
Dataflow Gen2 errors often originate from source access, changed column names or data types, invalid transformation steps, or refresh configuration issues. Checking the refresh evidence first narrows the failure to the exact step or dependency. This matches the exam pattern of resolving errors by identifying the Fabric item and dependency that owns the failure.
Demand Score: 91
Exam Relevance Score: 96
What is a practical first step when a OneLake shortcut stops resolving correctly?
Validate the shortcut target, source permissions, supported source configuration, and whether the referenced path or object still exists.
Shortcuts depend on both the Fabric shortcut definition and the external or internal target it references. If permissions change, the source object is moved, or the path becomes invalid, downstream items can fail even though their transformation logic is unchanged. DP-700 expects candidates to troubleshoot shortcut dependencies before rewriting pipelines or queries.
Demand Score: 89
Exam Relevance Score: 95
How can Lakehouse table performance commonly be improved for analytical workloads?
Optimize the table layout, reduce small-file problems, maintain useful Delta table statistics, and align partitioning or file organization with query patterns.
Lakehouse query performance depends heavily on how data is physically organized and how efficiently the engine can skip irrelevant files. Poor layout, too many small files, and ineffective partitioning can slow reads even when the query is logically correct. Exam questions often ask for optimization actions that target storage layout rather than changing business logic.
Demand Score: 92
Exam Relevance Score: 97
What should be reviewed when a Spark notebook runs slowly in Fabric?
Review Spark configuration, data volume, partitioning, shuffle behavior, joins, file layout, and whether transformations can be simplified or pushed down.
Slow Spark jobs can result from inefficient joins, skewed partitions, excessive shuffle, poor file organization, or unnecessary transformations. Workspace Spark settings may also affect runtime behavior. DP-700 optimization scenarios reward answers that identify execution bottlenecks and data layout issues before increasing capacity or rewriting unrelated components.
Demand Score: 93
Exam Relevance Score: 97
Why should alerts be configured for Fabric ingestion, transformation, or refresh processes?
Alerts notify operators when critical runs fail, exceed expected thresholds, or require action before downstream reporting is affected.
Monitoring is not only retrospective troubleshooting; it also supports operational readiness. Alerts help teams respond to failed ingestion, delayed transformations, semantic model refresh issues, or abnormal runtime behavior. DP-700 includes monitoring because production analytics solutions must be observable and recoverable, not merely functional during development.
Demand Score: 90
Exam Relevance Score: 95