[Delta Core 2.4][Spark 3.4] Execute MERGE using Dataframe API in Scala to ensure merge command appears in the Logical Execution Plan and subsequently picked up by QueryExecutionListener #4825

rkrumins · 2025-06-25T09:31:18Z

(cherrypick of #3456) and (cherrypick of #3585)

This change ensures that the MERGE command executed via the Scala API is properly captured in the Logical Execution Plan and recognized by the QueryExecutionListener. While Spark 3.5.X and 4.x support lineage capture from the logical plan, earlier versions (3.1–3.4) do not, necessitating a backward-compatible solution.

This update manually resolves the plan, then executes it via the DataFrame API, allowing the command to flow through Spark’s standard analysis and execution pipeline. As a result, Spark data lineage can be captured by tools such as the Spline Spark Agent.
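One way to check the effect from the user side is to register a `QueryExecutionListener` and run a MERGE through the Scala API. The sketch below is illustrative only and is not the change made in this PR: the listener class `PlanLoggingListener`, the object name, the table path `/tmp/merge-lineage-sketch`, and the sample data are all hypothetical, and it assumes Spark and Delta Lake are on the classpath. With this change applied, the expectation is that the listener receives a callback whose analyzed plan contains the merge command; what is actually printed depends on the Spark and Delta versions in use.

```scala
import io.delta.tables.DeltaTable
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.execution.QueryExecution
import org.apache.spark.sql.util.QueryExecutionListener

// Hypothetical listener: prints the root node of every analyzed plan it receives.
class PlanLoggingListener extends QueryExecutionListener {
  override def onSuccess(funcName: String, qe: QueryExecution, durationNs: Long): Unit =
    println(s"[$funcName] analyzed plan root: ${qe.analyzed.nodeName}")

  override def onFailure(funcName: String, qe: QueryExecution, exception: Exception): Unit =
    println(s"[$funcName] failed: ${exception.getMessage}")
}

object MergeLineageSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("merge-lineage-sketch")
      .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension")
      .config("spark.sql.catalog.spark_catalog",
        "org.apache.spark.sql.delta.catalog.DeltaCatalog")
      .getOrCreate()
    import spark.implicits._

    // Register the listener before running any commands.
    spark.listenerManager.register(new PlanLoggingListener)

    // Hypothetical target table and source data, for illustration only.
    val path = "/tmp/merge-lineage-sketch"
    Seq((1, "a"), (2, "b")).toDF("id", "value")
      .write.format("delta").mode("overwrite").save(path)
    val updates = Seq((2, "b2"), (3, "c")).toDF("id", "value")

    // MERGE via the Scala API; this is the code path the PR changes.
    DeltaTable.forPath(spark, path).as("t")
      .merge(updates.as("s"), "t.id = s.id")
      .whenMatched().updateAll()
      .whenNotMatched().insertAll()
      .execute()

    spark.stop()
  }
}
```

Lineage tools such as the Spline Spark Agent hook into the same listener interface, which is why the merge command has to pass through the regular query execution path to be visible to them.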

Resolves #1521 (original issue). Covered by existing tests.

References:
(Cherrypick of #3456)
(Original issue: #1521)

Update DeltaMergeBuilder.scala to ensure merge command appears in the…
… Logical Execution Plan and subsequently picked up by QueryExecutionListener

4f90d44
