BatchTableWrite Flow Execution¶
BatchTableWrite is a FlowExecution that writes a batch DataFrame to a Table.
Creating Instance¶
BatchTableWrite takes the following to be created:
-
TableIdentifier - ResolvedFlow
- DataflowGraph
- Table
- PipelineUpdateContext
- Configuration Properties
BatchTableWrite is created when:
FlowPlanneris requested to plan a CompleteFlow
Execute¶
FlowExecution
executeInternal(): Future[Unit]
executeInternal is part of the FlowExecution abstraction.
executeInternal activates the configuration properties in the current SparkSession.
executeInternal requests this PipelineUpdateContext for the FlowProgressEventLogger to recordRunning with this ResolvedFlow.
executeInternal requests this DataflowGraph to re-analyze this ResolvedFlow to get the DataFrame (the logical query plan)
executeInternal executes append batch write asynchronously:
- Creates a DataFrameWriter for the batch query's logical plan (the DataFrame).
- Sets the write format to the format of this Table.
- In the end,
executeInternalappends the rows to this Table (using DataFrameWriter.saveAsTable operator).
isStreaming¶
isStreaming is always disabled (false).