ResultStage — Final Stage in Job

A ResultStage is the final stage in a job. It applies a function to one or more partitions of the target RDD to compute the result of an action.

Figure 1. Job creates ResultStage as the first stage

A ResultStage is given the partitions to compute as a collection of partition ids (partitions) and the function to apply to each of them, func: (TaskContext, Iterator[_]) ⇒ _.

Figure 2. ResultStage and partitions
Read more about TaskContext in the TaskContext section.
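
To make the relationship between partition ids and func more concrete, here is a minimal sketch in plain Scala that does not use Spark at all. ToyTaskContext, PartitionFunc and runOnPartitions are hypothetical names introduced only for illustration; the sketch assumes that the stage produces one result per requested partition by handing func a context and an iterator over that partition's rows.

```scala
object ResultStagePartitionsSketch {
  // Hypothetical stand-in for Spark's TaskContext; it only carries the partition id.
  final case class ToyTaskContext(partitionId: Int)

  // Same shape as the function described above: (TaskContext, Iterator[_]) => _
  type PartitionFunc = (ToyTaskContext, Iterator[_]) => Any

  // Applies func to each requested partition id and collects one result per partition.
  // `data` models an RDD as a vector of partitions (an assumption of this sketch).
  def runOnPartitions(
      data: Vector[Seq[Any]],
      partitions: Seq[Int],
      func: PartitionFunc): Seq[Any] =
    partitions.map(id => func(ToyTaskContext(id), data(id).iterator))

  def main(args: Array[String]): Unit = {
    val data = Vector(Seq(1, 2), Seq(3, 4, 5), Seq(6))
    // Counts the rows of a partition, ignoring the context.
    val countRows: PartitionFunc = (_, rows) => rows.size
    // Only partitions 0 and 2 are computed; partition 1 is skipped.
    println(runOnPartitions(data, Seq(0, 2), countRows))
  }
}
```

The point of the sketch is that the function is applied only to the listed partition ids, which is why an action that needs a subset of partitions does not have to compute all of them.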

func Property

FIXME

setActiveJob Method

FIXME

removeActiveJob Method

FIXME

activeJob Method

activeJob: Option[ActiveJob]

activeJob returns the optional ActiveJob associated with a ResultStage.
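
The following toy model sketches how an optional active job could behave. It is not Spark's code: ToyActiveJob and ToyResultStage are hypothetical, and the bodies of setActiveJob and removeActiveJob are assumptions (store the job, clear the job) that this page does not confirm. Under those assumptions, activeJob is empty before a job is attached and again after it is removed.

```scala
object ActiveJobSketch {
  // Hypothetical stand-in for Spark's ActiveJob; it only carries a job id.
  final case class ToyActiveJob(jobId: Int)

  // Toy stage that tracks its job as an Option, starting out empty.
  final class ToyResultStage {
    private var _activeJob: Option[ToyActiveJob] = None

    // Assumed behaviour: remember the job this stage computes the result for.
    def setActiveJob(job: ToyActiveJob): Unit = {
      _activeJob = Some(job)
    }

    // Assumed behaviour: forget the job, e.g. once it has finished.
    def removeActiveJob(): Unit = {
      _activeJob = None
    }

    def activeJob: Option[ToyActiveJob] = _activeJob
  }

  def main(args: Array[String]): Unit = {
    val stage = new ToyResultStage
    println(stage.activeJob.isDefined) // no job attached yet
    stage.setActiveJob(ToyActiveJob(jobId = 0))
    println(stage.activeJob.isDefined) // a job is attached
    stage.removeActiveJob()
    println(stage.activeJob.isDefined) // the job has been removed again
  }
}
```

Returning an Option forces callers to handle the case where no job is attached, instead of checking for null.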

FIXME When and why would that be None (empty)?