SerializeFromObjectExec Unary Physical Operator¶
SerializeFromObjectExec is a unary physical operator that supports Java code generation.
SerializeFromObjectExec supports Java code generation with the <
SerializeFromObjectExec is a <
SerializeFromObjectExec is <
[[inputRDDs]] [[outputPartitioning]] SerializeFromObjectExec uses the <
[[output]] SerializeFromObjectExec uses the <
=== [[creating-instance]] Creating SerializeFromObjectExec Instance
SerializeFromObjectExec takes the following when created:
- [[serializer]] Serializer (as
Seq[NamedExpression]) - [[child]] Child <
> (that supports Java code generation)
=== [[doExecute]] Executing Physical Operator (Generating RDD[InternalRow]) -- doExecute Method
[source, scala]¶
doExecute(): RDD[InternalRow]¶
doExecute is part of the SparkPlan abstraction.
doExecute requests the <RDD[InternalRow]) and transforms it by executing the following function on internal rows per partition with index (using RDD.mapPartitionsWithIndexInternal that creates another RDD):
. Creates an UnsafeProjection for the <
. Requests the UnsafeProjection to initialize (for the partition index)
. Executes the UnsafeProjection on all internal binary rows in the partition
NOTE: doExecute (by RDD.mapPartitionsWithIndexInternal) adds a new MapPartitionsRDD to the RDD lineage. Use RDD.toDebugString to see the additional MapPartitionsRDD.