BatchScanExec Physical Operator¶
BatchScanExec
is a leaf physical operator.
Creating Instance¶
BatchScanExec
takes the following to be created:
- Output Schema (
AttributeReference
s) - Scan
BatchScanExec
is created when:
- DataSourceV2Strategy execution planning strategy is executed (for physical operators with DataSourceV2ScanRelation relations)
Batch¶
batch: Batch
batch
requests the Scan to toBatch.
batch
is used when:
BatchScanExec
is requested for partitions and readerFactory
Input RDD¶
inputRDD: RDD[InternalRow]
inputRDD
is part of the DataSourceV2ScanExecBase abstraction.
inputRDD
creates a DataSourceRDD.
InputPartitions¶
partitions: Seq[InputPartition]
partitions
is part of the DataSourceV2ScanExecBase abstraction.
partitions
...FIXME
PartitionReaderFactory¶
readerFactory: PartitionReaderFactory
readerFactory
is part of the DataSourceV2ScanExecBase abstraction.
readerFactory
requests the Batch to createReaderFactory.