MicroBatchStream¶
MicroBatchStream is an extension of the SparkDataStream abstraction for data streams for Micro-Batch Stream Processing.
Contract¶
Creating PartitionReaderFactory¶
PartitionReaderFactory createReaderFactory()
PartitionReaderFactory (Spark SQL)
Used when:
MicroBatchScanExecphysical operator is requested for a PartitionReaderFactory
Latest Offset¶
Offset latestOffset()
Latest Offset
Used when:
MicroBatchExecutionis requested to constructing or skipping next streaming micro-batch
Input Partitions¶
InputPartition[] planInputPartitions(
Offset start,
Offset end)
InputPartitions (Spark SQL)
Used when:
MicroBatchScanExecphysical operator is requested for input partitions
Implementations¶
AvailableNowMicroBatchStreamWrapper- KafkaMicroBatchStream
- MemoryStream
RatePerMicroBatchStreamRateStreamMicroBatchStreamTextSocketMicroBatchStream