Skip to content

DeltaCDFRelation

DeltaCDFRelation is a BaseRelation (Spark SQL) and a PrunedFilteredScan (Spark SQL).

Creating Instance

DeltaCDFRelation takes the following to be created:

  • SnapshotWithSchemaMode
  • SQLContext (Spark SQL)
  • Optional start version
  • Optional end version

DeltaCDFRelation is created when:

buildScan

PrunedFilteredScan
buildScan(
  requiredColumns: Array[String],
  filters: Array[Filter]): RDD[Row]

buildScan is part of the PrunedFilteredScan (Spark SQL) abstraction.

buildScan changesToBatchDF.

In the end, buildScan selects the given requiredColumns (using Dataset.select operator) and requests it for the RDD[Row].