Skip to content

MultiDimClusteringFunctions

MultiDimClusteringFunctions utility offers Spark SQL functions for multi-dimensional clustering.

range_partition_id

range_partition_id(
  col: Column,
  numPartitions: Int): Column

range_partition_id creates a Column (Spark SQL) with RangePartitionId unary expression (for the given arguments).

range_partition_id is used when:

interleave_bits

interleave_bits(
  cols: Column*): Column

interleave_bits creates a Column (Spark SQL) with InterleaveBits expression (for the expressions of the given columns).

interleave_bits is used when: