Bloom Filter Join¶
Bloom Filter Join is an optimization of join queries by pre-filtering one side of a join using BloomFilter based on the values from the other side of the join.
Runtime Filtering
Bloom Filter Join is one of the two optimizations in Runtime Filtering optimization.
Bloom Filter Join uses BloomFilters as runtime filters when spark.sql.optimizer.runtime.bloomFilter.enabled configuration property is enabled.
Bloom Filter Join uses InjectRuntimeFilter logical optimization to inject up to spark.sql.optimizer.runtimeFilter.number.threshold filters (BloomFilters or InSubquerys).
SPARK-32268
Bloom Filter Join was introduced in SPARK-32268.