Bloom Filter Join¶
Bloom Filter Join is an optimization of join queries by pre-filtering one side of a join using BloomFilter based on the values from the other side of the join.
Bloom Filter Join is one of the two optimizations in Runtime Filtering optimization.
Bloom Filter Join uses InjectRuntimeFilter logical optimization to inject up to spark.sql.optimizer.runtimeFilter.number.threshold filters (BloomFilters or InSubquerys).
Bloom Filter Join was introduced in SPARK-32268.