CoGroupedRDD¶
CoGroupedRDD[K]
is an RDD that cogroups the parent RDDs.
RDD[(K, Array[Iterable[_]])]
For each key k
in parent RDDs, the resulting RDD contains a tuple with the list of values for that key.
Creating Instance¶
CoGroupedRDD
takes the following to be created:
- Key-Value RDDs (
Seq[RDD[_ <: Product2[K, _]]]
) - Partitioner
CoGroupedRDD
is created when:
- RDD.cogroup operator is used