CoGroupedRDD¶
CoGroupedRDD[K] is an RDD that cogroups the parent RDDs.
RDD[(K, Array[Iterable[_]])]
For each key k in parent RDDs, the resulting RDD contains a tuple with the list of values for that key.
Creating Instance¶
CoGroupedRDD takes the following to be created:
- Key-Value RDDs (
Seq[RDD[_ <: Product2[K, _]]]) - Partitioner
CoGroupedRDD is created when:
- RDD.cogroup operator is used