NewHadoopRDD
== [[NewHadoopRDD]] NewHadoopRDD
NewHadoopRDD is an rdd:index.md[RDD] of K keys and V values.
NewHadoopRDD is created when one of the following is used:

- SparkContext.newAPIHadoopFile
- SparkContext.newAPIHadoopRDD
- (indirectly) SparkContext.binaryFiles
- (indirectly) SparkContext.wholeTextFiles
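
As a minimal sketch of the `SparkContext.newAPIHadoopFile` case, the following reads a small text file through the new Hadoop API (`org.apache.hadoop.mapreduce`). The object name `NewHadoopRDDSketch`, the helper `readLines`, the local-mode master and the temporary file are all assumptions made for illustration; they are not part of Spark.

```scala
import java.nio.charset.StandardCharsets
import java.nio.file.Files

import org.apache.hadoop.io.{LongWritable, Text}
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical demo object; not part of Spark.
object NewHadoopRDDSketch {

  // Reads a text file with the new Hadoop API.
  // With TextInputFormat, keys are byte offsets and values are lines.
  def readLines(sc: SparkContext, path: String): Array[String] = {
    val rdd = sc.newAPIHadoopFile[LongWritable, Text, TextInputFormat](path)
    // Hadoop record readers reuse Writable instances,
    // so convert each value to an immutable String before collecting.
    rdd.map { case (_, line) => line.toString }.collect()
  }

  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setMaster("local[1]") // assumption: local mode, for the demo only
      .setAppName("NewHadoopRDDSketch")
    val sc = new SparkContext(conf)
    try {
      // Throwaway input file so the sketch is self-contained.
      val file = Files.createTempFile("new-hadoop-rdd", ".txt")
      Files.write(file, "hello\nworld\n".getBytes(StandardCharsets.UTF_8))
      readLines(sc, file.toUri.toString).foreach(println)
    } finally {
      sc.stop()
    }
  }
}
```

The `toString` conversion inside `map` matters because the same `Text` object may be reused for every record of a partition, so collecting the raw `Writable` values could yield repeated data.
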
NOTE: NewHadoopRDD is the base RDD of BinaryFileRDD and WholeTextFileRDD.
=== [[getPreferredLocations]] getPreferredLocations Method
CAUTION: FIXME
=== [[creating-instance]] Creating NewHadoopRDD Instance
NewHadoopRDD takes the following when created:
- [[sc]] SparkContext.md[]
- [[inputFormatClass]] Hadoop's InputFormat[K, V]
- [[keyClass]] K class name
- [[valueClass]] V class name
- [[_conf]] transient Hadoop Configuration
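
The arguments above appear to line up with the parameters of `SparkContext.newAPIHadoopRDD`, which takes a Hadoop `Configuration`, an input format class, a key class and a value class. The sketch below is one way to supply them explicitly; `NewHadoopRDDArgsSketch` and `textRdd` are hypothetical names, and using a Hadoop `Job` to set the input path is an assumption about how the configuration is prepared, not a statement about what Spark requires.

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.Path
import org.apache.hadoop.io.{LongWritable, Text}
import org.apache.hadoop.mapreduce.Job
import org.apache.hadoop.mapreduce.lib.input.{FileInputFormat, TextInputFormat}
import org.apache.spark.SparkContext
import org.apache.spark.rdd.RDD

// Hypothetical helper object; not part of Spark.
object NewHadoopRDDArgsSketch {

  def textRdd(sc: SparkContext, path: String): RDD[(LongWritable, Text)] = {
    // Job.getInstance copies the given configuration,
    // so sc.hadoopConfiguration itself is left unchanged.
    val job = Job.getInstance(sc.hadoopConfiguration)
    // Record the input path in the job's configuration.
    FileInputFormat.setInputPaths(job, new Path(path))
    val hadoopConf: Configuration = job.getConfiguration

    sc.newAPIHadoopRDD(
      hadoopConf,              // Hadoop Configuration
      classOf[TextInputFormat], // input format class
      classOf[LongWritable],    // key class (K)
      classOf[Text])            // value class (V)
  }
}
```

Calling `NewHadoopRDDArgsSketch.textRdd(sc, "file:///some/input.txt")` (the path is a placeholder) would give an RDD of offset and line pairs that can be mapped to strings and collected as usual.
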
NewHadoopRDD initializes the <