PrunedInMemoryFileIndex¶
:hadoop-version: 2.10.0 :url-hadoop-javadoc: https://hadoop.apache.org/docs/r{hadoop-version}/api
PrunedInMemoryFileIndex
is a InMemoryFileIndex.md[InMemoryFileIndex] for a <
PrunedInMemoryFileIndex
may be given the <
PrunedInMemoryFileIndex
is <CatalogFileIndex
is requested to CatalogFileIndex.md#filterPartitions[filter the partitions of a partitioned table].
[[logging]] [TIP] ==== Enable ALL
logging level for org.apache.spark.sql.execution.datasources.PrunedInMemoryFileIndex
logger to see what happens inside.
Add the following line to conf/log4j2.properties
:
log4j.logger.org.apache.spark.sql.execution.datasources.PrunedInMemoryFileIndex=ALL
Refer to spark-logging.md[Logging].¶
=== [[creating-instance]] Creating PrunedInMemoryFileIndex Instance
PrunedInMemoryFileIndex
takes the following to be created:
- [[sparkSession]] SparkSession.md[SparkSession]
- [[tableBasePath]] Location of the Hive metastore table (as a Hadoop {url-hadoop-javadoc}/org/apache/hadoop/fs/Path.html[Path])
- [[fileStatusCache]]
FileStatusCache
- [[partitionSpec]]
PartitionSpec
(from a Hive metastore) - [[metadataOpsTimeNs]] Optional time of the partition metadata listing