HiveTableScanExec Leaf Physical Operator
HiveTableScanExec is a leaf physical operator that represents a HiveTableRelation logical operator at execution time.
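HiveTableScanExec is created when the HiveTableScans.md[HiveTableScans] execution planning strategy plans a HiveTableRelation logical operator (i.e. is executed on a logical query plan with a HiveTableRelation logical operator).
[[nodeName]] HiveTableScanExec uses the HiveTableRelation.md#tableMeta[fully-qualified name of the Hive table] (of the <<relation, HiveTableRelation>>) as its node name: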
Scan hive [table]
Creating Instance
HiveTableScanExec takes the following when created:
- [[requestedAttributes]] Requested attributes
- [[relation]] HiveTableRelation
- [[partitionPruningPred]] Partition pruning predicates
- [[sparkSession]] SparkSession
HiveTableScanExec initializes the <<internal-properties, internal properties>>.
=== [[partition-pruning-predicates]] Partition Pruning Predicates
HiveTableScanExec physical operator supports partition pruning for partitioned Hive tables.
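HiveTableScanExec requires that either the <<partitionPruningPred, partition pruning predicates>> are empty or the <<relation, HiveTableRelation>> is partitioned. Otherwise, HiveTableScanExec throws an IllegalArgumentException.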
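The precondition can be illustrated with a minimal sketch in plain Scala. `TableRelation` and `TableScan` are hypothetical, simplified stand-ins (not Spark's classes), and predicates are modelled as plain strings purely for illustration.

```scala
// Hypothetical stand-in for a Hive table relation; not Spark's HiveTableRelation.
final case class TableRelation(name: String, partitionCols: Seq[String]) {
  // A table counts as partitioned when it declares at least one partition column.
  def isPartitioned: Boolean = partitionCols.nonEmpty
}

// Hypothetical stand-in for the scan operator; predicates are plain strings here.
final case class TableScan(relation: TableRelation, partitionPruningPred: Seq[String]) {
  // Precondition: either there are no pruning predicates or the table is partitioned.
  // A failed require throws an IllegalArgumentException.
  require(
    partitionPruningPred.isEmpty || relation.isPartitioned,
    "Partition pruning predicates only supported for partitioned tables.")
}
```

Under these assumptions, building a `TableScan` over an unpartitioned relation with a non-empty predicate list fails at construction time, while an empty predicate list is accepted for any table.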
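HiveTableScans.md[HiveTableScans] execution planning strategy creates a HiveTableScanExec physical operator for every HiveTableRelation.md[HiveTableRelation] operator in a query plan. When created, HiveTableScanExec is given the <<partitionPruningPred, partition pruning predicates>>, i.e. the filter predicates that reference only the partition columns of the HiveTableRelation.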
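As a rough sketch of how a planning strategy could separate partition pruning predicates from the remaining filters, the plain-Scala example below splits predicates by the columns they reference. `Pred` and `splitPredicates` are hypothetical names introduced here for illustration, and the rule used (non-empty references that are all partition columns) is an assumption rather than a transcription of Spark's code.

```scala
// Hypothetical predicate model: the SQL text plus the column names it references.
final case class Pred(sql: String, references: Set[String])

// Splits predicates into (pruning, other).
// Assumed rule: a predicate qualifies for partition pruning when it references
// at least one column and every referenced column is a partition column.
def splitPredicates(
    predicates: Seq[Pred],
    partitionCols: Set[String]): (Seq[Pred], Seq[Pred]) =
  predicates.partition { p =>
    p.references.nonEmpty && p.references.subsetOf(partitionCols)
  }
```

With partition columns `Set("dt")`, a predicate on `dt` alone ends up in the pruning group, while a predicate that also touches a data column stays with the other filters.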
Performance Metrics
| Key | Name (in web UI) | Description |
|---|---|---|
| numOutputRows | number of output rows | Number of output rows |
=== [[doExecute]] Executing Physical Operator (Generating RDD[InternalRow]) -- doExecute Method
[source, scala]
----
doExecute(): RDD[InternalRow]
----
doExecute is part of the SparkPlan abstraction.
doExecute...FIXME
=== [[internal-properties]] Internal Properties
[cols="30m,70",options="header",width="100%"]
|===
| Name
| Description
| boundPruningPred a| [[boundPruningPred]] Catalyst ../expressions/Expression.md[expression] for the <<partitionPruningPred, partition pruning predicates>>
| hiveQlTable a| [[hiveQlTable]] Hive {url-hive-javadoc}/org/apache/hadoop/hive/ql/metadata/Table.html[Table] metadata (HiveClientImpl.md#toHiveTable[converted] from the HiveTableRelation.md#tableMeta[CatalogTable] of the <<relation, HiveTableRelation>>)
Used when HiveTableScanExec is requested for the <<tableDesc, tableDesc>> and the <<rawPartitions, rawPartitions>>, and when <<doExecute, executed>>
| hadoopReader a| [[hadoopReader]] HadoopTableReader.md[HadoopTableReader]
| rawPartitions a| [[rawPartitions]] HiveClientImpl.md#toHivePartition[Hive partitions] (Seq[Partition])
Used when HiveTableScanExec physical operator is <<doExecute, executed>> (for a partitioned table)
| tableDesc a| [[tableDesc]] Hive {url-hive-javadoc}/org/apache/hive/hcatalog/templeton/TableDesc.html[TableDesc]
|===
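To make the idea behind boundPruningPred more concrete, here is a minimal plain-Scala sketch of collapsing a sequence of predicates into a single optional conjunction. `Expr`, `Leaf`, `And` and `conjunction` are hypothetical, simplified stand-ins for Catalyst expressions. The sketch covers only the combining step and assumes that an empty predicate list yields no expression at all; binding attribute references to partition columns is left out.

```scala
// Hypothetical, simplified expression tree; not Catalyst's Expression hierarchy.
sealed trait Expr
final case class Leaf(sql: String) extends Expr
final case class And(left: Expr, right: Expr) extends Expr

// Folds the predicates left to right into one conjunction.
// Returns None when there are no predicates, so callers can skip pruning entirely.
def conjunction(preds: Seq[Expr]): Option[Expr] =
  preds.reduceLeftOption[Expr]((left, right) => And(left, right))
```

Returning an `Option` keeps the "no pruning predicates" case explicit instead of inventing an always-true placeholder expression.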