
ParquetTable

ParquetTable is a FileTable.

Creating Instance

ParquetTable takes the following to be created:

ParquetTable is created when:

  • ParquetDataSourceV2 is requested to getTable
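A minimal sketch of how a ParquetTable can come to be created through ParquetDataSourceV2. It assumes Spark 3.x in local mode and is written in script style (spark-shell or a Scala worksheet). The app name and the temporary path are illustrative. The sketch relies on the internal `spark.sql.sources.useV1SourceList` property, which lists parquet by default. Clearing it is assumed to let the V2 implementation handle reads.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.execution.datasources.v2.DataSourceV2Relation

// Assumption: a local Spark 3.x session; the app name is illustrative.
val spark = SparkSession.builder()
  .master("local[*]")
  .appName("parquet-table-demo")
  // parquet is on the V1 fallback list by default; an empty list lets
  // ParquetDataSourceV2 handle reads of the parquet format.
  .config("spark.sql.sources.useV1SourceList", "")
  .getOrCreate()

// Write a small dataset to a temporary location (illustrative path).
val path = java.nio.file.Files
  .createTempDirectory("parquet-demo")
  .resolve("data")
  .toString
spark.range(5).write.parquet(path)

val df = spark.read.parquet(path)

// Look for a V2 relation in the analyzed plan and report the class of its table.
val tableClass = df.queryExecution.analyzed.collectFirst {
  case r: DataSourceV2Relation => r.table.getClass.getSimpleName
}
println(tableClass)
```

With the default value of the property, the same read is expected to go through the V1 file format instead, so no V2 relation would be found in the plan.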

formatName

formatName: String

formatName is Parquet.

formatName is part of the FileTable abstraction.

inferSchema

inferSchema(
  files: Seq[FileStatus]): Option[StructType]

inferSchema infers the schema from the input Hadoop FileStatuses (using the options of this ParquetTable).

inferSchema is part of the FileTable abstraction.
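A hedged sketch of calling inferSchema directly, assuming Spark 3.x in local mode and script style (spark-shell or a Scala worksheet). ParquetTable is an internal class. The constructor arguments below (name, sparkSession, options, paths, userSpecifiedSchema, fallbackFileFormat) follow the Spark 3.x sources and are an assumption relative to this page. The table name `demo`, the column names and the temporary path are illustrative.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.execution.datasources.parquet.ParquetFileFormat
import org.apache.spark.sql.execution.datasources.v2.parquet.ParquetTable
import org.apache.spark.sql.util.CaseInsensitiveStringMap

val spark = SparkSession.builder()
  .master("local[*]")
  .appName("infer-schema-demo")
  .getOrCreate()

// Write a small two-column dataset to a temporary location (illustrative path).
val path = java.nio.file.Files
  .createTempDirectory("infer-schema-demo")
  .resolve("data")
  .toString
spark.range(3)
  .selectExpr("id", "cast(id as string) as name")
  .write.parquet(path)

// Assumption: constructor arguments as in the Spark 3.x sources.
val table = ParquetTable(
  name = "demo",
  sparkSession = spark,
  options = CaseInsensitiveStringMap.empty(),
  paths = Seq(path),
  userSpecifiedSchema = None,
  fallbackFileFormat = classOf[ParquetFileFormat])

// The file index of the table lists the Hadoop FileStatuses under the paths.
val files = table.fileIndex.allFiles()

// inferSchema gives Some(schema) when a schema could be inferred from the files.
val inferred = table.inferSchema(files)
println(inferred.map(_.treeString))
```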

newScanBuilder

newScanBuilder(
  options: CaseInsensitiveStringMap): ParquetScanBuilder

newScanBuilder creates a ParquetScanBuilder (with the fileIndex, the schema and the dataSchema).

newScanBuilder is part of the FileTable abstraction.
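A hedged sketch of requesting a scan from a ParquetTable, assuming Spark 3.x in local mode and script style (spark-shell or a Scala worksheet). As ParquetTable is internal, the constructor arguments follow the Spark 3.x sources and are an assumption relative to this page. The table name `demo` and the temporary path are illustrative. No filters or column pruning are applied to the builder, so the scan is expected to read the full schema.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.execution.datasources.parquet.ParquetFileFormat
import org.apache.spark.sql.execution.datasources.v2.parquet.ParquetTable
import org.apache.spark.sql.util.CaseInsensitiveStringMap

val spark = SparkSession.builder()
  .master("local[*]")
  .appName("scan-builder-demo")
  .getOrCreate()

// Write a small dataset to a temporary location (illustrative path).
val path = java.nio.file.Files
  .createTempDirectory("scan-builder-demo")
  .resolve("data")
  .toString
spark.range(3).write.parquet(path)

// Assumption: constructor arguments as in the Spark 3.x sources.
val table = ParquetTable(
  name = "demo",
  sparkSession = spark,
  options = CaseInsensitiveStringMap.empty(),
  paths = Seq(path),
  userSpecifiedSchema = None,
  fallbackFileFormat = classOf[ParquetFileFormat])

// newScanBuilder gives a ParquetScanBuilder; build() turns it into a Scan.
val scanBuilder = table.newScanBuilder(CaseInsensitiveStringMap.empty())
val scan = scanBuilder.build()

// readSchema is the schema the scan would produce.
println(scan.readSchema().treeString)
```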

newWriteBuilder

newWriteBuilder(
  info: LogicalWriteInfo): WriteBuilder

newWriteBuilder creates a WriteBuilder whose build, when executed, creates a ParquetWrite.

newWriteBuilder is part of the FileTable abstraction.