ParquetTable¶

ParquetTable uses ParquetScanBuilder for scanning and ParquetWrite for writing.

Creating Instance¶

ParquetTable takes the following to be created:

ParquetTable is created when:

Signature

formatName: String

formatName is part of the FileTable abstraction.

formatName is the following text:

Parquet

Signature

inferSchema(
  files: Seq[FileStatus]): Option[StructType]

inferSchema is part of the FileTable abstraction.

inferSchema infers the schema (with the options and the input Hadoop FileStatuses).

Signature

newScanBuilder(
  options: CaseInsensitiveStringMap): ParquetScanBuilder

newScanBuilder is part of the SupportsRead abstraction.

newScanBuilder creates a ParquetScanBuilder with the following:

Signature

newWriteBuilder(
  info: LogicalWriteInfo): WriteBuilder

newWriteBuilder is part of the SupportsWrite abstraction.

newWriteBuilder creates a WriteBuilder that creates a ParquetWrite (when requested to build a Write).

Signature

supportsDataType(
  dataType: DataType): Boolean

supportsDataType is part of the FileTable abstraction.

supportsDataType supports all AtomicTypes and the following complex DataTypes with AtomicTypes: