Skip to content

Configuration Properties

The following contains the configuration properties of EventLoggingListener and HistoryServer.

spark.eventLog

buffer.kb

spark.eventLog.buffer.kb

Size of the buffer to use when writing to output streams

Default: 100k

compress

spark.eventLog.compress

Enables event compression (using a CompressionCodec)

Default: false

compression.codec

spark.eventLog.compression.codec

The codec used to compress event log (with spark.eventLog.compress enabled). By default, Spark provides four codecs: lz4, lzf, snappy, and zstd. You can also use fully qualified class names to specify the codec.

Default: zstd

dir

spark.eventLog.dir

Directory where Spark events are logged to (e.g. hdfs://namenode:8021/directory)

Default: /tmp/spark-events

The directory must exist before SparkContext can be created

enabled

spark.eventLog.enabled

Enables persisting Spark events

Default: false

erasureCoding.enabled

spark.eventLog.erasureCoding.enabled

Default: false

gcMetrics.youngGenerationGarbageCollectors

spark.eventLog.gcMetrics.youngGenerationGarbageCollectors

Names of supported young generation garbage collectors. A name usually is the output of GarbageCollectorMXBean.getName.

Default: Copy, PS Scavenge, ParNew, G1 Young Generation (the built-in young generation garbage collectors)

gcMetrics.oldGenerationGarbageCollectors

spark.eventLog.gcMetrics.oldGenerationGarbageCollectors

Names of supported old generation garbage collectors. A name usually is the output of GarbageCollectorMXBean.getName.

Default: MarkSweepCompact, PS MarkSweep, ConcurrentMarkSweep, G1 Old Generation (the built-in old generation garbage collectors)

logBlockUpdates.enabled

spark.eventLog.logBlockUpdates.enabled

Enables log RDD block updates using EventLoggingListener

Default: false

logStageExecutorMetrics

spark.eventLog.logStageExecutorMetrics

Enables logging of per-stage peaks of executor metrics (for each executor) to the event log

Default: false

longForm.enabled

spark.eventLog.longForm.enabled

Default: false

overwrite

spark.eventLog.overwrite

Enables deleting (or at least overwriting) an existing .inprogress event log files

Default: false

rolling.enabled

spark.eventLog.rolling.enabled

Enables rolling over event log files. When enabled, cuts down each event log file to spark.eventLog.rolling.maxFileSize

Default: false

rolling.maxFileSize

spark.eventLog.rolling.maxFileSize

Max size of event log file to be rolled over (with spark.eventLog.rolling.enabled enabled)

Default: 128m

Must be at least 10 MiB

spark.history

fs.logDirectory

spark.history.fs.logDirectory

The directory for event log files. The directory has to exist before starting History Server.

Default: file:/tmp/spark-events

kerberos.enabled

spark.history.kerberos.enabled

Whether to enable (true) or disable (false) security when working with HDFS with security enabled (Kerberos).

Default: false

kerberos.keytab

spark.history.kerberos.keytab

Keytab to use for login to Kerberos. Required when spark.history.kerberos.enabled is enabled.

Default: (empty)

kerberos.principal

spark.history.kerberos.principal

Kerberos principal. Required when spark.history.kerberos.enabled is enabled.

Default: (empty)

provider

spark.history.provider

Fully-qualified class name of an ApplicationHistoryProvider for HistoryServer.

Default: org.apache.spark.deploy.history.FsHistoryProvider

retainedApplications

spark.history.retainedApplications

How many Spark applications HistoryServer should retain

Default: 50

store.path

spark.history.store.path

Local directory where to cache application history information (by )

Default: (undefined) (i.e. all history information will be kept in memory)

ui.maxApplications

spark.history.ui.maxApplications

How many Spark applications HistoryServer should show in the UI

Default: (unbounded)

ui.port

spark.history.ui.port

The port of History Server's web UI.

Default: 18080