Spark Configuration Properties of External Shuffle Service¶
The following are configuration properties of External Shuffle Service.
spark.shuffle.service.db.enabled¶
Whether to use db in ExternalShuffleService. Note that this only affects standalone mode.
Default: true
Used when:
ExternalShuffleServiceis requested for an ExternalBlockHandlerWorker(Spark Standalone) is requested to handle aWorkDirCleanupmessage
spark.shuffle.service.enabled¶
Controls whether to use the External Shuffle Service
Default: false
Note
LocalSparkCluster turns this property off explicitly when started.
Used when:
BlacklistTrackeris requested to updateBlacklistForFetchFailureExecutorMonitoris createdExecutorAllocationManageris requested to validateSettingsSparkEnvutility is requested to create a "base" SparkEnvExternalShuffleServiceis created and startedWorker(Spark Standalone) is requested to handle aWorkDirCleanupmessage or startedExecutorRunnable(Spark on YARN) is requested tostartContainer
spark.shuffle.service.fetch.rdd.enabled¶
Enables ExternalShuffleService for fetching disk persisted RDD blocks.
When enabled with Dynamic Resource Allocation executors having only disk persisted blocks are considered idle after spark.dynamicAllocation.executorIdleTimeout and will be released accordingly.
Default: false
Used when:
ExternalShuffleBlockResolveris createdSparkEnvutility is requested to create a "base" SparkEnvExecutorMonitoris created
spark.shuffle.service.port¶
Port of the external shuffle service
Default: 7337
Used when:
ExternalShuffleServiceis createdStorageUtilsutility is requested for the port of an external shuffle service