Skip to content

HadoopMapReduceCommitProtocol

HadoopMapReduceCommitProtocol is a FileCommitProtocol.

HadoopMapReduceCommitProtocol is a Serializable (Java) (to be sent out in tasks over the wire to executors).

Creating Instance

HadoopMapReduceCommitProtocol takes the following to be created:

  • Job ID
  • Path
  • dynamicPartitionOverwrite flag (default: false)

HadoopMapReduceCommitProtocol is created when:

Logging

Enable ALL logging level for org.apache.spark.internal.io.HadoopMapReduceCommitProtocol logger to see what happens inside.

Add the following line to conf/log4j2.properties:

logger.HadoopMapReduceCommitProtocol.name = org.apache.spark.internal.io.HadoopMapReduceCommitProtocol
logger.HadoopMapReduceCommitProtocol.level = all

Refer to Logging.