HadoopMapReduceCommitProtocol¶
HadoopMapReduceCommitProtocol is a FileCommitProtocol.
HadoopMapReduceCommitProtocol is a Serializable (Java) (to be sent out in tasks over the wire to executors).
Creating Instance¶
HadoopMapReduceCommitProtocol takes the following to be created:
- Job ID
- Path
-
dynamicPartitionOverwriteflag (default:false)
HadoopMapReduceCommitProtocol is created when:
HadoopWriteConfigUtilis requested to create a committerHadoopMapReduceWriteConfigUtilis requested to create a committerHadoopMapRedWriteConfigUtilis requested to create a committer
Logging¶
Enable ALL logging level for org.apache.spark.internal.io.HadoopMapReduceCommitProtocol logger to see what happens inside.
Add the following line to conf/log4j2.properties:
logger.HadoopMapReduceCommitProtocol.name = org.apache.spark.internal.io.HadoopMapReduceCommitProtocol
logger.HadoopMapReduceCommitProtocol.level = all
Refer to Logging.