Baton uses different blockiness algorithms for H264 and non-H264 formats, so the blockiness levels reported for the two videos may genuinely differ. Blockiness is a compression artifact, and its severity depends on the video format, bit rate, and similar factors. We suggest creating two separate test plans, one for H264 formats and one for non-H264 formats, and setting a different blockiness threshold in each.
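
If you route files to test plans from a script, the two-plan approach could be sketched roughly as below. This is only an illustration and does not use any Baton API: the plan names, the threshold values, the codec aliases, and the assumption that a higher number means more blockiness are all hypothetical, so please substitute the values from your own test plans.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlanConfig:
    """A test plan name paired with its blockiness threshold (both hypothetical)."""
    name: str
    blockiness_threshold: float


# Hypothetical plan names and thresholds, for illustration only.
# These are not Baton defaults; use the values from your own test plans.
H264_PLAN = PlanConfig("h264_plan", blockiness_threshold=30.0)
NON_H264_PLAN = PlanConfig("non_h264_plan", blockiness_threshold=20.0)

# Assumed set of codec labels that should be treated as H264.
H264_ALIASES = {"h264", "h.264", "avc", "avc1"}


def select_test_plan(codec: str) -> PlanConfig:
    """Return the H264 plan for H264 codec labels, otherwise the non-H264 plan."""
    normalized = codec.strip().lower()
    return H264_PLAN if normalized in H264_ALIASES else NON_H264_PLAN


def exceeds_threshold(codec: str, measured_blockiness: float) -> bool:
    """Compare a measured level against the threshold of the matching plan.

    Assumes a higher value means more visible blockiness.
    """
    return measured_blockiness > select_test_plan(codec).blockiness_threshold
```

With these placeholder values, the same measured level of 25.0 would pass under the H264 plan and fail under the non-H264 plan, which is why a single shared threshold tends to give inconsistent results across formats.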