GPU H.264 motion estimation with contiguous diagonal parallelization and fusion of macroblock processing | IEEE Conference Publication | IEEE Xplore