16-bit FP sub-word parallelism to facilitate compiler vectorization and improve performance of image and media processing | IEEE Conference Publication