Processor architecture driven algorithm optimization for fast 2D-DCT | IEEE Conference Publication | IEEE Xplore