VLSI architecture of a scalable matrix transposer | IEEE Conference Publication | IEEE Xplore