A Learned Performance Model With Transfer Learning Across GPUs on Tensorized Instructions | IEEE Journals & Magazine | IEEE Xplore