RTA: A Reconfigurable Transformer Accelerator Exploiting Sparsity via Low-Bit-Width Prediction | IEEE Journals & Magazine | IEEE Xplore