Latency-Critical Quantized Inference With Transformer Decoders on ARM and RISC-V CPUs | IEEE Journals & Magazine | IEEE Xplore