Efficient Transformer Inference with Statically Structured Sparse Attention | IEEE Conference Publication | IEEE Xplore