Efficient Transformer Inference with Statically Structured Sparse Attention | IEEE Conference Publication | IEEE Xplore

Efficient Transformer Inference with Statically Structured Sparse Attention | IEEE Conference Publication | IEEE Xplore