Gaussian Kernelized Self-Attention for Long Sequence Data and its Application to CTC-Based Speech Recognition | IEEE Conference Publication | IEEE Xplore