Folding Attention: Memory and Power Optimization for On-Device Transformer-Based Streaming Speech Recognition | IEEE Conference Publication | IEEE Xplore