Linearization Weight Compression and In-Situ Hardware-Based Decompression for Attention-Based Neural Machine Translation | IEEE Journals & Magazine | IEEE Xplore