Improving Transformer Inference Through Optimized Nonlinear Operations With Quantization-Approximation-Based Strategy | IEEE Journals & Magazine | IEEE Xplore