How Much Self-Attention Do We Need? Trading Attention for Feed-Forward Layers | IEEE Conference Publication | IEEE Xplore