An Ensemble-based Regularization Method for Multi-Head Attention | IEEE Conference Publication | IEEE Xplore