Speech Emotion Recognition via Swin-Transformer and Cross-Attention Fusion Model | IEEE Conference Publication | IEEE Xplore