Multimodal Emotion Recognition Based on Deep Temporal Features Using Cross-Modal Transformer and Self-Attention | IEEE Conference Publication