Contextual and Cross-Modal Interaction for Multi-Modal Speech Emotion Recognition | IEEE Journals & Magazine | IEEE Xplore