Cross-Modal Fusion Techniques for Utterance-Level Emotion Recognition from Text and Speech | IEEE Conference Publication | IEEE Xplore