Multimodal Speech Emotion Recognition Using Modality-Specific Self-Supervised Frameworks | IEEE Conference Publication