Multi-modal Emotion Recognition Utilizing Korean-English Vision and Language Information Alignment Pre-trained Model | IEEE Conference Publication | IEEE Xplore