A Unimodal Representation Learning and Recurrent Decomposition Fusion Structure for Utterance-Level Multimodal Embedding Learning | IEEE Journals & Magazine | IEEE Xplore