Learning to Fuse Latent Representations for Multimodal Data | IEEE Conference Publication | IEEE Xplore