End-to-end Multimodal Speech Recognition | IEEE Conference Publication | IEEE Xplore