Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech | IEEE Conference Publication | IEEE Xplore