Active and unsupervised learning for spoken word acquisition through a multimodal interface | IEEE Conference Publication | IEEE Xplore