I See What You Hear: A Vision-Inspired Method to Localize Words | IEEE Conference Publication | IEEE Xplore