Using vision, acoustics, and natural language for disambiguation | IEEE Conference Publication | IEEE Xplore