Eyes and Ears: Automated Annotation of Audio Data Using Computer Vision | IEEE Conference Publication | IEEE Xplore