Joint audio-video object localization and tracking | IEEE Journals & Magazine | IEEE Xplore