Learning Bimodal Structure in Audio–Visual Data | IEEE Journals & Magazine | IEEE Xplore