Speaker-Targeted Audio-Visual Speech Recognition Using a Hybrid CTC/Attention Model with Interference Loss | IEEE Conference Publication | IEEE Xplore