Audio-Visual Speech Recognition with a Hybrid CTC/Attention Architecture | IEEE Conference Publication | IEEE Xplore