Fusing Information Streams in End-to-End Audio-Visual Speech Recognition | IEEE Conference Publication | IEEE Xplore