Robust Audio-visual Speech Recognition Using Bimodal Dfsmn with Multi-condition Training and Dropout Regularization | IEEE Conference Publication | IEEE Xplore