Incorporating Lip Features into Audio-Visual Multi-Speaker DOA Estimation by Gated Fusion | IEEE Conference Publication | IEEE Xplore