Multi-speaker Direction of Arrival Estimation Using Audio and Visual Modalities with Convolutional Neural Network | IEEE Conference Publication | IEEE Xplore