Integrating vision and speech for conversations with multiple persons | IEEE Conference Publication | IEEE Xplore