Audio-visual speech recognition incorporating facial depth information captured by the Kinect | IEEE Conference Publication | IEEE Xplore