Using audio-visual features for robust voice activity detection in clean and noisy speech | IEEE Conference Publication | IEEE Xplore