Video content extraction and representation using a joint audio and video processing | IEEE Conference Publication | IEEE Xplore