Context-dependent audio-visual and temporal features fusion for TV commercial detection | IEEE Conference Publication | IEEE Xplore