Improving Video Saliency Detection via Localized Estimation and Spatiotemporal Refinement | IEEE Journals & Magazine | IEEE Xplore

Improving Video Saliency Detection via Localized Estimation and Spatiotemporal Refinement


Abstract:

Video saliency detection aims to pop out the most salient regions in every frame of a video. Up to now, many efforts have been made from various aspects for video salienc...Show More

Abstract:

Video saliency detection aims to pop out the most salient regions in every frame of a video. Up to now, many efforts have been made from various aspects for video saliency detection. Unfortunately, the existing video saliency models are very likely to fail in challenging videos with complicated motions and complex scenes. Therefore, in this paper, we propose a novel framework to improve the saliency detection results generated by existing video saliency models. The proposed framework consists of three key steps including localized estimation, spatiotemporal refinement, and saliency update. Specifically, the initial saliency map of each frame in a video is first generated by using an existing saliency model. Then, by considering the temporal consistency and strong correlation among adjacent frames, the localized estimation models, which are generated by training the random forest regressor within a local temporal window, are employed to generate the temporary saliency map. Finally, by taking the appearance and motion information of salient objects into consideration, the spatiotemporal refinement step is deployed to further improve the temporary saliency map and generate the final saliency map. Furthermore, such an improved saliency map is then utilized to update the initial saliency map and provide reliable cues for saliency detection in the next frame. The experimental results on four challenging datasets demonstrate that the proposed framework is able to consistently and significantly improve the saliency detection performance of various video saliency models, thereby achieving the state-of-the-art performance.
Published in: IEEE Transactions on Multimedia ( Volume: 20, Issue: 11, November 2018)
Page(s): 2993 - 3007
Date of Publication: 24 April 2018

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.