Caption location and extraction in digital video based on SVM
Zhi-Guo Cheng; Yun-Cai Liu
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Volume 6, Issue , 26-29 Aug. 2004 Page(s): 3515 - 3519 vol.6
Digital Object Identifier
Summary: Text that appears in a scene or graphically added to video can provide an important supplemental source of index information as well as clues for decoding the video's structure and for classification, and we call them closed caption. In this work, a novel algorithm is presented for detecting and locating caption in digital video. The first module of the system divides an image into small blocks featured by pixel value that is fed to SVM (support vector machine) to classify whether they are text blocks or not. The other module is to do post-processing on the classified text blocks to identify the rectangle region of them and OCR can be used further and easily. Experiments conducted with a variety of video sources show that our method could detect and locate caption region successfully by SVM with comparatively less samples.
View citation and abstract |