In this paper, we propose a novel prototype for automated music video generation using Web image resources. In this prototype, the salient words and phrases of a song's lyrics are first automatically extracted and then used as queries to retrieve related high-quality images from Web search engines. To ensure coherence between the visual content of the chosen images and the song, the returned images are further re-ranked and filtered based on content characteristics such as color, faces, and landscape, as well as the song's mood type. Finally, the selected images are concatenated to generate a music video using the Photo2Video technique, based on the rhythm information of the music. Preliminary evaluations of the proposed prototype have shown promising results.
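The re-ranking and rhythm-alignment stages described above can be illustrated with a minimal sketch. It is not the authors' implementation and does not reproduce Photo2Video. The `Candidate` type, the feature names, the mood weights, and the `rerank` and `schedule` functions are all hypothetical, and the feature scores are assumed to come from some upstream image analysis that is not shown.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    url: str        # where the image was found (hypothetical)
    query: str      # lyric word or phrase that retrieved it
    features: dict  # assumed content scores in [0, 1], e.g. {"color": 0.8}


def rerank(candidates, mood_weights, keep=3, min_score=0.0):
    """Score each candidate by a mood-weighted sum of its features,
    drop those below min_score, and return the top `keep` candidates."""
    scored = []
    for c in candidates:
        score = sum(w * c.features.get(name, 0.0)
                    for name, w in mood_weights.items())
        if score >= min_score:
            scored.append((score, c))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [c for _, c in scored[:keep]]


def schedule(images, beat_times, beats_per_image=4):
    """Give each image a (start, end) interval whose boundaries fall on beats."""
    cuts = beat_times[::beats_per_image]
    return [(img.url, start, end)
            for img, start, end in zip(images, cuts, cuts[1:])]


# Assumed weights for one mood type: favor colorful images, penalize faces.
mood_weights = {"color": 1.0, "face": -0.5}
candidates = [
    Candidate("a.jpg", "ocean", {"color": 0.9, "face": 0.0}),
    Candidate("b.jpg", "ocean", {"color": 0.5, "face": 1.0}),
    Candidate("c.jpg", "sunset", {"color": 0.7, "face": 0.2}),
]
selected = rerank(candidates, mood_weights, keep=2, min_score=0.1)
beats = [i * 0.5 for i in range(9)]  # assumed beat times in seconds
timeline = schedule(selected, beats)
```

In this sketch the song's mood only changes the weights applied to the image features, and shot boundaries are placed on every fourth beat. Both choices are assumptions made for illustration.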
Published in: Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on (Volume: 2)
Date of Conference: 15-20 April 2007