Abstract:
Image aesthetic assessment is an important issue in multimedia, but most existing studies employ supervised learning methods that rely on large-scale annotated data. Howe...Show MoreMetadata
Abstract:
Image aesthetic assessment is an important issue in multimedia, but most existing studies employ supervised learning methods that rely on large-scale annotated data. However, aesthetic scoring annotations are difficult to obtain in large quantities. Therefore, this paper explores zero-shot image aesthetic assessment. We predict aesthetic scores by introducing knowledge of different attributes (e.g., Focus). First, we use prompt tuning to obtain a unique prompt for each aesthetic attribute as external knowledge. Second, we leverage image relations considering sentiment polarity as internal knowledge. Specifically, we obtain aesthetic attribute representations from pre-trained models via prompt learning, then select anchor images on specific attributes by sentiment polarity, computing aesthetic scores. Notably, annotated aesthetic scores are not used in the process. Experiments show that our zero-shot approach outperforms many comparisons using only a few anchor images.
Published in: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Date of Conference: 14-19 April 2024
Date Added to IEEE Xplore: 18 March 2024
ISBN Information:
ISSN Information:
University of International Business and Economics
University of International Business and Economics
University of International Business and Economics
University of International Business and Economics
University of International Business and Economics
University of International Business and Economics
University of International Business and Economics
University of International Business and Economics