Abstract:
Question answering (QA)-based re-ranking methods for cross-modal retrieval have recently been proposed to further narrow down similar candidate images. Conventional QA-based re-ranking methods pose questions to users by analyzing the candidate images, and the initial retrieval results are re-ranked based on the user's feedback. However, focusing only on performance improvement makes it difficult to elicit the user's retrieval intention efficiently. To realize more useful QA-based re-ranking, the user interaction through which that intention is elicited must be considered. In this paper, we propose a QA-based re-ranking method that considers two important factors for eliciting the user's retrieval intention: query-image relevance and recallability. Considering query-image relevance makes it possible to focus only on the candidate images related to the provided query text, while focusing on recallability enables users to easily answer the provided question. With these procedures, our method can efficiently and effectively elicit the user's retrieval intention. Experimental results on Microsoft Common Objects in Context and a computationally constructed dataset that includes similar candidate images show that our method can improve the performance of cross-modal retrieval methods and QA-based re-ranking methods.
Published in: IEEE Open Journal of Signal Processing (Volume: 4)
- IEEE Keywords
- Index Terms
- Cross-modal Retrieval
- Question Answering
- Similar Images
- Usage Intention
- Retrieval Results
- Objects In Context
- Text Query
- Comparative Method
- Hyperparameters
- Number Of Images
- Semantic Information
- Target Image
- Learning-based Methods
- Semantic Segmentation
- Baseline Methods
- Effective Imaging
- Shared Space
- Image Retrieval
- Open Dataset
- Retrieval Performance
- Desirable Image
- Experimental Perspective
- Semantic Labels
- Semantic Segmentation Models
- Dataset Bias
- Median Rank
- Information Ratio
- Mean Rank
- Pearson Correlation