Abstract:
Text-based person retrieval is the process of searching a massive visual resource library for images of a particular pedestrian, based on a textual query. Existing approa...Show MoreMetadata
Abstract:
Text-based person retrieval is the process of searching a massive visual resource library for images of a particular pedestrian, based on a textual query. Existing approaches often suffer from a problem of color (CLR) over-reliance, which can result in a suboptimal person retrieval performance by distracting the model from other important visual cues such as texture and structure information. To handle this problem, we propose a novel framework to Excavate All-round Information Beyond Color for the task of text-based person retrieval, which is therefore termed EAIBC. The EAIBC architecture includes four branches, namely an RGB branch, a grayscale (GRS) branch, a high-frequency (HFQ) branch, and a CLR branch. Furthermore, we introduce a mutual learning (ML) mechanism to facilitate communication and learning among the branches, enabling them to take full advantage of all-round information in an effective and balanced manner. We evaluate the proposed method on three benchmark datasets, including CUHK-PEDES, ICFG-PEDES, and RSTPReid. The experimental results demonstrate that EAIBC significantly outperforms existing methods and achieves state-of-the-art (SOTA) performance in supervised, weakly supervised, and cross-domain settings.
Published in: IEEE Transactions on Neural Networks and Learning Systems ( Volume: 36, Issue: 3, March 2025)
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Person Retrieval ,
- Structural Information ,
- Important Cues ,
- Suboptimal Performance ,
- Mutual Learning ,
- Retrieval Performance ,
- Balanced Manner ,
- Massive Resources ,
- Text Query ,
- Visualization Library ,
- Semantic ,
- Feature Maps ,
- Visual Representation ,
- Visual Features ,
- RGB Images ,
- Attention Module ,
- Visual Modality ,
- Textual Descriptions ,
- Word Embedding ,
- Feature Matching ,
- Text Representation ,
- Person Image ,
- Red Border ,
- Text Modality ,
- Re-identification ,
- Global Representation ,
- Single Branch ,
- RGB Data ,
- Local Parts ,
- Complementary Effects
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Person Retrieval ,
- Structural Information ,
- Important Cues ,
- Suboptimal Performance ,
- Mutual Learning ,
- Retrieval Performance ,
- Balanced Manner ,
- Massive Resources ,
- Text Query ,
- Visualization Library ,
- Semantic ,
- Feature Maps ,
- Visual Representation ,
- Visual Features ,
- RGB Images ,
- Attention Module ,
- Visual Modality ,
- Textual Descriptions ,
- Word Embedding ,
- Feature Matching ,
- Text Representation ,
- Person Image ,
- Red Border ,
- Text Modality ,
- Re-identification ,
- Global Representation ,
- Single Branch ,
- RGB Data ,
- Local Parts ,
- Complementary Effects
- Author Keywords