Abstract:
Currently, surveillance cameras are extensively employed in public security, and vehicle reidentification has emerged as a burgeoning research area in computer vision. Ne...Show MoreMetadata
Abstract:
Currently, surveillance cameras are extensively employed in public security, and vehicle reidentification has emerged as a burgeoning research area in computer vision. Nevertheless, vehicle reidentification grapples with the challenges of low intraclass similarity and high interclass similarity. This study tackles these challenges by introducing a novel vehicle reidentification method that integrates convolution and vision transformer features. Specifically, channel-by-channel convolution is incorporated into the feedforward layer to bolster the extraction of local features. Concurrently, the information from the last layer’s class token and other patches is fused to yield a comprehensive and rich featured representation. Experiments conducted on the VeRi776 and VehicleID datasets validate that the proposed method outperforms current state-of-the-art vehicle reidentification methods.
Published in: IEEE MultiMedia ( Volume: 31, Issue: 2, April-June 2024)
Funding Agency:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Vision Transformer ,
- Vehicle Re-identification ,
- High Similarity ,
- Convolutional Neural Network ,
- Local Features ,
- Feature Learning ,
- Global Features ,
- Multilayer Perceptron ,
- Deep Convolutional Neural Network ,
- Rich Features ,
- Intelligent Transportation ,
- Surveillance Cameras ,
- Local Feature Extraction ,
- Use Of Transformation ,
- Global Feature Extraction ,
- Vehicle Images ,
- Computer Vision Area ,
- Loss Function ,
- Spatial Dimensions ,
- Attention Mechanism ,
- Encoder Layer ,
- Linear Projection ,
- Re-identification Task ,
- Adjacent Patches ,
- Baseline Methods ,
- Multilayer Perceptron Layer ,
- Discriminative Features ,
- Convolutional Neural Network Architecture ,
- Input Tokens
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Vision Transformer ,
- Vehicle Re-identification ,
- High Similarity ,
- Convolutional Neural Network ,
- Local Features ,
- Feature Learning ,
- Global Features ,
- Multilayer Perceptron ,
- Deep Convolutional Neural Network ,
- Rich Features ,
- Intelligent Transportation ,
- Surveillance Cameras ,
- Local Feature Extraction ,
- Use Of Transformation ,
- Global Feature Extraction ,
- Vehicle Images ,
- Computer Vision Area ,
- Loss Function ,
- Spatial Dimensions ,
- Attention Mechanism ,
- Encoder Layer ,
- Linear Projection ,
- Re-identification Task ,
- Adjacent Patches ,
- Baseline Methods ,
- Multilayer Perceptron Layer ,
- Discriminative Features ,
- Convolutional Neural Network Architecture ,
- Input Tokens