Journals & Magazines >IEEE Transactions on Multimedia >Volume: 25

MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Many RGB-T trackers attempt to attain robust feature representation by utilizing an adaptive weighting scheme (or attention mechanism). Different from these works, we pro...Show More

Metadata

Abstract:

Many RGB-T trackers attempt to attain robust feature representation by utilizing an adaptive weighting scheme (or attention mechanism). Different from these works, we propose a new dynamic modality-aware filter generation module (named MFGNet) to boost the message communication between visible and thermal data by adaptively adjusting the convolutional kernels for various input images in practical tracking. Given the image pairs as input, we first encode their features with the backbone network. Then, we concatenate these feature maps and generate dynamic modality-aware filters with two independent networks. The visible and thermal filters will be used to conduct a dynamic convolutional operation on their corresponding input feature maps respectively. Inspired by residual connection, both the generated visible and thermal feature maps will be summarized with input feature maps. The augmented feature maps will be fed into the RoI align module to generate instance-level features for subsequent classification. To address issues caused by heavy occlusion, fast motion and out-of-view, we propose to conduct a joint local and global search by exploiting a new direction-aware target driven attention mechanism. The spatial and temporal recurrent neural network is used to capture the direction-aware context for accurate global attention prediction. Extensive experiments on three large-scale RGB-T tracking benchmark datasets validated the effectiveness of our proposed algorithm.

Published in: IEEE Transactions on Multimedia ( Volume: 25)

Page(s): 4335 - 4348

Date of Publication: 11 May 2022

ISSN Information:

DOI: 10.1109/TMM.2022.3174341

Funding Agency:

Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.

IEEE Keywords
- Target tracking ,
- Tracking ,
- Heuristic algorithms ,
- Visualization ,
- Task analysis ,
- Kernel ,
- Vehicle dynamics
Index Terms
- Dynamic Filter ,
- RGBT Tracking ,
- Dynamic Filter Generation ,
- Contralateral ,
- Neural Network ,
- Input Image ,
- Feature Maps ,
- Feature Representation ,
- Input Features ,
- Local Search ,
- Attention Mechanism ,
- Large-scale Datasets ,
- Convolution Operation ,
- Image Pairs ,
- Dynamic Performance ,
- Dynamic Mode ,
- Backbone Network ,
- Global Search ,
- Global Attention ,
- Residual Connection ,
- Spatial Attention ,
- Robust Tracking ,
- Tracking Results ,
- Attention Map ,
- Feature Tracking ,
- Channel Attention ,
- Target Object ,
- Track Model ,
- Bounding Box ,
- Dynamic Network
Author Keywords

Contents

I. Introduction

Object tracking is a popular research topic in computer vision that aims to locate a determined object (initialized in the first frame) in each video frame. It has been widely used in many applications, such as intelligent surveillance, automatic driving, and unmanned aerial vehicles. Although it has already achieved great success in recent years with robust target representation brought by deep neural network [1]–[15], these trackers still suffer from challenging factors, e.g., illumination, scale variation, and fast motion.

Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.

IEEE Keywords
- Target tracking ,
- Tracking ,
- Heuristic algorithms ,
- Visualization ,
- Task analysis ,
- Kernel ,
- Vehicle dynamics
Index Terms
- Dynamic Filter ,
- RGBT Tracking ,
- Dynamic Filter Generation ,
- Contralateral ,
- Neural Network ,
- Input Image ,
- Feature Maps ,
- Feature Representation ,
- Input Features ,
- Local Search ,
- Attention Mechanism ,
- Large-scale Datasets ,
- Convolution Operation ,
- Image Pairs ,
- Dynamic Performance ,
- Dynamic Mode ,
- Backbone Network ,
- Global Search ,
- Global Attention ,
- Residual Connection ,
- Spatial Attention ,
- Robust Tracking ,
- Tracking Results ,
- Attention Map ,
- Feature Tracking ,
- Channel Attention ,
- Target Object ,
- Track Model ,
- Bounding Box ,
- Dynamic Network
Author Keywords

References is not available for this document.

MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

Authors

Figures

References

Citations

Keywords

Metrics

Footnotes

References

IEEE Account

Purchase Details

Profile Information

Need Help?