CAFF-DINO: Multi-spectral object detection transformers with cross-attention features fusion

Abstract:

Object detection in images can benefit from coupling multiple spectra, each of which contributes specific useful features. However, building an efficient architecture that couples the different modalities is a complex task. Transformers, thanks to their ability to extract meaningful correlations between different regions of their inputs, appear to be a promising way to fuse features across spectra. This work presents a multi-spectral object detection architecture based on cross-attention features fusion (CAFF), combined with a transformer-based detector (DINO). We demonstrate the performance of the proposed approach in object detection, compared with state-of-the-art approaches, on infrared-visible multi-spectral datasets. We also study its robustness to systematic misalignment between image pairs. The proposed approach is generic and can be applied to any mono-spectrum transformer-based detector. The model developed in this study will be available in a dedicated GitHub repository.
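
The abstract does not detail the CAFF module itself, so the following is only a minimal sketch of the general idea of cross-attention fusion between two spectra, assuming standard scaled dot-product attention. The function names (`cross_attention`, `fuse`), the random projection matrices, the residual connection and the final concatenation are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(query_feats, context_feats, w_q, w_k, w_v):
    """Tokens of one spectrum (queries) attend to tokens of the other
    spectrum (keys and values). Generic scaled dot-product attention."""
    q = query_feats @ w_q                     # (n_query, d)
    k = context_feats @ w_k                   # (n_context, d)
    v = context_feats @ w_v                   # (n_context, d)
    scores = q @ k.T / np.sqrt(q.shape[-1])   # (n_query, n_context)
    weights = softmax(scores, axis=-1)        # each row sums to 1
    return weights @ v, weights

def fuse(visible, infrared, params):
    # Hypothetical symmetric fusion: each stream is enriched with
    # information from the other one (residual connection), then the
    # two enriched streams are concatenated along the channel axis.
    vis_from_ir, _ = cross_attention(visible, infrared, *params["vis"])
    ir_from_vis, _ = cross_attention(infrared, visible, *params["ir"])
    return np.concatenate([visible + vis_from_ir, infrared + ir_from_vis], axis=-1)

# Toy inputs standing in for backbone feature tokens (assumed shapes).
rng = np.random.default_rng(0)
n_tokens, dim = 6, 8
visible = rng.normal(size=(n_tokens, dim))
infrared = rng.normal(size=(n_tokens, dim))
params = {
    name: [rng.normal(size=(dim, dim)) * 0.1 for _ in range(3)]
    for name in ("vis", "ir")
}

fused = fuse(visible, infrared, params)
print(fused.shape)  # (6, 16)
```

In a detector of this kind, fused tokens such as `fused` would then be passed to the mono-spectrum transformer detector in place of single-spectrum features. Because attention weights are computed between every pair of token positions rather than only between co-located pixels, this style of fusion does not require the two images to be perfectly registered, which is consistent with the interest in misalignment robustness stated above.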

Published in: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Date of Conference: 17-18 June 2024

Date Added to IEEE Xplore: 27 September 2024

DOI: 10.1109/CVPRW63382.2024.00309

Conference Location: Seattle, WA, USA
