Journals & Magazines >IEEE Transactions on Pattern ... >Volume: 46 Issue: 12

TCFormer: Visual Recognition via Token Clustering Transformer

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Transformers are widely used in computer vision areas and have achieved remarkable success. Most state-of-the-art approaches split images into regular grids and represent...Show More

Metadata

Abstract:

Transformers are widely used in computer vision areas and have achieved remarkable success. Most state-of-the-art approaches split images into regular grids and represent each grid region with a vision token. However, fixed token distribution disregards the semantic meaning of different image regions, resulting in sub-optimal performance. To address this issue, we propose the Token Clustering Transformer (TCFormer), which generates dynamic vision tokens based on semantic meaning. Our dynamic tokens possess two crucial characteristics: (1) Representing image regions with similar semantic meanings using the same vision token, even if those regions are not adjacent, and (2) concentrating on regions with valuable details and represent them using fine tokens. Through extensive experimentation across various applications, including image classification, human pose estimation, semantic segmentation, and object detection, we demonstrate the effectiveness of our TCFormer.

Published in: IEEE Transactions on Pattern Analysis and Machine Intelligence ( Volume: 46, Issue: 12, December 2024)

Page(s): 9521 - 9535

Date of Publication: 11 July 2024

ISSN Information:

PubMed ID: 38990751

DOI: 10.1109/TPAMI.2024.3425768

Funding Agency:

Contents

References is not available for this document.

TCFormer: Visual Recognition via Token Clustering Transformer

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

TCFormer: Visual Recognition via Token Clustering Transformer

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

Authors

Figures

References

Citations

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?