Conferences >2024 IEEE/CVF Conference on C...

EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

We present EfficientViT-SAM, a new family of accelerated segment anything models. We retain SAM’s lightweight prompt encoder and mask decoder while replacing the heavy im...Show More

Metadata

Abstract:

We present EfficientViT-SAM, a new family of accelerated segment anything models. We retain SAM’s lightweight prompt encoder and mask decoder while replacing the heavy image encoder with EfficientViT. For the training, we begin with the knowledge distillation from the SAM-ViT-H image encoder to EfficientViT. Subsequently, we conduct end-to-end training on the SA-1B dataset. Benefiting from EfficientViT’s efficiency and capacity, EfficientViT-SAM delivers 48.9× measured TensorRT speedup on A100 GPU over SAM-ViT-H without sacrificing performance. Our code and pre-trained models are released at https://github.com/mit-han-lab/efficientvit.

Published in: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Date of Conference: 17-18 June 2024

Date Added to IEEE Xplore: 27 September 2024

ISBN Information:

ISSN Information:

DOI: 10.1109/CVPRW63382.2024.00782

Conference Location: Seattle, WA, USA

Related Articles are not available for this document.

Contents

References is not available for this document.

EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Authors

Figures

References

Citations

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?