Loading [a11y]/accessibility-menu.js
Vision Transformer (ViT)-based Applications in Image Classification | IEEE Conference Publication | IEEE Xplore

Vision Transformer (ViT)-based Applications in Image Classification


Abstract:

In recent years, the ViT model has been widely used in the field of computer vision, especially for image classification tasks. This paper summarizes the application of V...Show More

Abstract:

In recent years, the ViT model has been widely used in the field of computer vision, especially for image classification tasks. This paper summarizes the application of ViT in image classification tasks, first introduces the image classification imple- mentation process and the basic architecture of the ViT model, then analyzes and summarizes the image classification methods, including traditional image classification methods, CNN-based image classification methods, and ViT-based image classification methods, and provides a comparative analysis of CNN and ViT. Subsequently, this paper outlines the application prospects of ViT in image classification and its future development and also outlines some shortcomings of ViT and its solutions.
Date of Conference: 06-08 May 2023
Date Added to IEEE Xplore: 26 May 2023
ISBN Information:
Conference Location: New York, NY, USA

References

References is not available for this document.