Bangla Image Caption Generation Using Vision Transformer (ViT) Based Model | IEEE Conference Publication | IEEE Xplore