Image Captioning Using Vision Transformer Encoder Decoder Model | IEEE Conference Publication | IEEE Xplore