FashionVLM - Fashion Captioning Using Pretrained Vision Transformer and Large Language Model | IEEE Conference Publication | IEEE Xplore