Vision Transformer based UNet with Multi-Head Attention for Medical Image Segmentation | IEEE Conference Publication | IEEE Xplore