Swin Transformer-based Image Captioning with Feature Enhancement and Multi-stage Fusion | IEEE Conference Publication | IEEE Xplore