Image Captioning Based on Convolutional Neural Network and Transformer | VDE Conference Publication | IEEE Xplore