Vision Language Transformation for Medical Image Captioning: Comparison of Four Pretrained CNN Networks | IEEE Conference Publication | IEEE Xplore