ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic | IEEE Conference Publication | IEEE Xplore