VirTex: Learning Visual Representations from Textual Annotations | IEEE Conference Publication | IEEE Xplore