Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality | IEEE Conference Publication | IEEE Xplore