@ CREPE: Can Vision-Language Foundation Models Reason Compositionally? | IEEE Conference Publication | IEEE Xplore