VGDIFFZERO: Text-To-Image Diffusion Models Can Be Zero-Shot Visual Grounders | IEEE Conference Publication | IEEE Xplore