Learning Visual Grounding from Generative Vision and Language Model | IEEE Conference Publication | IEEE Xplore