Lightening the Load: Lightweighting Multimodal Understanding for Visual Grounding Tasks | IEEE Conference Publication