Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events | IEEE Conference Publication | IEEE Xplore