Audio Visual Segmentation through Text Embeddings | IEEE Conference Publication | IEEE Xplore