Vision-Language Transformer and Query Generation for Referring Segmentation | IEEE Conference Publication | IEEE Xplore