Towards Open-Vocabulary Video Instance Segmentation | IEEE Conference Publication | IEEE Xplore