DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment | IEEE Conference Publication | IEEE Xplore