UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training | IEEE Conference Publication | IEEE Xplore