Trust Your Partner’s Friends: Hierarchical Cross-Modal Contrastive Pre-Training for Video-Text Retrieval | IEEE Conference Publication | IEEE Xplore