Improving Cross-Modal Understanding in Visual Dialog Via Contrastive Learning | IEEE Conference Publication | IEEE Xplore