Enhancing Contrastive Learning with Temporal Cognizance for Audio-Visual Representation Generation | IEEE Conference Publication | IEEE Xplore