Audio-Visual Representation Learning For Lip-Sync Estimation Through Ranking Augmented Contrastive Training | IEEE Conference Publication | IEEE Xplore