Self-Supervised Contrastive Learning for Audio-Visual Action Recognition | IEEE Conference Publication | IEEE Xplore