Few-Shot Multimodal Learning for Social Relation Understanding With Supervised Bi-Transformers | IEEE Conference Publication | IEEE Xplore